Python Agent.get_multiplier_last_action 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: agents

클래스/타입: Agent

메소드/함수: get_multiplier_last_action

hotexamples.com에서의 예제들: 1

Python Agent.get_multiplier_last_action - 1개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 agents.Agent.get_multiplier_last_action에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

Agent(30)

run_episode(4)

act(4)

__init__(3)

name(3)

get_move(3)

eval(3)

mark(2)

move(2)

get_action(2)

from_conf(2)

reset(2)

learn(2)

test(2)

build_trajectories(2)

ships(1)

log_activity_active(1)

log_activity_idle(1)

update_t_pref(1)

update_belief(1)

train_model(1)

train(1)

on(1)

parameters(1)

ppo_update(1)

precepts(1)

symbol(1)

set_train_mode(1)

stop(1)

preference_position(1)

step(1)

program(1)

state(1)

load_curve_certificate(1)

start(1)

sample_duration_current_state(1)

save(1)

set_current_activity_end(1)

reset_graph_info(1)

input_vector(1)

load(1)

choose_state(1)

ac_model(1)

add_actuator(1)

add_event(1)

add_id(1)

add_sensor(1)

append_sample(1)

bombs_left(1)

예제 #1

파일 보기

파일: spades.py 프로젝트: IanMcLaughlin19/SpadesAI

 def reward_function(self, agent: Agent):
     max_score = 0
     for player in self.players:
         score = self.scores[player.index]
         if score > max_score:
             max_score = score
     if self.terminal_test():
         if self.scores[agent.index] == max_score:
             reward = 100
         else:
             reward = -150
     else:
         multiplier = agent.get_multiplier_last_action()
         player_score_intial = agent.last_score
         current_score = self.scores[agent.index]
         player_won_turn = current_score > player_score_intial
         if player_won_turn:
             reward = 5 * multiplier
         else:
             reward = -10 * multiplier
     return reward