# TODO: CHOICE TO ϕ DICTIONARY???

# Init the Agent's environment
env = Environment()

# Init the expert agent and feed it the expert trajectories
a = Agent(type='expert',
          action_list=['a', 'b', 'c', 'd', 'e', 'f', 'g'],
          environment=env,
          trajectories=[['a', 'b', 'c', 'e', 'g', 'b', 'c', 'e', 'g', 'c'],
                        ['a', 'b', 'c', 'a', 'g', 'g', 'a', 'g', 'g', 'c'],
                        ['c', 'd', 'f', 'b', 'c', 'a', 'd', 'f', 'b', 'c']])

# Build said expert trajectories
a.build_trajectories()

# Build the Agent's initial state distribution
a.build_D()

# Init a standalone environment for the simulation itself
simul_env = Environment()

# Init the simulation
sim = Simulation(agents=a, environment=simul_env, alpha=1)

# Initialize Q(s, a): this builds a matrix of state-action pairs and their
# values (currently all initialized to 0), covering every state the expert
# agent has visited
sim.reset_q(trajectories=sim.agents['expert'].state_trajectories)
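
# For reference, a minimal sketch of the kind of table reset_q presumably
# builds. Assumption: Q is stored as a nested dict keyed by visited state,
# then by action, with every value initialized to 0.0 -- init_q_table is a
# hypothetical helper for illustration, not the library's actual internals.
def init_q_table(state_trajectories, action_list):
    """Initialize Q(s, a) = 0 for every state the expert has visited."""
    q = {}
    for trajectory in state_trajectories:
        for state in trajectory:
            # Only add a row the first time a state is seen
            q.setdefault(state, {action: 0.0 for action in action_list})
    return q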
# Feed it the expert trajectories
a = Agent(type='expert', action_list=e_action_list, environment=e_env, trajectories=e_trajs)

# a.environment._reset(action_list=a.action_list, attribute_based=True)
#
# a.environment._update_state(action='1', attribute_based=True)
#
# print(e_action_map['1'])
# print(a.environment.current_state)

# Build said expert trajectories
a.build_trajectories(attribute_based=True)

# Build the Agent's initial state distribution
a.build_D()

# Building the feature expectations for the expert

# Init a standalone environment for the simulation itself
simul_env = Environment(attribute_mapping=e_action_map)

# Init the simulation
sim = Simulation(agents=a, environment=simul_env, alpha=.2)

# Compute the feature expectations of the expert
mu_e = sim.μ_estimate(trajectories=sim.agents['expert'].trajectories, gamma=0.99)
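
# For reference, a minimal sketch of the discounted feature-expectation
# estimate from Abbeel & Ng (2004): mu_E = (1/m) * sum_i sum_t gamma^t * phi(s_t).
# Assumptions: phi maps a state to a feature vector, and estimate_mu is a
# hypothetical stand-in -- not necessarily how Simulation.μ_estimate is
# implemented internally.
import numpy as np

def estimate_mu(trajectories, phi, gamma=0.99):
    """Average the discounted sums of feature vectors over m expert trajectories."""
    mu = np.zeros_like(phi(trajectories[0][0]), dtype=float)
    for trajectory in trajectories:
        for t, state in enumerate(trajectory):
            # Each visit contributes its feature vector, discounted by time step
            mu += (gamma ** t) * np.asarray(phi(state), dtype=float)
    return mu / len(trajectories)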