Python Policy.B Examples

Programming Language: Python

Namespace/Package Name: policy

Class/Type: Policy

Method/Function: B

Examples at hotexamples.com: 1

Python Policy.B - 1 example found. This is the top rated real world Python example of policy.Policy.B extracted from open source projects.

Example #1

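from multiprocessing import Pool

import gym
import numpy as np

from policy import Policy  # module name taken from the package listed above

# evaluate_policy_single and vis_policy are helpers from the source project;
# their definitions are not part of this excerpt.

game = '2h2o-v0'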
gen = 1
data = np.load('./champions/' + game + '/' + game + '_' + str(gen) + '.npz')
cpus = 4

env = gym.make(game)
s0 = env.reset()
shape = s0.shape[0]
num_actions = env.action_space.shape[0]
a_bound = [env.action_space.low, env.action_space.high]

hidden_units = data['h']

champion = Policy(shape, hidden_units, num_actions, a_bound, game)
champion.W = data['w']
champion.B = data['b']

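# Evaluate five copies of the champion in parallel; the context manager
# shuts the worker pool down once the scores are collected.
champions = [champion] * 5
with Pool(processes=cpus) as pool:
    scores = pool.map(evaluate_policy_single, champions)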
scores = np.array(scores)
score = np.mean(scores)  # average over all runs (the original passed the undefined name `score`)

print('Champion Average Score = ' + str(score))

vis_policy(champion)
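
The example above restores a trained policy by assigning stored arrays to `champion.W` and `champion.B`. The `Policy` class itself is not shown, so the following is only a minimal sketch of how such a class might use those attributes. It assumes a small feed-forward network where `W` is a list of weight matrices and `B` a list of bias vectors; the class name `SketchPolicy`, the `act` method, the `seed` parameter, and the tanh activations are all hypothetical and not taken from the source project.

```python
import numpy as np


class SketchPolicy:
    """Hypothetical stand-in for policy.Policy; the real class is not shown."""

    def __init__(self, obs_dim, hidden_units, num_actions, a_bound, seed=0):
        rng = np.random.default_rng(seed)
        # Assumption: W holds one weight matrix per layer, B one bias vector per layer.
        self.W = [rng.normal(size=(obs_dim, hidden_units)),
                  rng.normal(size=(hidden_units, num_actions))]
        self.B = [np.zeros(hidden_units), np.zeros(num_actions)]
        self.low = np.asarray(a_bound[0], dtype=float)
        self.high = np.asarray(a_bound[1], dtype=float)

    def act(self, obs):
        """Map an observation to an action inside [low, high]."""
        hidden = np.tanh(obs @ self.W[0] + self.B[0])
        out = np.tanh(hidden @ self.W[1] + self.B[1])  # squashed to [-1, 1]
        return self.low + (out + 1.0) * 0.5 * (self.high - self.low)


bounds = [[-1.0, -1.0], [1.0, 1.0]]

# A policy standing in for the stored champion parameters.
trained = SketchPolicy(obs_dim=3, hidden_units=8, num_actions=2,
                       a_bound=bounds, seed=1)
trained.B = [np.full(8, 0.1), np.full(2, -0.2)]

# A fresh policy with different random weights, overwritten the same way
# the example assigns champion.W and champion.B.
champion = SketchPolicy(obs_dim=3, hidden_units=8, num_actions=2,
                        a_bound=bounds, seed=2)
champion.W = trained.W
champion.B = trained.B

obs = np.array([0.5, -0.3, 0.1])
action = champion.act(obs)
print(action)
```

Under these assumptions, once `W` and `B` are overwritten the fresh policy produces the same actions as the one the parameters came from, which is the behaviour the champion-loading code relies on.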