Esempi in Python per State.get_state

Linguaggio di programmazione: Python

Spazio dei nomi/nome del pacchetto: game

Classe/tipologia: State

Metodo/funzione: get_state

Esempi su hotexamples.com: 3

State.get_state in Python: 3 esempi trovati. Questi sono i migliori esempi reali in Python per game.State.get_state, estratti da progetti open source. Li puoi valutare, per aiutarci a migliorare la qualità dei nostri esempi.

Metodi utilizzati di frequente

Mostra Nascondi

State(30)

is_done(30)

next(30)

legal_actions(24)

is_first_player(20)

is_lose(14)

position_to_action(4)

next_state(3)

get_state(3)

pieces_array(3)

get_available_actions(3)

terminal(3)

tick_time(3)

total_time(3)

opponent(2)

put_obstacles(2)

update(2)

make_move(2)

whiteDisplay(2)

generateActionFromString(2)

blackDisplay(2)

get_state_result(2)

get_next_state(2)

gameOver(2)

leader(1)

board(1)

copy(1)

step(1)

sid(1)

show_board(1)

get_cell(1)

put_center(1)

apply_moves(1)

players(1)

player_name(1)

play_move(1)

get_legal_moves(1)

piece_count(1)

initialise(1)

is_player_won(1)

legal_move(1)

Esempio n. 1

Mostra file

File: q2.py Progetto: axelahmer/easy21

def sarsa(lamb: int, num_episodes: int, Qstar, record=False):
    Q = state_action_map(plus=True)
    N = state_action_map()
    N_s = state_map(plus=True)
    mses = []
    for k in range(num_episodes):
        E = state_action_map()
        s = State(deal=True)
        a = get_e_greedy_action(Q, N_s, s)
        while not s.terminal():
            N_s[s.get_state()] += 1
            N[s.get_state(), a] += 1
            s_dash, r = step(s, a)
            a_dash = get_e_greedy_action(Q, N_s, s_dash)
            delta = r + Q[s_dash.get_state(), a_dash] - Q[s.get_state(), a]
            E[s.get_state(), a] += 1

            for d in DEALER_RANGE:
                for p in PLAYER_RANGE:
                    for action in ACTIONS:
                        Q[(d, p),
                          action] += (1 /
                                      (N[(d, p), action] + 1e-9)) * delta * E[
                                          (d, p), action]
                        E[(d, p), action] *= lamb
            s = s_dash
            a = a_dash
        if record:
            mses.append(calc_mse(Q, Qstar))
    return Q, mses

Esempio n. 2

Mostra file

File: utils.py Progetto: axelahmer/easy21

def sample_episode(pi):
    history = []
    s = State(deal=True)

    while not s.terminal():
        a = pi[s.get_state()]
        # rewards do not need to be appended to history as rewards are only *rewarded* when entering the terminal state.
        history.append([s.get_state(), a])
        s, r = step(s, a)

    return history, r

Esempio n. 3

Mostra file

File: utils.py Progetto: axelahmer/easy21

def get_e_greedy_action(Q: dict, N: dict, state: State):
    epsilon = 100 / (100 + N[state.get_state()])
    chosen_action = None
    if np.random.uniform() > epsilon:
        max_q = -1e9
        for a in ACTIONS:
            q = Q[state.get_state(), a]
            if q > max_q:
                max_q = q
                chosen_action = a
    else:
        chosen_action = random.choice(ACTIONS)
    return chosen_action