def from_values(cls, values: dict):
    """Build a deterministic policy that is greedy with respect to *values*.

    Every state with ``current_sum < 12`` maps to ``Action.HIT``
    unconditionally; for all other states the action with the strictly
    higher action-value is chosen, with ties resolved to ``Action.HIT``.

    :param values: mapping from ``StateActionPair`` to an action value.
    :return: a ``Policy`` built via ``Policy.from_deterministic_mapping``.
    """
    def _greedy_action(state):
        # Forced HIT region — no value comparison is performed here.
        if state.current_sum < 12:
            return Action.HIT
        stick_value = values[StateActionPair(state, Action.STICK)]
        hit_value = values[StateActionPair(state, Action.HIT)]
        # Strict '>' means a tie falls through to HIT, as in the original.
        return Action.STICK if stick_value > hit_value else Action.HIT

    mapping = {state: _greedy_action(state) for state in State.get_all_states()}
    return Policy.from_deterministic_mapping(mapping)
def epsilon_greedy_from_values(cls, values: dict, exploring_prob: Callable):
    """Build an epsilon-greedy probabilistic policy from action values.

    For each state the greedy action (the one with the strictly higher
    Q-value; HIT on ties) receives probability ``1 - eps`` and the other
    action receives ``eps``, where ``eps = exploring_prob()``. The pair is
    ordered ``[P(STICK), P(HIT)]``, matching the original mapping layout.

    Fix: ``exploring_prob()`` is now called exactly once per state instead
    of twice per pair, so the two probabilities always sum to 1 even when
    the callable is stateful (e.g. a decaying epsilon schedule).

    :param values: mapping from ``StateActionPair`` to an action value.
    :param exploring_prob: zero-argument callable returning the current
        exploration probability epsilon.
    :return: a ``Policy`` built via ``Policy.from_probabilistic_mapping``.
    """
    mapping = dict()
    for s in State.get_all_states():
        # Single call keeps the probability pair internally consistent.
        eps = exploring_prob()
        stick_is_greedy = (
            values[StateActionPair(s, Action.STICK)]
            > values[StateActionPair(s, Action.HIT)]
        )
        if stick_is_greedy:
            mapping[s] = [1. - eps, eps]
        else:
            mapping[s] = [eps, 1. - eps]
    return Policy.from_probabilistic_mapping(mapping)
from itertools import product

from model.actions import Action
from model.policy import Policy
from model.state import State, StateActionPair

# Precomputed enumerations shared by all algorithms: every state and every
# (state, action) combination.
ALL_STATES = State.get_all_states()
ALL_STATE_ACTION_PAIRS = [
    StateActionPair(s, a) for s, a in product(ALL_STATES, list(Action))
]


class Algorithm:
    """Base class for value-based learning algorithms.

    Holds the action-value table ``self._Q`` (one entry per
    state-action pair, initialised to 0.0) and defines the interface
    (``policy``, ``train``) that concrete algorithms must implement.
    """

    @classmethod
    def _create_sap_unif_mapping(cls, value):
        """Return a dict mapping every state-action pair to *value*."""
        return {sap: value for sap in ALL_STATE_ACTION_PAIRS}

    @property
    def policy(self) -> Policy:
        """Policy derived from the current estimates; subclasses override."""
        # Fix: 'raise NotImplemented' raised the NotImplemented singleton,
        # which is not an exception (it triggers a confusing TypeError).
        raise NotImplementedError

    def __init__(self):
        # Action-value estimates Q(s, a), uniformly initialised to 0.0.
        self._Q = Algorithm._create_sap_unif_mapping(0.)

    def train(self, rounds: int) -> None:
        """Run *rounds* training iterations; subclasses override."""
        raise NotImplementedError


class MonteCarloAlgorithm(Algorithm):
    def __init__(self):
        super().__init__()