Python TabQLearningAgent Examples

Programming language: Python

Namespace/package name: agents

Class/type: TabQLearningAgent

Examples on hotexamples.com: 7

Python TabQLearningAgent - 7 examples found. These are the top-rated real-world Python examples of agents.TabQLearningAgent, extracted from open source projects.

Frequently used methods

TabQLearningAgent (7)

epsilon (2)

alpha (1)
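The examples on this page only set `epsilon` and `alpha` on the agent; the class itself is not shown. As a rough guide to what those two attributes usually mean in tabular Q-learning, here is a minimal sketch. The class name `SketchTabQAgent`, its constructor arguments, the `gamma` discount factor, and the `act`/`learn` method names are all assumptions made for illustration and are not the API of `agents.TabQLearningAgent`.

```python
import random
from collections import defaultdict


class SketchTabQAgent:
    """Hypothetical stand-in; not the project's agents.TabQLearningAgent."""

    def __init__(self, n_actions, alpha=0.1, gamma=0.9, epsilon=0.1):
        self.n_actions = n_actions
        self.alpha = alpha      # learning rate
        self.gamma = gamma      # discount factor (assumed; not set in the examples)
        self.epsilon = epsilon  # probability of taking a random action
        self.q = defaultdict(lambda: [0.0] * n_actions)  # state -> action values

    def act(self, state):
        # random.random() returns a value in [0.0, 1.0), so a negative
        # epsilon (such as -1) makes this branch unreachable: purely greedy.
        if random.random() < self.epsilon:
            return random.randrange(self.n_actions)
        values = self.q[state]
        return values.index(max(values))

    def learn(self, state, action, reward, next_state, done):
        # One-step Q-learning target: reward plus discounted best next value.
        target = reward if done else reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])


agent = SketchTabQAgent(n_actions=2, alpha=0.5)
agent.learn(state="s0", action=1, reward=1.0, next_state="s1", done=True)
agent.epsilon = -1  # evaluation mode: never explore
greedy_action = agent.act("s0")
```

Under these assumptions, setting `epsilon` to a negative value after training, as several examples below do, would make the agent always pick its highest-valued action.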

Example #1

import tensorflow as tf

from agents import DoubleDeepQLearningAgent, TabQLearningAgent
from environments.battle_royale import BattleRoyalGameWorldTerminal, BattleRoyale

if __name__ == "__main__":
    tf.compat.v1.disable_eager_execution()

    # Note: `i < 7` is always true for range(2), so both slots hold a
    # DoubleDeepQLearningAgent and the TabQLearningAgent branch is never taken.
    list_agent = [
        DoubleDeepQLearningAgent(action_space_size=48) if i < 7 else TabQLearningAgent()
        for i in range(2)
    ]

    # Play 100 games in the terminal world before the final game.
    for _ in range(100):
        gs = BattleRoyalGameWorldTerminal(0, numberofPlayer=2, list_agent=list_agent)
        gs.run()

    # A negative epsilon presumably turns off random exploration.
    list_agent[0].epsilon = -1
    list_agent[1].epsilon = -1
    gs2 = BattleRoyale(numberofPlayer=2, list_agent=list_agent)
    gs2.run()

Example #2

File: experiment_battle_royale_Terminal_with_Tabular_Q_learning_agent.py Project: Citaman/2020_5A_IABD_DRL_Gym

from agents import RandomAgent, TabQLearningAgent
from environments.battle_royale import BattleRoyalGameWorldTerminal, BattleRoyale

if __name__ == "__main__":
    # Note: `i < 7` is always true for range(6), so all six slots hold a
    # TabQLearningAgent and RandomAgent is never instantiated.
    list_agent = [
        TabQLearningAgent() if i < 7 else RandomAgent() for i in range(6)
    ]

    # Play 1000 games in the terminal world; the game index is the first argument.
    for i in range(1000):
        gs = BattleRoyalGameWorldTerminal(i, list_agent=list_agent)
        gs.run()

    # list_agent[0].epsilon = -1
    # list_agent[1].epsilon = -1
    gs2 = BattleRoyale(list_agent=list_agent)
    gs2.run()

Example #3

from agents import TabQLearningAgent
from environments import GridWorldGameState
from runners import run_for_n_games_and_print_stats, run_step

if __name__ == "__main__":
    gs = GridWorldGameState()
    agent = TabQLearningAgent()

    # Play 500 batches of 100 games each.
    for _ in range(500):
        run_for_n_games_and_print_stats([agent], gs, 100)

    # A negative epsilon presumably disables random exploration for the final batch.
    agent.epsilon = -1.0
    run_for_n_games_and_print_stats([agent], gs, 100)

    # Step through one game on a copy of the state, printing it after every move.
    gs = gs.clone()
    while not gs.is_game_over():
        run_step([agent], gs)
        print(gs)

Example #4

import tensorflow as tf

from agents import DoubleDeepQLearningExprerienceReplayAgent, TabQLearningAgent
from environments.battle_royale import BattleRoyalGameWorldTerminal, BattleRoyale

if __name__ == "__main__":
    tf.compat.v1.disable_eager_execution()

    # Note: `i < 7` is always true for range(2), so both slots hold a
    # DoubleDeepQLearningExprerienceReplayAgent and TabQLearningAgent is never instantiated.
    list_agent = [
        DoubleDeepQLearningExprerienceReplayAgent(
            action_space_size=48) if i < 7 else TabQLearningAgent()
        for i in range(2)
    ]

    # Play 100 games in the terminal world; the game index is the first argument.
    for i in range(100):
        gs = BattleRoyalGameWorldTerminal(i,
                                          numberofPlayer=2,
                                          list_agent=list_agent)
        gs.run()

    # list_agent[0].epsilon = -1
    # list_agent[1].epsilon = -1
    gs2 = BattleRoyale(numberofPlayer=2, list_agent=list_agent)
    gs2.run()

Example #5

File: experiment_tictactoe_with_tab_Q_learning_agent_training_and_command_line_play.py Project: LorgneSchilooch/5IABD-ML

from agents import TabQLearningAgent, CommandLineAgent
from environments.tictactoe import TicTacToeGameState
from runners import run_for_n_games_and_print_stats, run_step

if __name__ == "__main__":
    gs = TicTacToeGameState()
    agent0 = TabQLearningAgent()
    agent1 = TabQLearningAgent()
    agent0.alpha = 0.1
    agent0.epsilon = 0.005
    agent1.alpha = 0.1
    agent1.epsilon = 0.005

    # Self-play: 100 batches of 5000 games between the two tabular agents.
    for _ in range(100):
        run_for_n_games_and_print_stats([agent0, agent1], gs, 5000)

    # A negative epsilon presumably disables random exploration for the final batch.
    agent0.epsilon = -1.0
    agent1.epsilon = -1.0
    run_for_n_games_and_print_stats([agent0, agent1], gs, 100)

    # Command-line player in the second seat against agent0.
    gs_clone = gs.clone()
    while not gs_clone.is_game_over():
        run_step([agent0, CommandLineAgent()], gs_clone)
        print(gs_clone)

    # Command-line player in the first seat against agent1.
    gs_clone = gs.clone()
    while not gs_clone.is_game_over():
        run_step([CommandLineAgent(), agent1], gs_clone)
        print(gs_clone)

Example #6

File: experiment_battle_royale_with_tabular_Q_learning_agent.py Project: Citaman/2020_5A_IABD_DRL_Gym

from agents import TabQLearningAgent
from environments.battle_royale import BattleRoyale

if __name__ == "__main__":
    # Six independent tabular Q-learning agents, one per player slot.
    list_agent = [TabQLearningAgent() for _ in range(6)]
    gs = BattleRoyale(list_agent=list_agent)
    gs.run()

Example #7

File: battle_royale_commande_line.py Project: Citaman/2020_5A_IABD_DRL_Gym

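from agents import DeepQLearningAgent, TabQLearningAgent
from environments.battle_royale import BattleRoyalGameWorldTerminal


def run_BattleRoyal(i):
    # Slots 0-2 are deep Q-learning agents, slots 3-5 are tabular; `i` is only passed to the world.
    list_agent = [DeepQLearningAgent(action_space_size=48) if slot < 3 else TabQLearningAgent() for slot in range(6)]
    terminal_world = BattleRoyalGameWorldTerminal(i, list_agent=list_agent)
    return terminal_world.run()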
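
In `run_BattleRoyal` the agent type is chosen by position in the list, while the argument `i` is only handed to the world's constructor. The sketch below, which assumes Python 3 and uses string labels as hypothetical stand-ins for the agent classes, shows that a comprehension variable reusing a parameter's name does not read or overwrite that parameter.

```python
def build_labels(i):
    # The comprehension variable `i` lives in its own scope in Python 3,
    # so it neither reads nor overwrites the function argument `i`.
    labels = ["deep" if i < 3 else "tab" for i in range(6)]
    return i, labels


game_index, labels = build_labels(42)
print(game_index)  # 42
print(labels)      # ['deep', 'deep', 'deep', 'tab', 'tab', 'tab']
```

Using a distinct name for the comprehension variable avoids the ambiguity for readers even though the behavior is the same.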