Python RLLogger 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: neorl

클래스/타입: RLLogger

hotexamples.com에서의 예제들: 6

Python RLLogger - 6개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 neorl.RLLogger에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

RLLogger(6)

자주 사용되는 메소드들

RLLogger (6)

예제 #1

파일 보기

파일: test_acer.py 프로젝트: mradaideh/neorl

def test_acer():
    def Sphere(individual):
            """Sphere test objective function.
                    F(x) = sum_{i=1}^d xi^2
                    d=1,2,3,...
                    Range: [-100,100]
                    Minima: 0
            """
            return sum(x**2 for x in individual)
    
    nx=5
    bounds={}
    for i in range(1,nx+1):
            bounds['x'+str(i)]=['int', -100, 100]
    
    #create an enviroment class
    env=CreateEnvironment(method='acer', fit=Sphere, 
                          bounds=bounds, mode='min', episode_length=50)
    #create a callback function to log data
    cb=RLLogger(check_freq=1, mode='min')
    #create an acer object based on the env object
    acer = ACER(MlpPolicy, env=env, n_steps=25, q_coef=0.55, ent_coef=0.02)
    #optimise the enviroment class
    acer.learn(total_timesteps=2000, callback=cb)
    #print the best results
    print('--------------- ACER results ---------------')
    print('The best value of x found:', cb.xbest)
    print('The best value of y found:', cb.rbest)
    
    return

예제 #2

파일 보기

파일: test_acer.py 프로젝트: XuboGU/neorl

def test_acer():
    #create an object from the class
    env = IntegerSphere()
    #create a callback function to log data
    cb = RLLogger(check_freq=1)
    #create an acer object based on the env object
    acer = ACER(MlpPolicy, env=env, n_steps=25, q_coef=0.55, ent_coef=0.02)
    #optimise the enviroment class
    acer.learn(total_timesteps=2000, callback=cb)
    #print the best results
    print('--------------- ACER results ---------------')
    print('The best value of x found:', cb.xbest)
    print('The best value of y found:', cb.rbest)

    return

예제 #3

파일 보기

def test_ppo():
    #create an object from the class
    env = Sphere()
    #create a callback function to log data
    cb = RLLogger(check_freq=1)
    #create an a2c object based on the env object
    ppo = PPO2(MlpPolicy, env=env, n_steps=12)
    #optimise the enviroment class
    ppo.learn(total_timesteps=2000, callback=cb)
    #print the best results
    print('--------------- PPO results ---------------')
    print('The best value of x found:', cb.xbest)
    print('The best value of y found:', cb.rbest)

    return

예제 #4

파일 보기

def test_dqn():
    #create an object from the class
    env = IntegerSphere()
    #create a callback function to log data
    cb = RLLogger(check_freq=1)
    #create an a2c object based on the env object
    dqn = DQN(DQNPolicy, env=env)
    #optimise the enviroment class
    dqn.learn(total_timesteps=2000, callback=cb)
    #print the best results
    print('--------------- DQN results ---------------')
    print('The best value of x found:', cb.xbest)
    print('The best value of y found:', cb.rbest)

    return

예제 #5

파일 보기

                    F(x) = sum_{i=1}^d xi^2
                    d=1,2,3,...
                    Range: [-100,100]
                    Minima: 0
            """
        #-1 is used to convert minimization to maximization
        return -sum(x**2 for x in individual)

    def reset(self):
        self.done = False
        return self.action_space.sample()

    def render(self, mode='human'):
        pass


#--------------------------------------------------------
# RL Optimisation
#--------------------------------------------------------
#create an object from the class
env = Sphere()
#create a callback function to log data
cb = RLLogger(check_freq=1)
#create an acktr object based on the env object
acktr = ACKTR(MlpPolicy, env=env, n_steps=12)
#optimise the enviroment class
acktr.learn(total_timesteps=2500, callback=cb)
#print the best results
print('--------------- ACKTR results ---------------')
print('The best value of x found:', cb.xbest)
print('The best value of y found:', cb.rbest)

예제 #6

파일 보기

파일: ex_acktr.py 프로젝트: mradaideh/neorl

def Sphere(individual):
    """Sphere test objective function.
                F(x) = sum_{i=1}^d xi^2
                d=1,2,3,...
                Range: [-100,100]
                Minima: 0
        """
    return sum(x**2 for x in individual)


nx = 5
bounds = {}
for i in range(1, nx + 1):
    bounds['x' + str(i)] = ['float', -10, 10]

#create an enviroment class
env = CreateEnvironment(method='acktr',
                        fit=Sphere,
                        bounds=bounds,
                        mode='min',
                        episode_length=50)
#create a callback function to log data
cb = RLLogger(check_freq=1, mode='min')
#create an acktr object based on the env object
acktr = ACKTR(MlpPolicy, env=env, n_steps=12, seed=1)
#optimise the enviroment class
acktr.learn(total_timesteps=2000, callback=cb)
#print the best results
print('--------------- ACKTR results ---------------')
print('The best value of x found:', cb.xbest)
print('The best value of y found:', cb.rbest)