def learn(self):
    """
    Performs Q learning based on observations, but also performs learning on
    states in adjacent time periods, albeit with a slower learning rate.
    For example, traffic at 4:30PM on an edge will be somewhat similar to
    traffic at 5:00PM, so an observation at 4:30 can also be used to update
    the estimate for 5:00.
    """
    # Normal Q learning on the observed states at the full learning rate.
    self.alpha = const.ALPHA
    Q.learn(self)

    # Re-use every observation for the same node in the neighbouring time
    # periods: decode the composite state index into (node, period), then
    # re-encode it with the period shifted forward and backward by one,
    # wrapping around the end of the day.
    for seq in self.dataset:
        self.dataset2.newSequence()
        for state, action, reward in seq:
            period = state % const.PERIODS
            node = int(state / const.PERIODS)
            self.dataset2.addSample(
                node * const.PERIODS + (period + 1) % const.PERIODS, action, reward)
            self.dataset2.addSample(
                node * const.PERIODS + (period - 1) % const.PERIODS, action, reward)

    # Swap in the adjacent-period samples, learn at the reduced rate,
    # then restore the original dataset.
    temp = self.dataset
    self.dataset = self.dataset2
    self.alpha = const.ALPHA_ADJ_PERIOD
    Q.learn(self)
    self.dataset = temp
    self.dataset2.clear()
    #GPSLearner().learn()
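# Illustrative sketch (not part of the original classes): how the composite
# state index used in learn() maps to (node, period) and to the adjacent
# periods it also updates. Assumes the same encoding as above,
# state = node * PERIODS + period; PERIODS is set to 48 half-hour slots
# here purely for illustration.
PERIODS = 48

def adjacent_states(state, periods=PERIODS):
    """Return the state indices for the same node in the previous and
    next time periods, wrapping around the end of the day."""
    node, period = divmod(state, periods)
    prev = node * periods + (period - 1) % periods
    nxt = node * periods + (period + 1) % periods
    return prev, nxt

# Example: node 3 at period 33 (4:30PM if period 0 is midnight) also
# informs periods 32 (4:00PM) and 34 (5:00PM) for the same node.
assert adjacent_states(3 * PERIODS + 33) == (3 * PERIODS + 32, 3 * PERIODS + 34)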
def __init__(self):
    Q.__init__(self, const.ALPHA, const.GAMMA)
    self.explorer = FeasibleEpsilonGreedyExplorer(const.EPSILON, const.DECAY)
    # Secondary dataset that holds the samples projected onto adjacent
    # time periods; it is swapped in during learn().
    self.dataset2 = ReinforcementDataSet(1, 1)
def __init__(self):
    Q.__init__(self, const.ALPHA, const.GAMMA)
    self.explorer = FeasibleEpsilonGreedyExplorer(const.EPSILON, const.DECAY)