Python plot 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: easy21_TDLearning

메소드/함수: plot

hotexamples.com에서의 예제들: 2

Python plot - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 easy21_TDLearning.plot에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: Depricated.py 프로젝트: cjmcmurtrie/Easy22-and-Reinforcement-Learning

                if action==1: hiteligible = [sum(x) for x in zip(hiteligible, features)]
                else: stickeligible = [sum(x) for x in zip(stickeligible, features)]

                state, reward = step(state, action)
                features = linear(state)

                hitdelta = reward - sum([x[0]*x[1] for x in zip(features, hitparam)])
                stickdelta = reward - sum([x[0]*x[1] for x in zip(features, stickparam)])

                if action==1:
                    actionvalue = update(actionvalue, features, action, hitparam)
                    hitdelta += actionvalue[(tuple(features), 1)]
                else:
                    actionvalue = update(actionvalue, features, action, stickparam)
                    stickdelta += actionvalue[(tuple(features), 0)]

                hitparam = [sum(x) for x in zip(hitparam, [a * hitdelta * h for h in hiteligible])]
                stickparam = [sum(x) for x in zip(stickparam, [a * stickdelta * s for s in stickeligible])]

                hiteligible = [lamBda * h for h in hiteligible]
                stickeligible = [lamBda * s for s in stickeligible]

                action = greedy(features, actionvalue, e)

        if lamBda in (0.0, 1.0):
            mses += [(game, mse(MCactionvalue, actionvalue))]
            plot(mses, 'Game', 'Mean square error', 'Lambda ' + str(lamBda))

        meansquarerror.append((lamBda, mse(MCactionvalue, actionvalue)))
    plot(meansquarerror, 'Lambda', 'Mean square error', 'MSE: Lambda 0.0-1.0')

예제 #2

파일 보기

파일: easy21_LinearApproximation.py 프로젝트: cjmcmurtrie/Easy22-and-Reinforcement-Learning

                mses += [(game, mse(MCactionvalue, actionvalue))]

            Z = [0.0] * (3 * 6 * 2)
            state = State()
            action = greedysoft(state, actionvalue, w, e, 1)
            features = linear(state, action)

            while state.gameover == 0:

                # Z = features; traces = 'Replaced traces'
                Z = [sum(x) for x in zip([lamBda * z for z in Z], features)]
                traces = "Accumulated traces"

                state, reward = step(state, action)
                d = reward - sum([x[0] * x[1] for x in zip(features, w)])

                if state.gameover == 1:
                    w = [sum(x) for x in zip(w, [a * d * z for z in Z])]
                    break

                action, actionvalue, features = greedysoft(state, actionvalue, w, e, 0)
                d += actionvalue[tuple(features)]
                w = [sum(x) for x in zip(w, [a * d * z for z in Z])]

        if lamBda in (0.0, 1.0):
            mses += [(game, mse(MCactionvalue, actionvalue))]
            plot(mses, "Game", "Mean square error", "Lambda = " + str(lamBda) + " . " + traces)

        meansquarerror.append((lamBda, mse(MCactionvalue, actionvalue)))
    plot(meansquarerror, "Lambda", "Mean square error", "MSE: Lambda 0.0-1.0")