Example #1
                # terminal move: store None as the next state
                samples[i].append(None)
            memory.add_sample(samples[i])

        # draw a random minibatch of (state, action, reward, next_state) transitions
        sample_batch = memory.sample_samples(batch_size)
        actual_batch_size = len(sample_batch)
        state_batch = np.zeros((actual_batch_size, 9))
        next_state_batch = np.zeros((actual_batch_size, 9))
        action_batch = [sample[1] for sample in sample_batch]
        
        for i, sample in enumerate(sample_batch):
            state_batch[i] = sample[0]
            # terminal transitions keep an all-zero next_state row
            if sample[3] is not None:
                next_state_batch[i] = sample[3]
            
        # current Q-value estimates for every square of each sampled state
        qsa_batch = model.predict_batch(state_batch, sess)

        for i in range(actual_batch_size):
            # an already-occupied square is an invalid move; push its Q-value down
            for choice in range(9):
                if state_batch[i, choice] != 0:
                    qsa_batch[i, choice] = -2
            if sample_batch[i][3] is None:
                # terminal state: the target is just the observed reward
                qsa_batch[i, action_batch[i]] = sample_batch[i][2]
            else:
                # non-terminal: reward plus discounted best Q-value of the next state
                qsa_batch[i, action_batch[i]] = sample_batch[i][2] + gamma * np.amax(
                    model.predict_one(next_state_batch[i].reshape((1, 9)), sess))
            
        # fit the network to the updated Q-value targets
        model.train_batch(state_batch, qsa_batch, sess)

        # exponentially decay the exploration rate as more games are played
        epsilon = 0.9 * np.exp(-0.001 * game)
    model.save(sess)
    model.plot_losses()
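
Both examples rely on a replay buffer exposing add_sample and sample_samples, which is not shown in the snippets. A minimal sketch of such a Memory class, assuming a capped list with uniform random sampling (the capacity and the sampling strategy are guesses, not the original implementation):

import random

class Memory:
    def __init__(self, max_memory=50000):
        self._max_memory = max_memory
        self._samples = []

    def add_sample(self, sample):
        # sample is a (state, action, reward, next_state) transition;
        # next_state is None for terminal moves
        self._samples.append(sample)
        if len(self._samples) > self._max_memory:
            self._samples.pop(0)

    def sample_samples(self, batch_size):
        # return at most batch_size transitions chosen uniformly at random,
        # which is why the training code re-reads the actual batch size
        n = min(batch_size, len(self._samples))
        return random.sample(self._samples, n)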
Example #2
        sample_batch = memory.sample_samples(batch_size)
        actual_batch_size = len(sample_batch)
        state_batch = np.zeros((actual_batch_size, 9))
        next_state_batch = np.zeros((actual_batch_size, 9))
        action_batch = [sample[1] for sample in sample_batch]

        for i, sample in enumerate(sample_batch):
            state_batch[i] = sample[0]
            if sample[3] is not None:
                next_state_batch[i] = sample[3]

        qsa_batch = model.predict_batch(state_batch, sess)

        for i in range(actual_batch_size):
            for choice in range(9):
                if state_batch[i, choice] != 0:
                    qsa_batch[i, choice] = invalid_move_reward
            if sample_batch[i][3] is None:
                qsa_batch[i, action_batch[i]] = sample_batch[i][2]
            else:
                qsa_batch[i, action_batch[i]] = sample_batch[i][2] + gamma * np.amax(
                    model.predict_one(next_state_batch[i].reshape((1, 9)), sess))

        model.train_batch(state_batch, qsa_batch, sess)

        epsilon = 0.9 * np.exp(-0.001 * game)
    model.save(sess, 'tic_tac_toe_model')
    model.plot_losses()
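
The model object in both examples is likewise assumed to be a TensorFlow 1.x wrapper exposing predict_batch, predict_one, train_batch, save, and plot_losses. The sketch below only mirrors that interface; the network architecture, optimizer, and loss tracking are illustrative choices, not the original code:

import numpy as np
import tensorflow as tf

class Model:
    def __init__(self, num_states=9, num_actions=9, learning_rate=0.001):
        # board encoding in, one Q-value per square out
        self._states = tf.placeholder(tf.float32, [None, num_states])
        self._qsa_targets = tf.placeholder(tf.float32, [None, num_actions])
        hidden = tf.layers.dense(self._states, 50, activation=tf.nn.relu)
        self._logits = tf.layers.dense(hidden, num_actions)
        self._loss = tf.losses.mean_squared_error(self._qsa_targets, self._logits)
        self._optimizer = tf.train.AdamOptimizer(learning_rate).minimize(self._loss)
        self._saver = tf.train.Saver()
        self._losses = []

    def predict_one(self, state, sess):
        # state is expected with shape (1, num_states)
        return sess.run(self._logits, feed_dict={self._states: state})

    def predict_batch(self, states, sess):
        return sess.run(self._logits, feed_dict={self._states: states})

    def train_batch(self, states, qsa_targets, sess):
        _, loss = sess.run([self._optimizer, self._loss],
                           feed_dict={self._states: states,
                                      self._qsa_targets: qsa_targets})
        self._losses.append(loss)

    def save(self, sess, name='model'):
        self._saver.save(sess, './' + name)

    def plot_losses(self):
        import matplotlib.pyplot as plt
        plt.plot(self._losses)
        plt.show()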