# Imports required by this excerpt.
from itertools import count

import numpy as np
import torch
from torch.autograd import Variable

# Optimizer for the CNN classifier trained on observations gathered while touching.
classifier_optimizer = torch.optim.Adam(cnn.parameters(), lr=0.001)

running_reward = 0
batch = []
labels = []
total_steps = 0

if args.mode == "train" or args.mode == "all":
    for i_episode in count(1000):
        observation = env.reset()
        print("episode: ", i_episode)
        for t in range(1000):
            # Select an action (exploration controlled by args.epsilon).
            action = select_action(observation, env.action_space_n(), args.epsilon)
            observation, reward, done, info = env.step(action)
            model.rewards.append(reward)
            if env.is_touching():
                print("touching!")

            # Once enough observations have accumulated, run one classifier update.
            # print("batch size", len(batch))
            if len(batch) > args.batch_size:
                # The CNN expects float inputs; class labels must be a LongTensor
                # for the classification loss.
                batch = torch.from_numpy(np.asarray(batch)).float()
                labels = torch.from_numpy(np.asarray(labels)).long()
                if args.gpu and torch.cuda.is_available():
                    batch = batch.cuda()
                    labels = labels.cuda()
                batch = Variable(batch)
                labels = Variable(labels)
                classifier_optimizer.zero_grad()
                outputs = cnn(batch)
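                # --- Sketch (not in the original excerpt): a typical way to finish
                # this classifier update, assuming a cross-entropy objective and
                # that `batch`/`labels` should be cleared before the next round of
                # touch observations accumulates. ---
                loss = torch.nn.functional.cross_entropy(outputs, labels)
                loss.backward()
                classifier_optimizer.step()
                # Reset the buffers so a fresh batch can accumulate.
                batch = []
                labels = []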
done = False
while not done:
    action = RL.choose_action(observation)
    observation_, reward, done, info = env.step(action)

    # Something to consider: should we modify the reward if this is the terminal
    # state and we haven't touched yet? A large penalty for finishing the round
    # with no touch might help (a sketch of one option appears after this loop).
    RL.store_transition(observation, action, reward, observation_)
    ep_r[i_episode] += reward

    # Start learning once enough transitions have been collected.
    if total_steps > 1000:
        cost = RL.learn()

    if env.is_touching():
        print('\ntouching at step', env.steps, 'total reward is ', ep_r[i_episode])
        games_where_touched += 1
        # Record the observation and its class label as training data for the CNN.
        cnn_features_TD[TD_cnt] = observation_
        cnn_labels_TD[TD_cnt] = env.class_label
        TD_cnt += 1
        ep_touch[i_episode] += 1

    if env.steps % 500 == 0:
        print('\nepisode: ', i_episode + 1, 'step: ', env.steps,
              'episode reward ', ep_r[i_episode])

    observation = observation_
    total_steps += 1
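# --- Sketch (not in the original code): one way to act on the reward-shaping
# question raised in the comment above. The helper name and the penalty value
# are hypothetical; it assumes the caller tracks whether a touch has happened
# this episode (e.g. ep_touch[i_episode] > 0). ---
def shape_reward(reward, done, touched_this_episode, no_touch_penalty=10.0):
    """Subtract a penalty when a round ends without any touch."""
    if done and not touched_this_episode:
        return reward - no_touch_penalty
    return reward

# Possible usage inside the loop above, in place of passing `reward` directly:
#   shaped = shape_reward(reward, done, ep_touch[i_episode] > 0)
#   RL.store_transition(observation, action, shaped, observation_)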