def test_get_fingerprint(self):
  """Benzene's Morgan fingerprint at length 64 has exactly four bits set."""
  hparams = deep_q_networks.get_hparams(fingerprint_length=64)
  fingerprint = deep_q_networks.get_fingerprint('c1ccccc1', hparams)
  # Expected vector: ones at the known on-bit positions, zeros elsewhere.
  on_bits = {0, 5, 16, 17}
  expected = [1 if i in on_bits else 0 for i in range(64)]
  self.assertListEqual(fingerprint.tolist(), expected)
def _step(environment, dqn, memory, episode, hparams, exploration, head):
  """Executes one step of an episode and records it in the replay buffer.

  The DQN scores the fingerprint of every currently-valid action (augmented
  with the number of steps remaining), one action is selected with
  epsilon-greedy exploration, the environment is advanced, and the resulting
  transition is stored in ``memory``.

  Args:
    environment: molecules.Molecule; the environment to run on.
    dqn: DeepQNetwork used for estimating rewards.
    memory: ReplayBuffer used to store observations and rewards.
    episode: Integer episode number.
    hparams: HParams.
    exploration: Schedule used for exploration in the environment.
    head: Integer index of the DeepQNetwork head to use.

  Returns:
    molecules.Result object containing the result of the step.
  """
  # Steps remaining BEFORE taking the step; appended to each candidate's
  # fingerprint so the network sees the time horizon.
  remaining_steps = hparams.max_steps_per_episode - environment.num_steps_taken
  candidates = list(environment.get_valid_actions())
  candidate_observations = np.vstack([
      np.append(deep_q_networks.get_fingerprint(mol, hparams), remaining_steps)
      for mol in candidates
  ])
  epsilon = exploration.value(episode)
  chosen_index = dqn.get_action(
      candidate_observations, head=head, update_epsilon=epsilon)
  action = candidates[chosen_index]
  chosen_fingerprint = np.append(
      deep_q_networks.get_fingerprint(action, hparams), remaining_steps)
  result = environment.step(action)
  # Recompute the horizon AFTER the step so successor fingerprints carry the
  # post-step remaining-step count.
  remaining_steps = hparams.max_steps_per_episode - environment.num_steps_taken
  successor_fingerprints = np.vstack([
      np.append(deep_q_networks.get_fingerprint(mol, hparams), remaining_steps)
      for mol in environment.get_valid_actions()
  ])
  # obs_t already encodes the chosen action's fingerprint, so the stored
  # action index is irrelevant; 0 is a placeholder.
  memory.add(
      obs_t=chosen_fingerprint,
      action=0,
      reward=result.reward,
      obs_tp1=successor_fingerprints,
      done=float(result.terminated))
  return result