def policy_model(inp, mdp, spec, name="policy", reuse=None, track_scope=None):
    """Build the policy network for a batch of states.

    Args:
        inp: `batch_size * state_dim` input batch.
        mdp: MDP description providing `state_dim` and `action_dim`.
        spec: experiment spec providing `policy_dims` (hidden layer sizes).
        name: variable scope name for the policy parameters.
        reuse: passed through to `tf.variable_scope` for weight sharing.
        track_scope: forwarded to `util.mlp`.

    Returns:
        actions: `batch_size * action_dim`
    """
    # TODO remove magic numbers (the 0.5 stddev below)
    weight_init = tf.truncated_normal_initializer(stddev=0.5)
    with tf.variable_scope(name, reuse=reuse, initializer=weight_init):
        return util.mlp(inp, mdp.state_dim, mdp.action_dim,
                        hidden=spec.policy_dims,
                        track_scope=track_scope)
def critic_model(inp, actions, mdp, spec, name="critic", reuse=None, track_scope=None):
    """Estimate Q(s, a) for each state-action pair in the batch.

    Args:
        inp: `batch_size * state_dim` state batch.
        actions: `batch_size * action_dim` action batch.
        mdp: MDP description providing `state_dim` and `action_dim`.
        spec: experiment spec providing `critic_dims` (hidden layer sizes).
        name: variable scope name for the critic parameters.
        reuse: passed through to `tf.variable_scope` for weight sharing.
        track_scope: forwarded to `util.mlp`.

    Returns:
        `batch_size` vector of Q-value predictions.
    """
    with tf.variable_scope(name, reuse=reuse):
        # Concatenate state and action along the feature axis, then map the
        # joint vector down to a single Q-value per example.
        state_action = tf.concat(1, [inp, actions])
        q_values = util.mlp(state_action,
                            mdp.state_dim + mdp.action_dim, 1,
                            hidden=spec.critic_dims,
                            bias_output=True,
                            track_scope=track_scope)
        # Drop the trailing singleton dimension: (batch, 1) -> (batch,).
        return tf.squeeze(q_values)
# NOTE(review): removed a duplicate definition of `policy_model` here. It was
# behaviorally identical (same signature, same scope/initializer, same
# `util.mlp` call) to the `policy_model` defined earlier in this file and
# silently re-bound the name, shadowing the first definition with no effect.