Example #1
        rewardNormalization = "returns"

    env = DummyVecEnv([
        makeEnvLambda(args.gym_id,
                      args.seed,
                      normOb=normOb,
                      rewardNormalization=rewardNormalization,
                      clipOb=clipOb,
                      clipRew=clipRew,
                      gamma=args.gamma)
    ])

    np.random.seed(args.seed)
    tf.set_random_seed(args.seed)

    discreteActionsSpace = utils.is_discrete(env)

    inputLength = env.observation_space.shape[0]
    outputLength = (env.action_space.n
                    if discreteActionsSpace else env.action_space.shape[0])

    # summaries: placeholders and summary scalar objects
    epRewTestPh = tf.placeholder(
        tf.float32,
        shape=None,
        name='episode_test_real_reward_latest_mean_summary')
    epRewTrainPh = tf.placeholder(
        tf.float32,
        shape=None,
        name='episode_train_real_reward_latest_mean_summary')
    epTotalRewPh = tf.placeholder(
        tf.float32,  # dtype assumed to follow the tf.float32 pattern above; the listing is clipped here
        shape=None,
        name='episode_total_reward_summary')
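Two helpers used above, makeEnvLambda and utils.is_discrete, are defined elsewhere in the repository and are not part of this clipped listing. The sketch below is a minimal reconstruction inferred from the call sites, not the repository's actual code: DummyVecEnv expects a list of zero-argument callables, so makeEnvLambda must return a thunk, and is_discrete must distinguish Discrete from Box action spaces. The NormalizationWrapper name inside the thunk is hypothetical.

import gym

def makeEnvLambda(gym_id, seed, normOb=True, rewardNormalization=None,
                  clipOb=10.0, clipRew=10.0, gamma=0.99):
    # DummyVecEnv takes zero-argument constructors, so wrap the setup in a thunk
    def _thunk():
        env = gym.make(gym_id)
        # NormalizationWrapper is hypothetical; it stands in for whatever wrapper
        # applies the normalization/clipping options forwarded above
        env = NormalizationWrapper(env, normOb, rewardNormalization,
                                   clipOb, clipRew, gamma)
        env.seed(seed)
        return env
    return _thunk

def is_discrete(env):
    # Discrete action spaces expose an integer .n; Box (continuous) spaces expose .shape
    return isinstance(env.action_space, gym.spaces.Discrete)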
Example #2
        "\nBuffer size not specified. Taking value of {} which is the same as total_train_steps, as suggested by the paper\n"
        .format(args.buffer_size))

graph = tf.Graph()
with tf.Session(graph=graph) as sess:

    env = gym.make(args.gym_id)
    env = EnvironmentWrapper(env.env, args.norm_obs, args.norm_rew,
                             args.clip_obs, args.clip_rew)
    np.random.seed(args.seed)
    env.seed(args.seed)
    env.action_space.seed(args.seed)
    env.observation_space.seed(args.seed)
    tf.set_random_seed(args.seed)

    if utils.is_discrete(env):
        exit("TD3 can only be applied to continuous action space environments")

    inputLength = env.observation_space.shape[0]
    outputLength = env.action_space.shape[0]

    # summaries: placeholders and summary scalar objects
    epRewPh = tf.placeholder(tf.float32,
                             shape=None,
                             name='episode_reward_summary')
    epRewLatestMeanPh = tf.placeholder(
        tf.float32, shape=None, name='episode_reward_latest_mean_summary')
    epLenPh = tf.placeholder(tf.float32,
                             shape=None,
                             name='episode_length_summary')
    expVarPh = tf.placeholder(tf.float32,
                              shape=None,
                              name='explained_variance_summary')  # remaining args assumed; the listing is clipped here
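EnvironmentWrapper in Example #2 is likewise defined outside the listing. Below is a minimal sketch of what such a wrapper typically does, assuming running-statistics normalization with clipping; the class layout, field names, and update rule are assumptions, not the repository's implementation. Note the sketch scales rewards by the running standard deviation of raw rewards, whereas the rewardNormalization="returns" option in Example #1 suggests the real code normalizes by the standard deviation of discounted returns instead.

import numpy as np
import gym

class RunningMeanStd:
    # Streaming mean/variance via a per-sample Welford update
    def __init__(self, shape=()):
        self.mean = np.zeros(shape, dtype=np.float64)
        self.var = np.ones(shape, dtype=np.float64)
        self.count = 1e-4  # small prior count avoids division by zero early on

    def update(self, x):
        x = np.asarray(x, dtype=np.float64)
        delta = x - self.mean
        self.count += 1.0
        self.mean += delta / self.count
        self.var += (delta * (x - self.mean) - self.var) / self.count

class EnvironmentWrapper(gym.Wrapper):
    # Normalizes and clips observations and rewards with running statistics
    def __init__(self, env, norm_obs, norm_rew, clip_obs, clip_rew):
        super().__init__(env)
        self.norm_obs, self.norm_rew = norm_obs, norm_rew
        self.clip_obs, self.clip_rew = clip_obs, clip_rew
        self.ob_rms = RunningMeanStd(shape=env.observation_space.shape)
        self.rew_rms = RunningMeanStd(shape=())

    def _process_ob(self, ob):
        if not self.norm_obs:
            return ob
        self.ob_rms.update(ob)
        ob = (ob - self.ob_rms.mean) / np.sqrt(self.ob_rms.var + 1e-8)
        return np.clip(ob, -self.clip_obs, self.clip_obs)

    def step(self, action):
        ob, rew, done, info = self.env.step(action)
        if self.norm_rew:
            self.rew_rms.update(rew)
            rew = np.clip(rew / np.sqrt(self.rew_rms.var + 1e-8),
                          -self.clip_rew, self.clip_rew)
        return self._process_ob(ob), rew, done, info

    def reset(self, **kwargs):
        return self._process_ob(self.env.reset(**kwargs))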