Example #1
            'clip_range': (0.05, 0.3),  # range of the adapted penalty factor
            'scale_constant': 1.2,
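            # (assumption) presumably the penalty factor is scaled up or down by
            # scale_constant as the measured KL drifts from its target, then
            # clamped into clip_range, e.g. min(max(factor * 1.2, 0.05), 0.3)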
        },
    },
    'replay': {
        # 'replay_class': 'FIFOReplay',
        'batch_size': 64,
        'memory_size': 96,
        'sampling_start_size': 64,
        'replay_shards': 1,
    },
    'parameter_publish': {
        'exp_interval': 4096,
    },
})
PPO_DEFAULT_LEARNER_CONFIG.extend(BASE_LEARNER_CONFIG)
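
For orientation, a minimal sketch of what an extend()-style merge presumably does here: fill in keys missing from the derived config from the base, without clobbering values already set (an assumption about SURREAL's Config behavior, not its actual implementation).

# Sketch only: dict-backed config with "derived wins" merge semantics.
class SketchConfig(dict):
    def extend(self, base):
        # Copy keys absent from self; recurse into nested dicts so partial
        # overrides (e.g. only 'batch_size' inside 'replay') are preserved.
        for key, base_value in base.items():
            if key not in self:
                self[key] = base_value
            elif isinstance(self[key], dict) and isinstance(base_value, dict):
                SketchConfig.extend(self[key], base_value)

cfg = SketchConfig({'replay': {'batch_size': 64}})
cfg.extend({'replay': {'batch_size': 32, 'replay_shards': 1}, 'log_every': 100})
assert cfg == {'replay': {'batch_size': 64, 'replay_shards': 1}, 'log_every': 100}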

PPO_DEFAULT_ENV_CONFIG = Config({
    'env_name': '',
    'action_repeat': 1,
    'pixel_input': False,
    'use_grayscale': False,
    'use_depth': False,
    'frame_stacks': 1,
    'sleep_time': 0,
    'video': {
        'record_video': False,
        'save_folder': None,
        'max_videos': 500,
        'record_every': 5,
    },
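
A hedged usage sketch: a per-experiment config would presumably set a few fields and then pull in these defaults ('MyEnv-v0' and the override pattern below are assumptions, not part of the listing).

my_env_config = Config({
    'env_name': 'MyEnv-v0',  # hypothetical environment id
    'pixel_input': True,
    'frame_stacks': 4,
})
my_env_config.extend(PPO_DEFAULT_ENV_CONFIG)  # fill remaining fields from the defaults
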
Example #2
    },
    'replay': {
        'batch_size': 512,
        'memory_size': int(1000000 / 3),  # The total replay size is memory_size * replay_shards (see the check below)
        'sampling_start_size': 3000,
        'replay_shards': 3,
    },
    'parameter_publish': {
        # Minimum amount of time (seconds) between two parameter publishes
        'min_publish_interval': 3,
    },
})

DDPG_DEFAULT_LEARNER_CONFIG.extend(BASE_LEARNER_CONFIG)
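
A quick arithmetic check of the replay sizing above: with 3 shards of int(1000000 / 3) transitions each, the total capacity is just under the intended 1M.

# Sanity check (values copied from the config above).
memory_size = int(1000000 / 3)   # 333333 per shard
replay_shards = 3
assert memory_size * replay_shards == 999999  # ~1M transitions in total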

DDPG_DEFAULT_ENV_CONFIG = Config({
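    # '_str_' / '_int_' appear to be required placeholders that the caller must
    # override before launch (an assumption about this Config's conventions)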
    'env_name': '_str_',
    'num_agents': '_int_',
    'demonstration': None,
    'use_depth': False,
    'render': False,
    'use_demonstration': False,
    # If true, DDPG will expect an image at obs['pixel']['camera0']
    'pixel_input': False,
    'use_grayscale': False,
    # Stacks previous image frames together to provide history information
    'frame_stacks': 3,
    # Each action will be played this number of times. The reward of the
    # consecutive actions will be the reward of the last action in the sequence.