MTA

Implementation of the paper "Faster and More Accurate Trace-based Policy Evaluation via Overall Target Error Meta-Optimization" [1].

The "ringworld" tests use our implemented version of the environment.

This repository also contains our reproduced $\lambda$-greedy algorithm [2], with some additional tools or scripts to draw the figures showed in the paper [1].

References

[1] Zhao, et al., Faster and More Accurate Trace-based Policy Evaluation via Overall Target Error Meta-Optimization, 2019

[2] White and White, A Greedy Approach to Adapting the Trace Parameter for Temporal Difference Learning, 2016

Requirements

Python 3.6+
Dependent python modules

Cite

Please kindly cite our work if necessary:

@article{zhao2019faster,
title={Faster and More Accurate Trace-based Policy Evaluation via Overall Target Error Meta-Optimization},
author={Zhao, Mingde and Porada, Ian and Luan, Sitao and Chang, Xiao-Wen and Precup, Doina},
journal={arXiv},
volume={1904.11439},
year={2019},
url={https://arxiv.org/abs/1904.11439},
}

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
LATEX		LATEX
frozenlake		frozenlake
gadgets		gadgets
ringworld		ringworld
.gitattributes		.gitattributes
.gitignore		.gitignore
MC.py		MC.py
README.md		README.md
RingWorld.py		RingWorld.py
compare_kappa.m		compare_kappa.m
frozen_lake.py		frozen_lake.py
frozenlake.sh		frozenlake.sh
frozenlake_MC.py		frozenlake_MC.py
frozenlake_MTA.py		frozenlake_MTA.py
frozenlake_compare_kappa.m		frozenlake_compare_kappa.m
frozenlake_lambda.m		frozenlake_lambda.m
frozenlake_truths_4x4.npz		frozenlake_truths_4x4.npz
frozenlake_value.m		frozenlake_value.m
get_frozen_lake_ground_truth.py		get_frozen_lake_ground_truth.py
get_truth_MC.py		get_truth_MC.py
greedy.py		greedy.py
methods.py		methods.py
mta.py		mta.py
ringworld.sh		ringworld.sh
ringworld_MC.py		ringworld_MC.py
ringworld_MTA.py		ringworld_MTA.py
ringworld_togtd.py		ringworld_togtd.py
true_online_GTD.py		true_online_GTD.py
utils.py		utils.py

shubhampachori12110095/MTA

Folders and files

Latest commit

History

Repository files navigation

MTA

References

Requirements

Cite

About

Resources

Stars

Watchers

Forks

Languages