We have implemented PPO+D in PyTorchRL (https://github.com/PyTorchRL/pytorchrl), where it is now maintained. This repository is no longer in use.
Implementation of the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration" (https://arxiv.org/pdf/2007.03328.pdf).
Create a new conda environment and install the dependencies from requirements.txt:
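A minimal setup sketch, assuming conda is available; the environment name `ppo_d` and the Python version are assumptions, as this repository does not pin them:

```bash
conda create -n ppo_d python=3.7   # environment name and Python version are assumptions
conda activate ppo_d
pip install -r requirements.txt
```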
```bash
cd ppo_D
mkdir RUNS
```
Test run for sparse Lunar Lander:

```bash
python train_ppo_bullet.py --seed 16 --device 'cuda:0' --use-gae --lr 2e-4 \
  --clip-param 0.2 --value-loss-coef 0.3 --num-processes 12 --num-steps 2048 \
  --num-mini-batch 32 --entropy-coef 0.02 --num-env-steps 60000000 \
  --log-dir ../RUNS/exp_lunar_lander_1 --frame-stack 1 --cnn MLP --gamma 0.99 \
  --save-interval 50 --gae-lambda 0.95 --ppo-epoch 10 --state-stack 16 \
  --rho 0.1 --phi 0.0 --size-buffer 50 --size-buffer-V 0 \
  --demo-dir ../datasets/sparse_lunar_lander/recordings/ --threshold-reward 0.0 \
  --task 'SparseLunarLander-v1'
```
Test run for sparse Reacher:

```bash
python train_ppo_bullet.py --seed 42 --device 'cuda:0' --use-gae --lr 2e-4 \
  --clip-param 0.2 --value-loss-coef 0.3 --num-processes 64 --num-steps 2048 \
  --num-mini-batch 32 --entropy-coef 0.02 --num-env-steps 500000000 \
  --log-dir ../RUNS/exp_reacher_1 --frame-stack 1 --cnn MLP --gamma 0.99 \
  --save-interval 50 --gae-lambda 0.95 --ppo-epoch 10 --state-stack 16 \
  --rho 0.3 --phi 0.0 --size-buffer 40 --size-buffer-V 0 \
  --demo-dir ../datasets/sparse_reacher/recorded_reacher_threshold_1_10 \
  --threshold-reward 0.001 --task 'SparseReacher-v1'
```
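The flags `--rho` and `--phi` set the probabilities with which a rollout worker replays a recorded demonstration (loaded from `--demo-dir`) or one of the agent's own high-value trajectories, instead of acting with the current policy; `--size-buffer`, `--size-buffer-V` and `--threshold-reward` control the buffers those trajectories live in. Below is a minimal Python sketch of that scheduling idea; the names (`ReplayScheduler`, `choose_source`, `store`) are hypothetical and do not correspond to this repository's actual classes:

```python
import random

class ReplayScheduler:
    """Sketch: pick the data source for each worker's next rollout."""

    def __init__(self, rho, phi, demos, size_buffer_v):
        self.rho = rho                  # probability of replaying a demonstration
        self.phi = phi                  # probability of replaying a high-value trajectory
        self.demo_buffer = list(demos)  # seeded from the recordings in --demo-dir
        self.size_buffer_v = size_buffer_v
        self.value_buffer = []          # agent's own high-value trajectories

    def choose_source(self):
        """Return a trajectory to replay, or None to collect on-policy experience."""
        u = random.random()
        if u < self.rho and self.demo_buffer:
            return random.choice(self.demo_buffer)
        if u < self.rho + self.phi and self.value_buffer:
            return random.choice(self.value_buffer)
        return None

    def store(self, trajectory, total_reward, threshold_reward):
        """Keep trajectories whose return beats the threshold, FIFO-capped."""
        if total_reward > threshold_reward and self.size_buffer_v > 0:
            self.value_buffer.append(trajectory)
            if len(self.value_buffer) > self.size_buffer_v:
                self.value_buffer.pop(0)
```

With `--rho 0.1 --phi 0.0`, as in the Lunar Lander command above, a worker replays the demonstration roughly 10% of the time and never replays its own trajectories.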