Compare Episodic and Step-based Reinforcement Learning

Author: Gong, Yuhe

Overview

This repository builds a framework to easily compare episodic and step-based reinforcement learning with several environments (DeepMind Control Suite , ALR environments, OpenAI environment).

For step-based reinforcement learning, we use PPO, SAC from stable-baselines3 framework.
For episodic reinforcement learning, we use DMP, ProMP from ALR's framework

We use different reward function to compare the performance in each environment:

Sparse reward function: in one episode, only one step has the reward according to the task, other steps' rewards is 0.
Dense reward function: in one episode, every step has the reward according to the task.

Training with dense reward

Name	PPO	DMP	ProMP
`ALRHoleReacher`	✔️
`ALRBallInACup`	✔️
`DeepMindBallInCup`	✔️	✔️	✔️

Training with sparse reward

Name	DMP	ProMP
`ALRHoleReacher`
`ALRBallInACup`
`DeepMindBallInCup`	✔️	✔️

Command

For training environment

Step-based algo

python train.py --algo ppo --env_id ALRBallInACupSimpleDense-v0

python train.py --algo ppo --env_id DeepMindBallInCupDense-v0

python train.py --algo ppo --env_id DeepMindBallInCup-v0

python train.py --algo sac --env_id DeepMindBallInCupDense-v0 --seed 0

python train.py --algo ppo --env_id HoleReacherDense-v0

Episodic algo

python train.py --algo dmp --env_id DeepMindBallInCupDMP-v0 --stop_cri True

python train.py --algo dmp --env_id DeepMindBallInCupDenseDMP-v0

python train.py --algo promp --env_id DeepMindBallInCupProMP-v0

python train.py --algo promp --env_id DeepMindBallInCupDenseProMP-v0

python train.py --algo dmp --env_id ALRReacherBalanceDMP-v0

For continue training

python train_continue.py --algo ppo --env_id ALRBallInACupSimpleDense-v0 --model_id 1

python train_continue.py --algo ppo --env_id DeepMindBallInCupDense-v0 --model_id 1

python train_continue.py --algo ppo --env_id ALRReacher-v0 --model_id 5

For enjoy a well-trained model:

python enjoy.py --algo ppo --env_id ALRBallInACupSimpleDense-v0 --model_id 20 --step 300

python enjoy.py --algo dmp --env_id DeepMindBallInCupDenseDMP-v0 --model_id 2 --step 300

python enjoy.py --algo promp --env_id DeepMindBallInCupDenseProMP-v0 --model_id 4 --step 300

python enjoy.py --algo ppo --env_id DeepMindBallInCup-v0 --model_id 3 --step 400

python enjoy.py --algo sac --env_id DeepMindBallInCup-v0 --model_id 3 --step 400

python enjoy.py --algo ppo --env_id ALRReacherBalance-v0 --model_id 10 --step 400

Name		Name	Last commit message	Last commit date
Latest commit History 318 Commits
.idea		.idea
config		config
logs		logs
src		src
utils		utils
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
cluster.py		cluster.py
cw2run.py		cw2run.py
enjoy.py		enjoy.py
requirements.txt		requirements.txt
train.py		train.py
train_continue.py		train_continue.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Compare Episodic and Step-based Reinforcement Learning

Overview

Training with dense reward

Training with sparse reward

Command

For training environment

For continue training

For enjoy a well-trained model:

About

Releases

Packages

Contributors 2

Languages

YuheGong/Compare-Episodic-and-Step-based-Reinforcement-Learning

Folders and files

Latest commit

History

Repository files navigation

Compare Episodic and Step-based Reinforcement Learning

Overview

Training with dense reward

Training with sparse reward

Command

For training environment

For continue training

For enjoy a well-trained model:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages