This repository contains accompanying code for the paper Reinforcement Learning from Passive Data via Latent Intentions.
The code is built off of jaxrl_m (see dibyaghosh/jaxrl_m for better documentation) -- all new code is in the src/ directory.
Add this directory to your PYTHONPATH. Install the dependencies for jaxrl_m (the usual suspects: jax, flax, optax, distrax, wandb, ml_collections), and additional dependencies depending on which environments you want to try (see requirements.txt).
The XMagical dataset is available on Google Drive
To train an ICVF agent on the Antmaze dataset, run:
python experiments/antmaze/train_icvf.py --env_name=antmaze-large-diverse-v2
To train an ICVF agent on the XMagical dataset, run:
python experiments/xmagical/train_icvf.py
- jaxrl_m/: A carbon copy of https://github.com/dibyaghosh/jaxrl_m
- icvf_envs/: Environment wrappers and dataset loaders
- src/: New code for ICVF
- icvf_learner.py: Core algorithmic logic
- icvf_networks.py: ICVF network architecture
- extra_agents/: Finetuning downstream RL agents from the ICVF representation
- experiments/: Launchers for ICVF experiments