Name		Name	Last commit message	Last commit date
Latest commit History 81 Commits
.github		.github
docs		docs
games		games
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
models.py		models.py
muzero.py		muzero.py
notebook.ipynb		notebook.ipynb
replay_buffer.py		replay_buffer.py
requirements.lock		requirements.lock
requirements.txt		requirements.txt
self_play.py		self_play.py
shared_storage.py		shared_storage.py
trainer.py		trainer.py

Repository files navigation

Continuous MuZero General

Adaptation of MuZero General for continuous action space environments like MuJoCo and PyBullet.

Features

Multi-dimension continuous action space
Fully connected network and Residual Network

Demo

Testing MuJoCo InvertedDoublePendulum-v2:

Games already implemented

MuJoCo InvertedPendulum-v2 (Tested with the fully connected network)
MuJoCo InvertedDoublePendulum-v2 (Tested with the fully connected network)
MuJoCo Swimmer-v2 (Tested with the fully connected network)
MuJoCo Hopper-v2
MuJoCo Walker2d-v2
PyBullet InvertedPendulumBulletEnv-v0 (Tested with the fully connected network)
PyBullet InvertedDoublePendulumBulletEnv-v0 (Tested with the fully connected network)
PyBullet HopperBulletEnv-v0

Tests are done on Ubuntu with 16 GB RAM / Intel i7 / GTX 1050Ti Max-Q. We make sure to obtain a progression and a level which ensures that it has learned. But we do not systematically reach a human level. For certain environments, we notice a regression after a certain time. The proposed configurations are certainly not optimal and we do not focus for now on the optimization of hyperparameters. Any help is welcome.

Getting started

Installation

git clone https://github.com/werner-duvaud/muzero-general.git
cd muzero-general
git checkout continuous

pip install -r requirements.txt

For MuJoCo environments, follow the instructions here for the installation.

Run

python muzero.py

To visualize the training results, run in a new terminal:

tensorboard --logdir ./results

Authors

Xuxi Yang
Werner Duvaud
Aurèle Hainaut
Contributors

Please use this bibtex if you want to cite this repository (master branch) in your publications:

@misc{muzero-general,
  author       = {Werner Duvaud, Aurèle Hainaut},
  title        = {MuZero General: Open Reimplementation of MuZero},
  year         = {2019},
  publisher    = {GitHub},
  journal      = {GitHub repository},
  howpublished = {\url{https://github.com/werner-duvaud/muzero-general}},
}

Getting involved

GitHub Issues: For reporting bugs.
Pull Requests: For submitting code contributions.
Discord server: For discussions about development or any general questions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Continuous MuZero General

Features

Demo

Games already implemented

Getting started

Installation

Run

Authors

Getting involved

About

Contributors 16

Languages

License

werner-duvaud/muzero-general

Folders and files

Latest commit

History

Repository files navigation

Continuous MuZero General

Features

Demo

Games already implemented

Getting started

Installation

Run

Authors

Getting involved

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 16

Languages