Skip to content
Navigation Menu
Toggle navigation
Sign in
Product
Actions
Automate any workflow
Packages
Host and manage packages
Security
Find and fix vulnerabilities
Codespaces
Instant dev environments
GitHub Copilot
Write better code with AI
Code review
Manage code changes
Issues
Plan and track work
Discussions
Collaborate outside of code
Explore
All features
Documentation
GitHub Skills
Blog
Solutions
By size
Enterprise
Teams
Startups
By industry
Healthcare
Financial services
Manufacturing
By use case
CI/CD & Automation
DevOps
DevSecOps
Resources
Topics
AI
DevOps
Security
Software Development
View all
Explore
Learning Pathways
White papers, Ebooks, Webinars
Customer Stories
Partners
Open Source
GitHub Sponsors
Fund open source developers
The ReadME Project
GitHub community articles
Repositories
Topics
Trending
Collections
Enterprise
Enterprise platform
AI-powered developer platform
Available add-ons
Advanced Security
Enterprise-grade security features
GitHub Copilot
Enterprise-grade AI features
Premium Support
Enterprise-grade 24/7 support
Pricing
Search or jump to...
Search code, repositories, users, issues, pull requests...
Search syntax tips
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign in
Sign up
Reseting focus
You signed in with another tab or window.
Reload
to refresh your session.
You signed out in another tab or window.
Reload
to refresh your session.
You switched accounts on another tab or window.
Reload
to refresh your session.
Dismiss alert
{{ message }}
werner-duvaud
/
muzero-general
Public
Notifications
You must be signed in to change notification settings
Fork
606
Star
2.5k
Code
Issues
47
Pull requests
11
Actions
Wiki
Security
Insights
Additional navigation options
Code
Issues
Pull requests
Actions
Wiki
Security
Insights
Commits
Breadcrumbs
History for
muzero-general
self_play.py
on
master
User selector
All users
Datepicker
All time
Commit History
Commits on Mar 11, 2022
Add workflow (
#188
)
ahainaut
and
werner-duvaud
committed
Mar 11, 2022
0c4c335
Commits on Nov 11, 2020
Fix GPU availability in actors with PyTorch 1.7
werner-duvaud
committed
Nov 11, 2020
f0f73ec
Commits on Sep 16, 2020
Fix reanalyse and format
werner-duvaud
committed
Sep 16, 2020
6a273e0
Commits on Sep 6, 2020
Improve docstring and fix load replay buffer
#75
werner-duvaud
committed
Sep 6, 2020
e21c90c
Commits on Aug 20, 2020
Add resume training and improve training exit
werner-duvaud
and
ahainaut
committed
Aug 20, 2020
de80a8b
Commits on Aug 14, 2020
Improve CPU/GPU management
werner-duvaud
committed
Aug 14, 2020
937acb6
Commits on Aug 10, 2020
Add Reanalyse
werner-duvaud
committed
Aug 10, 2020
b2539d8
Commits on Jul 30, 2020
Add selfplay with gpu, multi gpu, better env closing, save hp search results
werner-duvaud
committed
Jul 30, 2020
b87fdfe
Commits on Jul 26, 2020
Add hp search and deterministic lunarlander, improve ratio metric and seeding, refactor and minor fixes,
fix
#60
werner-duvaud
committed
Jul 26, 2020
ef88151
Commits on Jun 30, 2020
Fix backpropagate
werner-duvaud
committed
Jun 30, 2020
c046c03
Commits on Jun 22, 2020
Add diagnose model
werner-duvaud
committed
Jun 22, 2020
a31c830
Commits on Jun 1, 2020
Fix
#55
werner-duvaud
committed
Jun 1, 2020
2a7e44b
Commits on May 6, 2020
Merge pull request
#43
from TimZF/patch-1
ahainaut
committed
May 6, 2020
a191572
Typo
werner-duvaud
committed
May 6, 2020
8b886d8
Commits on May 3, 2020
Update doc and refactor
werner-duvaud
committed
May 3, 2020
d74a3b2
Commits on May 2, 2020
Fix reward for more than 2 players
tfzee
committed
May 2, 2020
c6db06d
Commits on Apr 24, 2020
Refactor
werner-duvaud
committed
Apr 24, 2020
f5dd3d2
Commits on Apr 21, 2020
Fix
#34
(last commit)
ahainaut
committed
Apr 21, 2020
2adb848
Commits on Apr 19, 2020
Refactor
ahainaut
committed
Apr 19, 2020
91afb1d
Commits on Apr 11, 2020
Add mean value plot
werner-duvaud
committed
Apr 11, 2020
b94cd65
Commits on Apr 7, 2020
Turn replay buffer into numbered dict
werner-duvaud
committed
Apr 7, 2020
540cd70
Commits on Apr 5, 2020
Improve memory with stacked observations
werner-duvaud
committed
Apr 5, 2020
4b4a0aa
Commits on Apr 4, 2020
Add value reanalyze
werner-duvaud
committed
Apr 4, 2020
df0e407
td error for PER
werner-duvaud
committed
Apr 4, 2020
6bc5bfe
Commits on Apr 1, 2020
Add Atari
werner-duvaud
committed
Apr 1, 2020
6306b5d
Commits on Mar 31, 2020
Add tree depth info
werner-duvaud
committed
Mar 31, 2020
f706376
Commits on Mar 30, 2020
Add selfplay / train ratio and improve reproductibility
werner-duvaud
committed
Mar 30, 2020
ecd8870
Commits on Mar 28, 2020
Change batch aggregation, fix value in replay buffer and prepare merge
werner-duvaud
committed
Mar 28, 2020
f60f199
Commits on Mar 20, 2020
Merge pull request
#23
from xuxiyang1993/master
werner-duvaud
committed
Mar 20, 2020
4d54162
Improve cartpole hyperparameters and fix typo
werner-duvaud
committed
Mar 20, 2020
633c658
Commits on Mar 19, 2020
add PER support
xuxiyang1993
committed
Mar 19, 2020
ebc9434
Commits on Mar 16, 2020
Fix OverflowError and Add Conv 1x1
ahainaut
committed
Mar 16, 2020
2a9b99b
Commits on Mar 14, 2020
Fix MCTS and typo
werner-duvaud
and
ahainaut
committed
Mar 14, 2020
ecca75c
Commits on Mar 11, 2020
Fix MCTS
werner-duvaud
committed
Mar 11, 2020
0918977
Commits on Mar 8, 2020
Add stack action to stacked observations
werner-duvaud
committed
Mar 8, 2020
283e353
Pagination
Previous
Next
You can’t perform that action at this time.