The difference between the official pseudocode and this repository regarding "num_unroll_steps" #221
Open
1 task done
Labels
bug
Search before asking
🐛 Describe the bug
This is the official pseudocode for the weight update:
It only trains on actions that actually happened in the game history and excludes anything past the end of the game. muzero_general, however, also trains on actions past the end of the game:
muzero-general/replay_buffer.py
Line 291 in 0c4c335
Add an example
As mentioned above.
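To make the difference concrete, here is a minimal sketch of the two behaviours described in this issue. It is not code from either codebase: the function names `unroll_actions_truncated` and `unroll_actions_padded` and the toy `history` are hypothetical, and the padding with random actions is an assumption about how training past the end of a game could come about.

```python
import random


def unroll_actions_truncated(history, start, num_unroll_steps):
    """Hypothetical helper: keep only actions that were really played.

    The slice simply stops at the end of the game, so positions close to
    the end yield fewer than num_unroll_steps actions.
    """
    return history[start:start + num_unroll_steps]


def unroll_actions_padded(history, start, num_unroll_steps, action_space, rng=random):
    """Hypothetical helper: always return exactly num_unroll_steps actions.

    Steps past the end of the game are filled with randomly chosen actions
    (an assumption made for illustration), so those steps are trained on too.
    """
    actions = history[start:start + num_unroll_steps]
    missing = num_unroll_steps - len(actions)
    return actions + [rng.choice(action_space) for _ in range(missing)]


history = [0, 1, 1, 0]  # toy game with 4 moves
print(unroll_actions_truncated(history, 2, 5))            # [1, 0]
print(len(unroll_actions_padded(history, 2, 5, [0, 1])))  # 5
```

With `num_unroll_steps = 5` and a sample taken two moves before the end, the first variant unrolls 2 steps, while the second unrolls all 5, including 3 steps that never occurred in the game.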
Environment
No response
Minimal Reproducible Example
No response
Additional
No response