PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
-
Updated
Mar 24, 2023 - Python
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Code and other material for the book "Deep Learning and the Game of Go"
BetaGo: AlphaGo for the masses, live on GitHub.
A student implementation of Alpha Go Zero
A Go playing program implemented in Tensorflow roughly according to the architecture of AlphaGo. Current strength is 3~4 amateur dan.
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
Reinforcing Your Learning of Reinforcement Learning
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
Meta-Zeta是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程
alphaGo版本的五子棋(gobang, gomoku)
Computer go engine using Monte-Carlo Tree Search written in Python3.
基於深度學習的 GTP 圍棋(围棋)引擎,KGS 指引文件以及演算法教學。
基于miniGo的幻影围棋AI,2019中国计算机博弈大赛幻影围棋组冠军;AI of Phantom Go based on miniGo
A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available
Deep Reinforcement Learning to Play 2048 (with Keras)
Add a description, image, and links to the alphago topic page so that developers can more easily learn about it.
To associate your repository with the alphago topic, visit your repo's landing page and select "manage topics."