Leela Zero
Original author(s) | Gian-Carlo Pascutto |
---|---|
Developer(s) | Gian-Carlo Pascutto |
Initial release | 25 October 2017 |
Final release | 0.17
/ 4 April 2019 |
Repository | |
Written in | C++ |
Type | Go software |
License | GPL-3.0 |
Website | zero |
Leela Zero is a free and open-source computer Go program released on 25 October 2017. It is developed by Belgian programmer Gian-Carlo Pascutto,[1][2][3] the author of chess engine Sjeng and Go engine Leela.[4][5]
Leela Zero's algorithm is based on DeepMind's 2017 paper about AlphaGo Zero.[3][6] Unlike the original Leela, which has a lot of human knowledge and heuristics programmed into it, the program code in Leela Zero only knows the basic rules and nothing more. The knowledge that makes Leela Zero a strong player is contained in a neural network, which is trained based on the results of previous games that the program played.[7]
Leela Zero is trained by a distributed effort, which is coordinated at the Leela Zero website. Members of the community provide computing resources by running the client, which generates self-play games and submits them to the server. The self-play games are used to train newer networks. Generally, over 500 clients have connected to the server to contribute resources.[7] The community has provided high quality code contributions as well.[7]
Version history
[edit]Leela Zero finished third at the BerryGenomics Cup World AI Go Tournament in Fuzhou, Fujian, China on 28 April 2018.[8] The New Yorker at the end of 2018 characterized Leela and Leela Zero as "the world’s most successful open-source Go engines".[9]
In early 2018, another team branched Leela Chess Zero from the same code base, also to verify the methods in the AlphaZero paper as applied to the game of chess. AlphaZero's use of Google TPUs was replaced by a crowd-sourcing infrastructure and the ability to use graphics card GPUs via the OpenCL library. Even so, it is expected to take a year of crowd-sourced training to make up for the dozen hours that AlphaZero was allowed to train for its chess match in the paper.[10]
The distributed training server was shut down on 2021-02-15, marking the end of Leela Zero project. The page now directs visitors to KataGo and SAI.[11]
The model sizes increased steadily over time. The first released model has hash name d645af97, size 1x8 (1 layer, 8 channels), and released at 2017-11-10 13:04. The last released model has hash name 0e9ea880, size 40x256, and was released at 2021-02-15 09:04.[12]
Technology
[edit]Leela Zero is an (almost) exact replication of AlphaGo Zero in both training process and architecture.[13]
The training process is Monte-Carlo Tree Search with self-play, exactly the same as AlphaGo Zero.
The architecture is the same as AlphaGo Zero (with one difference). Consider the last released model, 0e9ea880. It has 47 million parameters, and the following architecture:[14][6]
- The stem of the network takes as input a 18x19x19 tensor representation of the Go board.
- 8 channels are the positions of the current player's stones from the last eight time steps. (1 if there is a stone, 0 otherwise. If the time step go before the beginning of the game, then 0 in all positions.)
- 8 channels are the positions of the other player's stones from the last eight time steps.
- 1 channel is all 1 if black is to move, and 0 otherwise.
- 1 channel is all 1 if white is to move, and 0 otherwise. (This channel is not present in the original AlphaGo Zero)
- The body is a ResNet with 40 residual blocks and 256 channels.
- There are two heads, a policy head and a value head.
References
[edit]- ^ "Feature: One man's Go program looks to remake AlphaGo Zero - and beyond". Xinhuanet. 9 April 2018. Archived from the original on April 9, 2018. Retrieved 28 April 2018.
- ^ "围棋AI"丽拉"获赞接近职业棋手水准,它的作者竟是一个不太会下棋的程序员" (in Chinese). Xinhuanet. 5 February 2018. Archived from the original on 11 February 2018. Retrieved 27 April 2018.
- ^ a b "更开放,更共享,比利时围棋AI"丽拉·元"重塑"阿尔法元"" (in Chinese). Xinhuanet. 8 April 2018. Archived from the original on April 9, 2018. Retrieved 27 April 2018.
- ^ "프로 수준급 인공지능 바둑 프로그램 '릴라(Leela)' 무료 공개" (in Korean). Baduk News. 23 February 2017. Archived from the original on 6 July 2018. Retrieved 27 April 2018.
- ^ "릴라의 출현과 온라인 대국의 비극적인 종말..." (in Korean). Cyberoro. 3 March 2017. Retrieved 27 April 2018.
- ^ a b "leela-zero". GitHub. Retrieved 27 April 2018.
- ^ a b c "Gian-Carlo Pascutto - The man behind LeelaZero". European Go Federation. 24 May 2018. Retrieved 27 May 2018.
- ^ "世界AI大赛决赛腾讯内战 凤凰2-1绝艺夺冠" (in Chinese). sina.com.cn. 28 April 2018. Retrieved 28 April 2018.
- ^ "How the Artificial-Intelligence Program AlphaZero Mastered Its Games". The New Yorker. 2018. Retrieved 31 December 2018.
- ^ Silver, Albert (26 April 2018). "Leela Chess Zero: AlphaZero for the PC". Chess News. Retrieved 11 June 2018.
- ^ Morandin, Francesco; Amato, Gianluca; Fantozzi, Marco; Gini, Rosa; Metta, Carlo; Parton, Maurizio (2019-11-26). "SAI: a Sensible Artificial Intelligence that plays with handicap and targets high scores in 9x9 Go (extended version)". arXiv:1905.10863 [cs.AI].
- ^ "Leela Zero". zero.sjeng.org. Retrieved 2024-09-21.
- ^ Silver, David; Schrittwieser, Julian; Simonyan, Karen; Antonoglou, Ioannis; Huang, Aja; Guez, Arthur; Hubert, Thomas; Baker, Lucas; Lai, Matthew; Bolton, Adrian; Chen, Yutian; Lillicrap, Timothy; Fan, Hui; Sifre, Laurent; Driessche, George van den; Graepel, Thore; Hassabis, Demis (19 October 2017). "Mastering the game of Go without human knowledge" (PDF). Nature. 550 (7676): 354–359. Bibcode:2017Natur.550..354S. doi:10.1038/nature24270. ISSN 0028-0836. PMID 29052630. S2CID 205261034. Archived (PDF) from the original on 18 July 2018. Retrieved 2 September 2019.
- ^ Du, Haoxing (2023-03-01). "Inside the mind of a superhuman Go model: How does Leela Zero read ladders?".