Google AI Achieves “Alien” Superhuman Mastery of Chess and Go in Mere Hours – InApps 2022

Main Contents:

Google AI Achieves “Alien” Superhuman Mastery of Chess and Go in Mere Hours – InApps is an article under the topic Software Development Many of you are most interested in today !! Today, let’s InApps.net learn Google AI Achieves “Alien” Superhuman Mastery of Chess and Go in Mere Hours – InApps in today’s post !

Human-Like Search

In addition, unlike powerful, specialized game programs like Stockfish and Elmo, which are capable of searching through 70 million moves and 35 million moves per second respectively, AlphaZero’s deep learning neural network only searches through 80,000 moves per second. While this may seem like a disadvantage at first glance, but AlphaZero makes up for this lower number of evaluations by using its deep neural network to concentrate on the most promising sequence of moves — demonstrating what experts might characterize as a more human-like approach to play and discovery. The team’s findings also suggest that it’s this scaled-down approach that is most scalable and time-efficient, in contrast to the use of hand-tuned, resource-intensive, so-called “Type A” evaluation methods found in brute force or alpha-beta searches.

“It’s a remarkable achievement, even if we should have expected it after AlphaGo,” former world chess champion Garry Kasparov told Chess.com. “It approaches the ‘Type B,’ human-like approach to machine chess dreamt of by [mathematician and information theorist] Claude Shannon and [mathematician and computer scientist] Alan Turing, instead of brute force.”

Other chess greats echoed similar sentiments about AlphaZero’s unconventional approach to the game, as Danish grandmaster Peter Heine Nielsen quipped in a BBC interview: “I always wondered how it would be if a superior species landed on earth and showed us how they played chess. Now I know.”

The frequency of openings over time employed by AlphaZero during its self-training and learning phase.

Criticisms

In a series of twelve 100-game chess matches against Stockfish 8, where each program was allotted one minute of processing time per move, AlphaZero was able to win 290 games, drawing 886 and losing only 24.

However, there were criticisms from other experts that the results would have been more closely matched if Stockfish had access to a database of chess openings, as it is designed to work best in this way. According to other observers, the computational power allocated to Stockfish during the tests was also relatively suboptimal, and that the Stockfish program used in the trials was an older, less powerful version, and not designed for rigidly fixed time controls. Similar criticisms were raised in AlphaZero’s matches against Elmo.

As Stockfish’s developer, Tord Romstad points out: “The games were played at a fixed time of one minute per move, which means that Stockfish has no use of its time management heuristics. A lot of effort has been put into making Stockfish identify critical points in the game and decide when to spend some extra time on a move; at a fixed time per move, the strength will suffer significantly.”

Tabula Rasa Learning

As with its predecessor, AlphaGo Zero, what’s most extraordinary is the fact that AlphaZero uses a generalized, “tabula rasa” reinforcement learning algorithm that starts from scratch. There is no human input, no hand-tuning, nothing built in, aside from giving it the operative fundamentals.

From there, no matter what type of game it was, AlphaZero is able to figure out and “discover” conventional — as well as new and innovative — strategies to a problem on its own, from playing out random scenarios against itself. The broader significance here is that this powerful, generalized algorithm could someday be applied to more complex and intractable problems, potentially generating unexpectedly creative and original solutions in a variety of fields, such as discovering new drugs or materials.

“We have always assumed that chess required too much empirical knowledge for a machine to play so well from scratch, with no human knowledge added at all,” added Kasparov. “Of course I’ll be fascinated to see what we can learn about chess from AlphaZero, since that is the great promise of machine learning in general — machines figuring out rules that humans cannot detect. But obviously, the implications are wonderful far beyond chess and other games. The ability of a machine to replicate and surpass centuries of human knowledge in complex closed systems is a world-changing tool.”

Read the rest of the paper here, or download the AlphaZero-Stockfish .pgn game files here.

Images: Luiz Hanfilaque, DeepMind.

Source: InApps.net

Rate this post

Anh Hoang

Anh Hoang is Head of SEO Optimization at InApps Technology, ensuring that the message and research of InApps Technology reach the most people possible while adhering to our strict journalistic standards of excellence and integrity.