AlphaZero: DeepMind’s AI Works Smarter, not Harder
4:26

AlphaZero: DeepMind’s AI Works Smarter, not Harder

Two Minute Papers 27.02.2019 97 365 просмотров 2 718 лайков

Machine-readable: Markdown · JSON API · Site index

Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
Errata: regarding the comment on the rules - the AI has no built-in domain knowledge but the basic rules of the game. 📝 The paper "AlphaZero: Shedding new light on the grand games of chess, shogi and Go" is available here: https://deepmind.com/blog/alphazero-shedding-new-light-grand-games-chess-shogi-and-go/ Kasparov’s editorial: http://science.sciencemag.org/content/362/6419/1087 ❤️ Pick up cool perks on our Patreon page: https://www.patreon.com/TwoMinutePapers 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: 313V, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Anthony Vdovitchenko, Brian Gilman, Christian Ahlin, Christoph Jadanowski, Claudio Fernandes, Dennis Abts, Eric Haddad, Eric Martel, Evan Breznyik, Geronimo Moralez, Jason Rollins, Javier Bustamante, John De Witt, Kaiesh Vohra, Kasia Hayden, Kjartan Olason, Lorin Atzberger, Marcin Dukaczewski, Marten Rauschenberg, Maurits van Mastrigt, Michael Albrecht, Michael Jensen, Morten Punnerud Engelstad, Nader Shakerin, Owen Campbell-Moore, Owen Skarpness, Raul Araújo da Silva, Richard Reis, Rob Rowe, Robin Graham, Ryan Monsurate, Shawn Azman, Steef, Steve Messina, Sunil Kim, Thomas Krcmar, Torsten Reil, Zach Boldyga, Zach Doty. https://www.patreon.com/TwoMinutePapers Splash screen/thumbnail design: Felícia Fehér - http://felicia.hu Károly Zsolnai-Fehér's links: Facebook: https://www.facebook.com/TwoMinutePapers/ Twitter: https://twitter.com/karoly_zsolnai Web: https://cg.tuwien.ac.at/~zsolnai/ #DeepMind #AlphaZero

Оглавление (2 сегментов)

<Untitled Chapter 1>

dear fellow Scholars this is 2minute papers with K finally I have been waiting for quite a while to cover this amazing paper which is about Alpha zero we have talked

What is Alpha 0?

about Alpha zero before this is an AI that is able to play chess go and shogi or in other words Japanese chess on a remarkably high level I will immediately start out by uttering the main point of this work the point of alpha zero is not to solve chess or any of these games its main point is to show that a general AI can be created that can perform on a superhuman level on not one but several different tasks at the same time let's have a look at this image where you see a small part of the evaluation of alpha Zero versus stockfish an amazing open-source chess engine which has been consistently at or around the top computer chess players for many years now stockfish has an ELO rating of over 3,200 which means that it has a win rate of over 90% against the best human players in the world now interestingly comparing these algorithms is nowhere near as easy as it sounds this sounds curious so why is that for instance it is not enough to pit the two algorithms against each other and see who ends up winning it matters what version of stockfish is used how many positions are the machines allowed to evaluate how much thinking time they are allowed the size of hash tables the hardware being used the number of threads being used and so on from the side of the chess Community these are the details that matter however from the side of the AI researcher what matters most is to create a general algorithm that can play several different games on a superhuman level with this constraint it would really be a miracle if Alpha zero were able to even put up a good fight against Dogfish so what happened Alpha zero played a lot of games that ended up as draws against stockfish and not only that but whenever there was a winner it was almost always Alpha zero insanity and what is quite remarkable is that Alpha zero has only trained for 4 to 7 hours only through selfplay comparatively the development of the current version of stockfish took more than 10 years you can see how reliably this AI can be trained the blue lines show the results of several training runs and they all converge to the same result with only a tiny bit of deviation Alpha zero is also not a Brute Force algorithm as it evaluates fewer positions per second than stockfish Kasparov put it really well in his article where he said that Alpha zero Works smarter and not harder than previous techniques even Magnus Carson Chessmaster extraordinaire said in an interview that during his games he often thinks what would Alpha zero do in this case which I found to be quite remarkable Kasparov also had many good things to say about the new Alpha zero in a let's say very Kasper ofes Manner and also note that the key point is not whether the current version of stockfish or the one from two months ago was used the key point is that stockfish is a brilliant chess engine but it is not able to play go or any game other than chess this is the main contribution that deep mind was looking for with this work this AI can Master three games at once and a few more papers down the line it may be able to master any perfect information game oh my goodness what a time to be alive we have only scratched the surface in this video this was only a taste of the paper the evaluation section in the paper is out of this world so make sure to have a look in the video description and I am convinced that nearly any questions one can possibly think of is addressed there I also link to kasparov's editorial on this topic it is short and very readable give it a go I hope this little taste of alpha zero inspires you to go out there and explore yourself this is the main message of this series let me know in the comments what you think or if you found some cool other things related to Alpha Zero thanks for watching and for your generous support and I see you next time

Другие видео автора — Two Minute Papers

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Дайджест Экстрактов

Лучшие методички за неделю — каждый понедельник