DeepMind’s AI Athletes Play In The Real World!
5:51

DeepMind’s AI Athletes Play In The Real World!

Two Minute Papers 12.05.2023 82 410 просмотров 5 046 лайков

Machine-readable: Markdown · JSON API · Site index

Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.com/papers ❤️ Get more than $50 off from an upcoming W&B event in San Francisco! - https://shorturl.at/brtIQ 📝 The paper "Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning" is available here: https://sites.google.com/view/op3-soccer My latest paper on simulations that look almost like reality is available for free here: https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations: https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Aleksandr Mashrabov, Alex Balfanz, Alex Haro, Andrew Melnychuk, Benji Rabhan, Bryan Learn, B Shang, Christian Ahlin, Eric Martel, Geronimo Moralez, Gordon Child, Jace O'Brien, Jack Lukic, John Le, Jonathan, Kenneth Davis, Klaus Busse, Kyle Davis, Lorin Atzberger, Lukas Biewald, Martin, Matthew Valle, Michael Albrecht, Michael Tedder, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Rajarshi Nigam, Ramsey Elbasheer, Richard Sundvall, Steef, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi. If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers Thumbnail background design: Felícia Zsolnai-Fehér - http://felicia.hu Károly Zsolnai-Fehér's links: Twitter: https://twitter.com/twominutepapers Web: https://cg.tuwien.ac.at/~zsolnai/

Оглавление (2 сегментов)

Segment 1 (00:00 - 05:00)

Dear Fellow Scholars, this is Two Minute Papers  with Dr. Károly Zsolnai-Fehér. Dr. Papers. Today you are going to see an insane paper. One of  the most fun papers I’ve read in a while. Earlier,   scientists at DeepMind were looking to teach  a bunch of AI agents to play football/soccer.    Unfortunately, they started out like this.   Not very impressive. But then, they trained   them for 5 years. What? 5 years? Well, their  training took what felt like 5-years for them,   but only 3 days for us, because these are not  real robots, and thus, on a fast computer,   they can be simulated faster than real  time. So what happened in that time? This happened! Whoa! Now they can  not only play amazingly well, but,   they can even think a few steps ahead,  and they are even anticipating the   behavior of their teammates and  position themselves accordingly. The saves they are capable of are  also outstanding. And as we don’t   really have a referee around, they also  beat up each other. And when I say that,   I really mean it. It’s bad. Real bad. Also,  the only thing they like more than throwing   each other is throwing tantrums. And  it even gets worse. This lovely chap   almost seems like someone who is faking an  injury, but then, look! A miracle happened! And today, scientists at DeepMind  brought this work to the next level. How? In a different paper, scientists at OpenAI  simulated a robot hand in a video game world.    Then, the things it learned in this simulation  could be brought into life by deploying it into   the real world. So train in a simulation, and then  play in the real world. Yes, that is fantastic,   but, why am I saying this? Oh wait, is is  possible that the same happened here too? Real   robots playing football? Yes! That is exactly what  happened. So with that, let the insanity begin. The new environment contains 2 real robots with  20 actuated joints each. So what did they learn   first? Well, first, they learned to get up, and  also, score goals. All this in a simulated robot   body. That is very impressive already, but wait,  because that’s just the start. Now it is time for   random perturbations. What does that mean? It  means that forces are randomly exerted on the   AI to make sure it can recover from an impact.   And, are you thinking what I am thinking? Oh yes,   now let’s go to the real robots, and let’s test  that impact. Aww! That is not very nice. But   then…wow! Look at that! What it had learned  in a simulation now really applies to a real   life robot. It can get hit, and it is able to  get up over and over again. My goodness. But,   there is a problem. What is the problem? Well,  their knees get injured all the time. So what   did scientists at DeepMind do with that?   Well, hold on to your papers Fellow Scholars,   because they taught the robot to get up and move  in a way that is not too hard on the knees. Yes,   that’s right! And thus, they didn’t suffer that  many knee injuries. What a time to be alive! So with that, finally, they can now run, and  score goals. But wait, playing a match and   winning takes more than just running and kicking.   They need to have a variety of other skills and   combine them in order to win. So, did they? Oh  yes they did! They can now control the ball,   turn with it, block the ball with their  bodies, shoot accurately, even in the   case of a moving ball, which is even harder.   And that is the miracle of research papers:   such incredible progress can happen  just one more paper down the line. And, yes, I know, I know. You are  asking the question, okay, but Károly,   did they beat up each other here too like in the  previous paper? And the answer is no. You see,   they are penalized for getting too close to  each other, so luckily that did not happen.    But this kind of improvement in just one paper, I  am wondering what these little AIs will be capable   of two more papers down the line? Well, what do  you think? Let me know in the comments below!

Segment 2 (05:00 - 05:00)

Thanks for watching and for your generous  support, and I'll see you next time!

Другие видео автора — Two Minute Papers

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Дайджест Экстрактов

Лучшие методички за неделю — каждый понедельник