DeepMind’s AI Athletes Play In The Real World!

5:51

DeepMind’s AI Athletes Play In The Real World!

Two Minute Papers 12.05.2023 82 410 просмотров 5 046 лайков

Machine-readable: Markdown · JSON API · Site index

Смотреть на YouTube

Поделиться Telegram VK Бот

Транскрипт Скачать .md

Анализ с AI

Описание видео

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.com/papers ❤️ Get more than $50 off from an upcoming W&B event in San Francisco! - https://shorturl.at/brtIQ 📝 The paper "Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning" is available here: https://sites.google.com/view/op3-soccer My latest paper on simulations that look almost like reality is available for free here: https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations: https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Aleksandr Mashrabov, Alex Balfanz, Alex Haro, Andrew Melnychuk, Benji Rabhan, Bryan Learn, B Shang, Christian Ahlin, Eric Martel, Geronimo Moralez, Gordon Child, Jace O'Brien, Jack Lukic, John Le, Jonathan, Kenneth Davis, Klaus Busse, Kyle Davis, Lorin Atzberger, Lukas Biewald, Martin, Matthew Valle, Michael Albrecht, Michael Tedder, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Rajarshi Nigam, Ramsey Elbasheer, Richard Sundvall, Steef, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi. If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers Thumbnail background design: Felícia Zsolnai-Fehér - http://felicia.hu Károly Zsolnai-Fehér's links: Twitter: https://twitter.com/twominutepapers Web: https://cg.tuwien.ac.at/~zsolnai/

Оглавление (2 сегментов)

Segment 1 (00:00 - 05:00)

Dear Fellow Scholars, this is Two Minute Papers with Dr. Károly Zsolnai-Fehér. Dr. Papers. Today you are going to see an insane paper. One of the most fun papers I’ve read in a while. Earlier, scientists at DeepMind were looking to teach a bunch of AI agents to play football/soccer. Unfortunately, they started out like this. Not very impressive. But then, they trained them for 5 years. What? 5 years? Well, their training took what felt like 5-years for them, but only 3 days for us, because these are not real robots, and thus, on a fast computer, they can be simulated faster than real time. So what happened in that time? This happened! Whoa! Now they can not only play amazingly well, but, they can even think a few steps ahead, and they are even anticipating the behavior of their teammates and position themselves accordingly. The saves they are capable of are also outstanding. And as we don’t really have a referee around, they also beat up each other. And when I say that, I really mean it. It’s bad. Real bad. Also, the only thing they like more than throwing each other is throwing tantrums. And it even gets worse. This lovely chap almost seems like someone who is faking an injury, but then, look! A miracle happened! And today, scientists at DeepMind brought this work to the next level. How? In a different paper, scientists at OpenAI simulated a robot hand in a video game world. Then, the things it learned in this simulation could be brought into life by deploying it into the real world. So train in a simulation, and then play in the real world. Yes, that is fantastic, but, why am I saying this? Oh wait, is is possible that the same happened here too? Real robots playing football? Yes! That is exactly what happened. So with that, let the insanity begin. The new environment contains 2 real robots with 20 actuated joints each. So what did they learn first? Well, first, they learned to get up, and also, score goals. All this in a simulated robot body. That is very impressive already, but wait, because that’s just the start. Now it is time for random perturbations. What does that mean? It means that forces are randomly exerted on the AI to make sure it can recover from an impact. And, are you thinking what I am thinking? Oh yes, now let’s go to the real robots, and let’s test that impact. Aww! That is not very nice. But then…wow! Look at that! What it had learned in a simulation now really applies to a real life robot. It can get hit, and it is able to get up over and over again. My goodness. But, there is a problem. What is the problem? Well, their knees get injured all the time. So what did scientists at DeepMind do with that? Well, hold on to your papers Fellow Scholars, because they taught the robot to get up and move in a way that is not too hard on the knees. Yes, that’s right! And thus, they didn’t suffer that many knee injuries. What a time to be alive! So with that, finally, they can now run, and score goals. But wait, playing a match and winning takes more than just running and kicking. They need to have a variety of other skills and combine them in order to win. So, did they? Oh yes they did! They can now control the ball, turn with it, block the ball with their bodies, shoot accurately, even in the case of a moving ball, which is even harder. And that is the miracle of research papers: such incredible progress can happen just one more paper down the line. And, yes, I know, I know. You are asking the question, okay, but Károly, did they beat up each other here too like in the previous paper? And the answer is no. You see, they are penalized for getting too close to each other, so luckily that did not happen. But this kind of improvement in just one paper, I am wondering what these little AIs will be capable of two more papers down the line? Well, what do you think? Let me know in the comments below!

Segment 2 (05:00 - 05:00)

Thanks for watching and for your generous support, and I'll see you next time!

Другие видео автора — Two Minute Papers

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Лучшие методички за неделю — каждый понедельник