# This AI Learned Boxing…With Serious Knockout Power! 🥊

## Метаданные

- **Канал:** Two Minute Papers
- **YouTube:** https://www.youtube.com/watch?v=SsJ_AusntiU
- **Дата:** 29.08.2021
- **Длительность:** 6:02
- **Просмотры:** 4,429,983

## Описание

❤️ Check out Perceptilabs and sign up for a free demo here: https://www.perceptilabs.com/papers

❤️ Watch these videos in early access on our Patreon page or join us here on YouTube: 
- https://www.patreon.com/TwoMinutePapers
- https://www.youtube.com/channel/UCbfYPyITQ-7l4upoX8nvctg/join

📝 The paper "Control Strategies for Physically Simulated Characters Performing Two-player Competitive Sports" is available here:
https://research.fb.com/publications/control-strategies-for-physically-simulated-characters-performing-two-player-competitive-sports/

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Aleksandr Mashrabov, Alex Haro, Andrew Melnychuk, Angelos Evripiotis, Benji Rabhan, Bryan Learn, Christian Ahlin, Eric Haddad, Eric Martel, Gordon Child, Ivo Galic, Jace O'Brien, Javier Bustamante, John Le, Jonas, Kenneth Davis, Klaus Busse, Lorin Atzberger, Lukas Biewald, Matthew Allen Fisher, Mark Oates, Michael Albrecht, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Ramsey Elbasheer, Steef, Taras Bobrovytsky, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi.
If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers

Thumbnail background design: Felícia Zsolnai-Fehér - http://felicia.hu

00:00 Intro - You shall not pass!
00:49 Does nothing - still wins!
01:30 Boxing - but not so well
02:13 Learning is happening
02:39 After 250 million training steps
03:10 Drunkards no more!
03:29 Serious knockout power!
04:00 It works for fencing too
04:20 First Law of Papers
04:43 An important lesson

Károly Zsolnai-Fehér's links:
Instagram: https://www.instagram.com/twominutepapers/
Twitter: https://twitter.com/twominutepapers
Web: https://cg.tuwien.ac.at/~zsolnai/

## Содержание

### [0:00](https://www.youtube.com/watch?v=SsJ_AusntiU) Intro - You shall not pass!

Now, in an earlier work, we saw a few examples of AI agents playing two-player sports, for instance, this is the “You Shall Not Pass” game, where the red agent is trying to hold back the blue character and not let it cross the line. Here you see two regular AIs duking it out, sometimes the red wins, sometimes the blue is able to get through. Nothing too crazy here. Until…this happens. Look. What is happening? It seems that this agent started to do nothing…and still won. Not only that, but it suddenly started winning almost all the games.

### [0:49](https://www.youtube.com/watch?v=SsJ_AusntiU&t=49s) Does nothing - still wins!

How is this even possible? Well, what the agent did is perhaps the AI equivalent of hypnotizing the opponent, if you will. The more rigorous term for this is that it induces off-distribution activations in its opponent. This adversarial agent is really doing nothing, but that’s not enough - it is doing nothing in a way that reprograms its opponent to make mistakes and behave close to a completely randomly acting agent! Now, this new paper showcases AI agents that can learn boxing. The AI is asked to control these joint-actuated characters which are embedded in a physics

### [1:30](https://www.youtube.com/watch?v=SsJ_AusntiU&t=90s) Boxing - but not so well

simulation. Well, that is quite a challenge - look, for quite a while after 130 million steps of training, it cannot even hold it together. And, yes…these folks collapse. But this is not the good kind of hypnotic adversarial collapsing. I am afraid, this is just passing out without any particular benefits. That was quite a bit of training, and all this for nearly nothing. Right? Well, maybe…let’s see what they did after 200 million training steps. Look!

### [2:13](https://www.youtube.com/watch?v=SsJ_AusntiU&t=133s) Learning is happening

They can not only hold it together, but they have a little footwork going on, and can circle each other and try to take the middle of the ring. Improvements. Good. But this is not dancing practice, this is boxing. I would really like to see some boxing today and it doesn’t seem to happen. Until we wait for a little longer…which is 250 million training steps.

### [2:39](https://www.youtube.com/watch?v=SsJ_AusntiU&t=159s) After 250 million training steps

Now, is this boxing? Not quite, this is more like two drunkards trying to duke it out, where neither of them knows how to throw a real punch…but! Their gloves are starting to touch the opponent, and they start getting rewards for it. What does that mean for an intelligent agent? Well, it means that over time, it will learn to do that a little better. And hold on to your papers and see what they do after 420 million steps.

### [3:10](https://www.youtube.com/watch?v=SsJ_AusntiU&t=190s) Drunkards no more!

Oh wow! Look at that! I am seeing some punches, and not only that, but I also see some body and head movement to evade the punches, very cool. And if we keep going for longer, whoa!

### [3:29](https://www.youtube.com/watch?v=SsJ_AusntiU&t=209s) Serious knockout power!

These guys can fight! They now learned to perform feints, jabs, and have some proper knockout power too. And if you have been holding on to your papers, now, squeeze that paper, because all they looked at before starting the training was 90 seconds of motion capture data. This is a general framework that also works for fencing as well. Look! The agents learned to lunge, deflect, evade attacks, and more.

### [4:00](https://www.youtube.com/watch?v=SsJ_AusntiU&t=240s) It works for fencing too

Absolutely amazing. What a time to be alive! So, this was approximately a billion training steps, right. So how long did that take to compute? It took approximately a week. And, you know what’s coming.

### [4:20](https://www.youtube.com/watch?v=SsJ_AusntiU&t=260s) First Law of Papers

Of course, we invoke the First Law Of Papers, which says that research is a process. Do not look at where we are, will be two more papers down the line. And line, I bet this will be possible in a matter of hours. This is the part with the gorillas. It is also interesting that even though there were plenty of reasons to, the researchers

### [4:43](https://www.youtube.com/watch?v=SsJ_AusntiU&t=283s) An important lesson

didn’t quit after a 130 million steps. They just kept on going, and eventually, succeeded. Especially in the presence of not so trivial training curves where the blocking of the other player can worsen the performance, and it’s often not as easy to tell where we are. That is a great life lesson right there. Thanks for watching and for your generous support, and I'll see you next time!

---
*Источник: https://ekstraktznaniy.ru/video/13835*