Microsoft’s New AI: The Selfies Of The Future! 🤳

6:16

Microsoft’s New AI: The Selfies Of The Future! 🤳

Two Minute Papers 28.08.2022 104 224 просмотров 4 427 лайков

Machine-readable: Markdown · JSON API · Site index

Смотреть на YouTube

Поделиться Telegram VK Бот

Транскрипт Скачать .md

Анализ с AI

Описание видео

❤️ Check out the Gradient Dissent podcast by Weights & Biases: http://wandb.me/gd 📝 The paper "GRAM-HD: 3D-Consistent Image Generation at High Resolution with Generative Radiance Manifolds" is available here: https://jeffreyxiang.github.io/GRAM-HD/ ❤️ Watch these videos in early access on our Patreon page or join us here on YouTube: - https://www.patreon.com/TwoMinutePapers - https://www.youtube.com/channel/UCbfYPyITQ-7l4upoX8nvctg/join 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Aleksandr Mashrabov, Alex Balfanz, Alex Haro, Andrew Melnychuk, Benji Rabhan, Bryan Learn, B Shang, Christian Ahlin, Eric Martel, Geronimo Moralez, Gordon Child, Ivo Galic, Jace O'Brien, Jack Lukic, John Le, Jonas, Jonathan, Kenneth Davis, Klaus Busse, Kyle Davis, Lorin Atzberger, Lukas Biewald, Matthew Allen Fisher, Michael Albrecht, Michael Tedder, Nevin Spoljaric, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Rajarshi Nigam, Ramsey Elbasheer, Steef, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi. If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers Thumbnail background design: Felícia Zsolnai-Fehér - http://felicia.hu Károly Zsolnai-Fehér's links: Instagram: https://www.instagram.com/twominutepapers/ Twitter: https://twitter.com/twominutepapers Web: https://cg.tuwien.ac.at/~zsolnai/

Оглавление (2 сегментов)

Segment 1 (00:00 - 05:00)

Dear Fellow Scholars, this is Two Minute Papers with Dr. Károly Zsolnai-Fehér. Today we are going to use an AI to take a collection of photos, and hopefully create a digital human out of them. However, not so fast! For instance, have a look at NVIDIA’s amazing face generator AI. As you see, it can generate beautiful results, and they even show us the illusion of these characters moving around, however, this does not have a strong concept of 3D information. These are 2D images, and 3D consistency is not that great. What is that? Well, look. This person just one frame away is not exactly the same person. For instance, the hair moves around. This means that these images are not art directable, or at least, not easily. As you see here, it has improved a great deal over two years and one more research paper down the line, it’s still not quite there. So, fine details, checkmark, 3D consistency, not quite there. So let’s have a look at solutions that have more 3D consistency. To be able to do that, we can take a collection of photos like these, and magically, create a video where we can fly through these photos. It is really crazy, because this is possible today, for instance, here is NVIDIAs method that can be trained to perform this in a matter of seconds. This is remarkable, especially that the input is only a handful of photos. Some information is given about the scene, but this is really not much. So, can we have a solution with 3D consistency? Yes, indeed, for these, 3D consistency, checkmark. However, the amount of fine details in these outputs is not that great. Do you see the pattern here? Depending on which previous method we use, we either get tons of detail, but no 3D consistency, or we can get our highly coveted 3D consistency, but then, the details are gone. So, is the dream dead? No digital humans for us? Well, don’t despair, and have a look at this new technique that promises one megapixel images that are also consistent. Details and consistency at the same time! When I first saw this paper, I said I will believe it when I see it, so, let’s have a look together. This is StyleNERF, a previous technique, and we see the problems that we now expect - the hair is not consistent, the earring is flickering a great deal, and there are other issues too. So, are the authors of the new paper claiming that they can solve all this? Well, now, hold on to your papers, and have a look at this. This is the new technique. Oh my goodness. The earring, facial features and the hair stay still as we rotate them, and it indeed shows the new angles correctly. I love it! Let’s examine this phenomenon in a little more detail. Look at the hair here with StyleNERF. It looks decent, but it’s not quite there. And I really wonder why that is. Let’s zoom in and have a closer look. Oh yes, upon closer inspection, this hair is a bit of a mess. And, with the new technique, now that is what I call consistency across the video frames. Smooth hair strands everywhere. So good! And what I absolutely loved about this new work is that it outperforms this previous technique, which is from how many years ago? Well, if you have been holding on to your papers, now squeeze that paper, because this work is from not even one year ago, just from 8 months ago. And you already see a meaningful improvement over that. That is insanity. Now, not even this technique is perfect. I don’t know for sure if the hair consistency is perfect here and we are dealing with video compression artifacts, or whether this is still not a 100% there, but this truly is a great step forward. Also, here, you see that the images are still not as detailed as the photos, but this seems to be a roadblock to me that is much easier to solve than 3D consistency. My impression is that with this, the most difficult part of the task is already done, and just one or

Segment 2 (05:00 - 06:00)

two papers down the line, and I am sure we will be seeing even more realistic virtual humans. So, can we enter into a virtual world as a digital human from just a collection of photos? Oh yes! Time to meet our beloved ones from afar, or meet new people, and play some games together. What a time to be alive! So, does this get your mind going? What would you use this for? Let me know in the comments below! Thanks for watching and for your generous support, and I'll see you next time!

Другие видео автора — Two Minute Papers

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Лучшие методички за неделю — каждый понедельник