Google’s New AI: DALL-E, But Now In 3D! 🤯
5:07

Google’s New AI: DALL-E, But Now In 3D! 🤯

Two Minute Papers 19.10.2022 134 893 просмотров 6 172 лайков

Machine-readable: Markdown · JSON API · Site index

Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.com/papers 📝 The paper "DreamFusion: Text-to-3D using 2D Diffusion" is available here: https://dreamfusion3d.github.io/ Unofficial open source implementation: https://github.com/ashawkey/stable-dreamfusion Interpolation: https://twitter.com/xsteenbrugge/status/1558508866463219712 Full video of interpolation: https://www.youtube.com/watch?v=Bo3VZCjDhGI ❤️ Watch these videos in early access on our Patreon page or join us here on YouTube: - https://www.patreon.com/TwoMinutePapers - https://www.youtube.com/channel/UCbfYPyITQ-7l4upoX8nvctg/join 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Aleksandr Mashrabov, Alex Balfanz, Alex Haro, Andrew Melnychuk, Benji Rabhan, Bryan Learn, B Shang, Christian Ahlin, Eric Martel, Geronimo Moralez, Gordon Child, Jace O'Brien, Jack Lukic, John Le, Jonas, Jonathan, Kenneth Davis, Klaus Busse, Kyle Davis, Lorin Atzberger, Lukas Biewald, Luke Dominique Warner, Matthew Allen Fisher, Matthew Valle, Michael Albrecht, Michael Tedder, Nevin Spoljaric, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Rajarshi Nigam, Ramsey Elbasheer, Steef, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi. If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers Thumbnail background design: Felícia Zsolnai-Fehér - http://felicia.hu Károly Zsolnai-Fehér's links: Instagram: https://www.instagram.com/twominutepapers/ Twitter: https://twitter.com/twominutepapers Web: https://cg.tuwien.ac.at/~zsolnai/

Оглавление (2 сегментов)

Segment 1 (00:00 - 05:00)

Dear Fellow Scholars, this is Two Minute  Papers with Dr. Károly Zsolnai-Fehér. Today we are going to see how this new AI  is able to take a piece of text from us,   anything we wish, be it a squirrel  dressed like the king of England,   a car made out of sushi, or a humanoid  robot using a laptop, and, magically,   it creates not an image like previous techniques,  but get this, a full 3D model of it. Wow. This is absolutely amazing. An AI that can not  only create images, but create 3D assets? Yes,   indeed, the result is a full 3D model that  we can rotate around and even use in our   virtual worlds. So, let’s give it a really hard  time and see together what it is capable of. For instance, OpenAI’s earlier  DALL-E text to image AI was capable   of looking at a bunch of images of koalas, and  separately, motorcycles,   and it started to understand the concept of  both, and it was be able to combine the two   together into a completely new image.   That is, a koala, riding a motorcycle. So, let’s see if this new method is also capable  of creating new concepts by building on previous   knowledge. Well, let’s see…oh yes! Here is a  tiger wearing sunglasses and a leather jacket,   and most importantly, riding a motorcycle. Tigers  and motorcycles are well understood concepts,   of course, the neural network had plenty of these  to look at in its training set, but combining the   two concepts together, now that is a hint of  creativity. Creativity in a machine. Loving it. What I also loved about this work is that it makes  it so easy to iterate on our ideas. For instance,   first, we can start experimenting with a  real squirrel, or, if we did not like it,   we can quickly ask for a wooden carving,  or even a metal sculpture of it. Then,   we can start dressing it up,  and make it do anything we want. And sometimes, the results are nearly good enough  to be used as-is even in an animation movie   or in virtual worlds, or even in the worse cases,  I think these could be used as a starting point   for an artist to continue from. That would save  a ton of time and energy in a lot of projects! And that is huge. Just consider all the  miraculous things artists are using the   DALL-E 2 text to image AI and Stable Diffusion  for, illustrating novels, texture synthesis,   product design, weaving multiple images  together to create these crazy movies,   you name it. And now, I wonder what unexpected  uses will arise from this being possible for   3D models? Do you have some ideas?   Let me know in the comments below! And just imagine what this will be capable of just  a couple more papers down the line. For instance,   the original DALL-E AI was capable of this, and  then, just a year later, this became possible. So, how does this black magic work? Well, the  cool thing is that this is also a diffusion-based   technique, which means that similarly to the text  to image AIs, it starts out from a piece of noise,   and refines this noise over time to  resemble our input text a little more.    But this time, this diffusion process  is running in higher dimensions, thus,   the result is not a 2D image, but a full 3D  model. So, from now on, the limit in creating   3D worlds is not our artistic skill, the limit  is only our imagination. What a time to be alive!

Segment 2 (05:00 - 05:00)

Thanks for watching and for your generous  support, and I'll see you next time!

Другие видео автора — Two Minute Papers

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Дайджест Экстрактов

Лучшие методички за неделю — каждый понедельник