Google’s New AI: DALL-E, But Now In 3D! 🤯

5:07

Two Minute Papers · 19.10.2022 · 134,893 views · 6,172 likes

Video description

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.com/papers

📝 The paper "DreamFusion: Text-to-3D using 2D Diffusion" is available here: https://dreamfusion3d.github.io/
Unofficial open source implementation: https://github.com/ashawkey/stable-dreamfusion
Interpolation: https://twitter.com/xsteenbrugge/status/1558508866463219712
Full video of interpolation: https://www.youtube.com/watch?v=Bo3VZCjDhGI

❤️ Watch these videos in early access on our Patreon page or join us here on YouTube:
- https://www.patreon.com/TwoMinutePapers
- https://www.youtube.com/channel/UCbfYPyITQ-7l4upoX8nvctg/join

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Aleksandr Mashrabov, Alex Balfanz, Alex Haro, Andrew Melnychuk, Benji Rabhan, Bryan Learn, B Shang, Christian Ahlin, Eric Martel, Geronimo Moralez, Gordon Child, Jace O'Brien, Jack Lukic, John Le, Jonas, Jonathan, Kenneth Davis, Klaus Busse, Kyle Davis, Lorin Atzberger, Lukas Biewald, Luke Dominique Warner, Matthew Allen Fisher, Matthew Valle, Michael Albrecht, Michael Tedder, Nevin Spoljaric, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Rajarshi Nigam, Ramsey Elbasheer, Steef, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi.
If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers

Thumbnail background design: Felícia Zsolnai-Fehér - http://felicia.hu

Károly Zsolnai-Fehér's links:
Instagram: https://www.instagram.com/twominutepapers/
Twitter: https://twitter.com/twominutepapers
Web: https://cg.tuwien.ac.at/~zsolnai/

Contents (2 segments)

Segment 1 (00:00 - 05:00)

Dear Fellow Scholars, this is Two Minute Papers with Dr. Károly Zsolnai-Fehér. Today we are going to see how this new AI is able to take a piece of text from us, anything we wish, be it a squirrel dressed like the king of England, a car made out of sushi, or a humanoid robot using a laptop, and, magically, it creates not an image like previous techniques, but get this, a full 3D model of it. Wow. This is absolutely amazing. An AI that can not only create images, but create 3D assets? Yes, indeed, the result is a full 3D model that we can rotate around and even use in our virtual worlds.

So, let’s give it a really hard time and see together what it is capable of. For instance, OpenAI’s earlier DALL-E text-to-image AI was capable of looking at a bunch of images of koalas, and separately, motorcycles, and it started to understand the concept of both, and it was able to combine the two together into a completely new image. That is, a koala riding a motorcycle. So, let’s see if this new method is also capable of creating new concepts by building on previous knowledge. Well, let’s see… oh yes! Here is a tiger wearing sunglasses and a leather jacket, and most importantly, riding a motorcycle. Tigers and motorcycles are well-understood concepts, of course, the neural network had plenty of these to look at in its training set, but combining the two concepts together, now that is a hint of creativity. Creativity in a machine. Loving it.

What I also loved about this work is that it makes it so easy to iterate on our ideas. For instance, first, we can start experimenting with a real squirrel, or, if we did not like it, we can quickly ask for a wooden carving, or even a metal sculpture of it. Then, we can start dressing it up, and make it do anything we want.
And sometimes, the results are nearly good enough to be used as-is even in an animated movie or in virtual worlds, and even in the worse cases, I think these could be used as a starting point for an artist to continue from. That would save a ton of time and energy in a lot of projects! And that is huge. Just consider all the miraculous things artists are using the DALL-E 2 text-to-image AI and Stable Diffusion for: illustrating novels, texture synthesis, product design, weaving multiple images together to create these crazy movies, you name it. And now, I wonder what unexpected uses will arise from this being possible for 3D models? Do you have some ideas? Let me know in the comments below!

And just imagine what this will be capable of just a couple more papers down the line. For instance, the original DALL-E AI was capable of this, and then, just a year later, this became possible.

So, how does this black magic work? Well, the cool thing is that this is also a diffusion-based technique, which means that similarly to the text-to-image AIs, it starts out from a piece of noise, and refines this noise over time to resemble our input text a little more. But this time, this diffusion process is running in higher dimensions, thus, the result is not a 2D image, but a full 3D model. So, from now on, the limit in creating 3D worlds is not our artistic skill, the limit is only our imagination. What a time to be alive!
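
Editor's illustration (not part of the spoken transcript): the "start from noise and refine it over time" idea can be sketched in a few lines of Python. This is a toy under loud assumptions, not the method from the DreamFusion paper. The function names `refine_from_noise` and `distance`, the step size `0.2`, the noise scale `0.1`, and the use of a fixed `target` vector as a stand-in for a learned, text-conditioned denoiser are all hypothetical. A real system would replace the "perfect guess" line with a trained neural network, and the paper's title suggests it relies on a 2D diffusion model, which this sketch does not attempt.

```python
import random


def distance(a, b):
    """Euclidean distance between two equal-length lists of floats."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b)) ** 0.5


def refine_from_noise(target, steps=50, seed=0):
    """Toy refinement loop: start from random noise and nudge it toward target.

    Returns the final vector and the distance to the target after each step.
    """
    rng = random.Random(seed)
    # Start from pure Gaussian noise, one value per target dimension.
    x = [rng.gauss(0.0, 1.0) for _ in target]
    history = []
    for step in range(steps):
        # Noise level shrinks linearly and reaches 0 on the last step.
        noise_level = 1.0 - (step + 1) / steps
        # ASSUMPTION: a perfect guess stands in for a learned denoiser.
        predicted_clean = target
        # Move part of the way toward the guess, then re-inject a little noise.
        x = [
            xi + 0.2 * (pi - xi) + 0.1 * noise_level * rng.gauss(0.0, 1.0)
            for xi, pi in zip(x, predicted_clean)
        ]
        history.append(distance(x, target))
    return x, history


if __name__ == "__main__":
    final, history = refine_from_noise([1.0, -2.0, 0.5])
    print("distance after first step:", round(history[0], 3))
    print("distance after last step: ", round(history[-1], 3))
```

Running the script prints the distance to the target after the first and last steps; the second number should be much smaller, which is the "refines this noise over time" behaviour in miniature.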

Segment 2 (05:00 - 05:00)

Thanks for watching and for your generous support, and I'll see you next time!
