NVIDIA’s New AI: Wow, 8x Better Text To 3D!

4:27

Two Minute Papers · 03.11.2023 · 66,346 views · 2,741 likes

Video description

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.me/papers

📝 The paper "Magic3D: High-Resolution Text-to-3D Content Creation" is available here: https://research.nvidia.com/labs/dir/magic3d/

Will be available in Picasso: https://www.nvidia.com/en-us/gpu-cloud/picasso/

Get notified: https://developer.nvidia.com/picasso

My latest paper on simulations that look almost like reality is available for free here: https://rdcu.be/cWPfD

Or this is the orig. Nature Physics link with clickable citations: https://www.nature.com/articles/s41567-022-01788-5

BG3 cat footage: https://www.youtube.com/watch?v=qbCMkjob-oE

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Aleksandr Mashrabov, Alex Balfanz, Alex Haro, Andrew Melnychuk, Benji Rabhan, Bret Brizzee, Bryan Learn, B Shang, Christian Ahlin, Gaston Ingaramo, Geronimo Moralez, Gordon Child, Jace O'Brien, Jack Lukic, John Le, Kenneth Davis, Klaus Busse, Kyle Davis, Lukas Biewald, Martin, Matthew Valle, Michael Albrecht, Michael Tedder, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Rajarshi Nigam, Ramsey Elbasheer, Richard Sundvall, Steef, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi.

If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers

Thumbnail background design: Felícia Zsolnai-Fehér - http://felicia.hu

Károly Zsolnai-Fehér's research works: https://cg.tuwien.ac.at/~zsolnai/

#nvidia

Table of contents (4 segments)

<Untitled Chapter 1>

I am very excited about this paper. So what does it do? Magic, according to the authors! That is, not text to image, where we write a text prompt and get a beautiful photo or painting, but something else: text to 3D. Hmm…now that would be fantastic, I would say it qualifies as magic.

Text to 3D

We write something and out comes 3D geometry that we can put into a video game or any kind of virtual world. Teleconferencing. Anything you wish. Dear Fellow Scholars, this is Two Minute Papers with Dr. Károly Zsolnai-Fehér.

Now, wait a second. The previous technique, DreamFusion, was capable of doing that. And that was a paper that was published just a year ago. Now, yes, but that one has two problems. Problem number one, it takes a long time, and problem number two, we still get coarse results. Now, does this paper really have the magic they promised? Let’s have a look together!

Whoa! Look at that difference. I immediately see that this has so much more detail: it has 8 times higher resolution than the previous method. And it gets better. We noted earlier that the previous technique is not only coarse, but slow too. The coarseness is gone, fantastic, but I imagine that this is now probably even slower in return. We don’t get all this quality for free, do we? That cannot be true. Whoa! It is not only free, it is even twice as fast as the previous technique.

I also love the variety of objects it can conjure up. We can even ask for more imaginative things, like a car made out of sushi. That’s good. Now, clearly, as all of you Fellow Scholars see right here, this is not the quality of geometry that you would put into those fancy triple-A quality games, but at this point I feel that we are just two more papers away from that kind of detail.

Prompt workflow

Now, there is plenty to love here, and somehow, it gets even better. We can even write a prompt, and if we would like to refine the results, we can rewrite parts of the prompt and get a very similar result. It works great with this baby bunny, but I’ll note that if we modify the prompt too much, we might get an entirely different scene, so this one requires a light touch.

And it gets even better. We can have a collection of photos of our cat, and then ask the AI to create a virtual version of this cat, but riding on a bike. Imagine meeting your own cat in a video game! Love it.

Picasso

So, when can we try it? Well, NVIDIA is planning to include it in Picasso, their generative AI framework, which is hopefully going to be widely available soon. It will be part of the Adobe suite too. Quite honestly, this will be everywhere real soon. What a time to be alive! If you cannot wait to give it a go, and you wish to get notified when it becomes available, that is a possibility through the link in the description.

And one more thing. I have surprising news. This technique has already been surpassed. Yes, you heard it right. The pace of progress in AI and computer graphics research is so incredible that there is already something even better than this out there. I’ll give a hint: it is a Gaussian splatting-based technique. I can’t wait to tell you all about it in the next episode. Subscribe and hit the bell icon to not miss out on it.