NVIDIA’s New AI: Wow, 8x Better Text To 3D!
4:27

Two Minute Papers · 03.11.2023 · 66,346 views · 2,741 likes

Video description
❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.me/papers

📝 The paper "Magic3D: High-Resolution Text-to-3D Content Creation" is available here: https://research.nvidia.com/labs/dir/magic3d/
Will be available in Picasso: https://www.nvidia.com/en-us/gpu-cloud/picasso/
Get notified: https://developer.nvidia.com/picasso

My latest paper on simulations that look almost like reality is available for free here: https://rdcu.be/cWPfD
Or this is the orig. Nature Physics link with clickable citations: https://www.nature.com/articles/s41567-022-01788-5

BG3 cat footage: https://www.youtube.com/watch?v=qbCMkjob-oE

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Aleksandr Mashrabov, Alex Balfanz, Alex Haro, Andrew Melnychuk, Benji Rabhan, Bret Brizzee, Bryan Learn, B Shang, Christian Ahlin, Gaston Ingaramo, Geronimo Moralez, Gordon Child, Jace O'Brien, Jack Lukic, John Le, Kenneth Davis, Klaus Busse, Kyle Davis, Lukas Biewald, Martin, Matthew Valle, Michael Albrecht, Michael Tedder, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Rajarshi Nigam, Ramsey Elbasheer, Richard Sundvall, Steef, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi.
If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers

Thumbnail background design: Felícia Zsolnai-Fehér - http://felicia.hu
Károly Zsolnai-Fehér's research works: https://cg.tuwien.ac.at/~zsolnai/

#nvidia

Table of contents (4 segments)

<Untitled Chapter 1>

I am very excited about this paper. So what does it do? Magic, according to the authors! That is, not text to image, where we write a text prompt and get a beautiful photo or painting, but something else: text to 3D. Hmm... now that would be fantastic, I would say it qualifies

Text to 3D

as magic. We write something and out comes 3D geometry that we can put into a video game or any kind of virtual world. Teleconferencing. Anything you wish. Dear Fellow Scholars, this is Two Minute Papers with Dr. Károly Zsolnai-Fehér.

Now, wait a second. The previous technique, DreamFusion, was capable of doing that. And this was a paper that was published just a year ago. Now, yes, but that one has two problems. Problem number one, it takes a long time, and problem number two, we still get coarse results.

Now, does this paper really have the magic they promised? Let’s have a look together! Whoa! Look at that difference. I immediately see that this has so much more detail. It has 8 times higher resolution than the previous method. And it gets better. We noted earlier that the previous results are not only coarse, but slow to produce too. The coarseness is gone, fantastic, but I imagine that this is now probably even slower in return. We don’t get all this quality for free, do we? That cannot be true. Whoa! It is not only free, it is even twice as fast as the previous technique.

I also love the variety of objects it can conjure up. We can even ask for more imaginative things, like a car made out of sushi. That’s good. Now, clearly, as all of you Fellow Scholars see right here, this is not the quality of geometry that you would put into those fancy triple-A quality games, but at this point I feel that we are just two more papers away from that kind of detail.

Now, there is plenty to love here, and somehow, it gets even better. We can even write a prompt, and if we would like to refine the results, we can rewrite parts of the prompt and get a very similar

Prompt workflow

result. It works great with this baby bunny, but I’ll note that if we modify the prompt too much, we might get an entirely different scene, so this one requires a light touch.

And it gets even better. We can have a collection of photos of our cat, and then ask the AI to create a virtual version of this cat, but riding on a bike. Imagine meeting your own cat in a video game! Love it.

So, when can we try it? Well, NVIDIA is planning to include it in Picasso,

Picasso

their generative AI framework, which is hopefully going to be widely available soon. It will be part of the Adobe suite too. Quite honestly, this will be everywhere real soon. What a time to be alive! If you cannot wait to give it a go, and you wish to get notified when it becomes available, that is a possibility through the link in the description.

And one more thing. I have surprising news. This technique has already been surpassed. Yes, you heard it right. The pace of progress in AI and computer graphics research is so incredible that there is already something even better than this out there. I’ll give a hint: it is a Gaussian splatting-based technique. I can’t wait to tell you all about it in the next episode. Subscribe and hit the bell icon to not miss out on it.
