# NVIDIA's New AI: 150x Faster Virtual Worlds!

## Метаданные

- **Канал:** Two Minute Papers
- **YouTube:** https://www.youtube.com/watch?v=jyhu1VipWpo
- **Дата:** 29.01.2025
- **Длительность:** 6:15
- **Просмотры:** 65,126

## Описание

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/papers

📝 The paper "InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds" is available here:
https://instantsplat.github.io/
Try it out (hopefully works): https://huggingface.co/spaces/kairunwen/InstantSplat

Clouds paper: https://arcanous98.github.io/projectPages/gaussianVolumes.html

📝 My paper on simulations that look almost like reality is available for free here:
https://rdcu.be/cWPfD 

Or this is the orig. Nature Physics link with clickable citations:
https://www.nature.com/articles/s41567-022-01788-5

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Benji Rabhan, B Shang, Christian Ahlin, Gordon Child, John Le, Juan Benet, Kyle Davis, Loyal Alchemist, Lukas Biewald, Michael Tedder, Owen Skarpness, Richard Sundvall, Steef, Taras Bobrovytsky, Thomas Krcmar, Tybie Fitzhugh, Ueli Gallizzi
If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers

My research: https://cg.tuwien.ac.at/~zsolnai/
X/Twitter: https://twitter.com/twominutepapers
Thumbnail design: Felícia Zsolnai-Fehér - http://felicia.hu

#nvidia

## Содержание

### [0:00](https://www.youtube.com/watch?v=jyhu1VipWpo) Segment 1 (00:00 - 05:00)

you snap just three photos of something a statue your living room even your cat doing that weird loaf thing or to flex your Nintendo switch 2 that you got earlier than anyone else priorities and then bam in seconds you're holding a fully immersive 3D version of that thing sounds great except that previous methods are not really up for the task there is just not enough information in these three photos to make it happen think of it as trying to bake a cake with just three grains of flour and a desperate prayer the result is going to be a blurry mess not just with this technique but with many other techniques too but Nvidia just unveiled a new AI technique called instant Splat that look wow this one can pull it off just three images and the rest are all synthesized by the AI now we're talking and I'll tell you how you can use it too when compared to a previous technique from just about a year ago against nope Nerf it goes on and after half an hour it nopes out but the new one what can it do in half an hour now that horse looks absolutely Majestic and wait a second this was not half an hour this is not an equal time comparison this was just 9 seconds this is not an incremental update this looks like a paradigm shift to me this goes against everything that I know about 3D modeling I mean just look at it this comes from two photos snapped on the surface of Mars and finally we can look at it in all of its Glory previously that was only possible maybe in science fiction movies NASA spent billions on her over and all we needed was just an AI that could Vibe with a selfie stick and the more we look the better it gets yes there are visual artifacts the Reconstruction is not perfect yet but look at those beautiful specular Reflections on the car these are view dependent effects that you cannot just reconstruct from a couple of photos no this requires proper intelligence to really understand how these materials work in reality an absolute stunner and you can even try it right Now by dragging your images here you can even create an entire city which is I don't even know what to say and of course the results will not be perfect yes it even captures these yummy glossy Reflections in this scene but they are a tiny bit hazy and some artifacts remain yes all true but this is so far beyond previous techniques many of which just gave you a blurry mess which is completely understandable able three images conveys so little information especially if you want to rotate the scene and look behind things and my favorite the images don't even need to be posed so if you move the camera a bit between the shots not a problem goodness and remember it is better not only in terms of quality but it is blazing fast the old way can take 84 minutes of grinding which is 84,000 times more than the average attention span on Tik Tok while this one is done in seconds 150 times faster and all this with this kind of quality improvement you see the new one on the left while previous methods are to the right there is no contest whatsoever here and the full paper and source code is available to all of us for free I am super grateful for this one it builds on the gaussian splatting technique which imagines the surfaces in the world has a set of little bumps which can give you a high quality scene that takes up very little data on your hard drive now note that we are talking about surfaces here not volumes it is not great at modeling something like a puff of smoke but here is an amazing new paper for that look at that beauty this uses a proper light transport simulation that is R tracing to model this cloud and the underlying data structure is also a bunch of these gaussian lumps and don't worry about the noise if you wait a little it cleans up over time just fine and when looking into the three photo reconstruction paper yep they ditched the classic structure from motion element used almost everywhere it's like saying we are building a car and we are ditching the wheels imagine showing up to a Formula 1 race with no tires and still lapping everyone that's the vibe here so

### [5:00](https://www.youtube.com/watch?v=jyhu1VipWpo&t=300s) Segment 2 (05:00 - 06:00)

these are the crazy times we live in the Nintendo switch is coming out soon you take three photos and get a virtual world and you will even be able to model explosions smoke and Haze with it Michael Bay just popped a champagne bottle somewhere and all this for free through the power of research papers the future is coming and it is coming fast now 150 times faster imagine taking a couple photos and creating a video game where you can play in real world places what a time to be alive and now Excuse me while I take three photos of my kitchen and turn it into a level for Doom for research of course this is 2minute papers with Dr car to run your own experiments on an Nvidia GPU check out Lambda I use it myself regularly for these videos H look at that you can generate high quality images in less than a second per image I did a ton more of them and paid less than a dollar for all this crazy seriously try it out now at lamb labs. com SLP papers or click the link in the description

---
*Источник: https://ekstraktznaniy.ru/video/12635*