And it gets even worse when we try putting real-looking humans in it. Oh my goodness. I don’t want to talk to this person. So this seems completely hopeless. So why is everyone talking about AI this, AI that if it can’t pull this off? Well, to get the answer to that, all you have to do is look at these papers. Yes, luckily, we have three amazing works that might solve all of these three problems. Let’s start with this one. First, rendering worlds. In goes a bunch of images about the scene, but not everything. So we need a technique to learn the scene, and to be able to draw it from viewpoints we’ve never seen it from. That is really tough. It is kind of possible with NERFs and Gaussian splatting, two of the go-to techniques to perform this these days. However, not so fast. If we don’t have enough information, they can still introduce lots of noise and visual artifacts. And some of the results are just criminally bad. So, I don’t think a single new paper could fix all of that of course, so let’s see…goodness. It can! These suddenly look almost perfect. Absolutely amazing. So how is this even possible? what the heck happened here? Well, a genius idea happened. This AI technique is trained not to give us the perfect answer immediately, but to take an imperfect one, and learn to clean it up. That is nearly as good as giving the perfect answer, however, it is much simpler to pull off. And when I look at the results, with previous techniques I’m thinking, I don’t want to use any of these. And in just one paper we go from that to…wow, let’s start using this right now! So, worlds are working okay now. Great, but remember, that is just 1 out of 3. What if we want to put new things into this virtual world? Well, previous techniques are not great at reconstructing 3D information from a photo or a video of something. And this one was from just 3 years ago, and everything is so coarse here. I don’t want to play in a world like that. Now, things have gotten a bit better since, for singular objects, newer AI methods can get pretty good results. But wait until you try an entire scene of objects. They completely fall apart. Even the better ones have trouble understanding object alignment and scale. Now check this out. This one is from a different research lab. Wow! A new AI technique can do what none of the previous ones can do, and that is, take just one image, not of an object, but of an entire scene, and create a digital 3D version of it. So let’s go back to that alignment scene, and see the new one. Wow, so cool! The whole scene, with the correct scales, and nothing is intersecting each other. So, how? Well, it has two incredible ideas to make this happen. One, it has been infused by a GPT-like AI model that is meant to understand the relation of these objects. And it is doing a glorious job at that. And now let me show you the second one, my favorite. Look at this reconstruction. As we expected, positions and scales are correct in this scene, they are true to the input photos, that is excellent. But…come on man. The guitar is poking through the box. That is really difficult to guess correctly, but here is the second genius idea: you don’t need to do that. The scene is generally good, but does not obey the laws of physics. Floating, poking things. So now, hold on to your papers Fellow Scholars, and just run a simple correction step that is inspired by physics simulations, and let it sort out all of these issues. Can it? Oh my, look at that beauty! Fantastic, so, worlds and things are good. But the last puzzle piece: