# Enhance! AI Super Resolution Is Here!

## Метаданные

- **Канал:** Two Minute Papers
- **YouTube:** https://www.youtube.com/watch?v=POJ1w8H8OjY
- **Дата:** 13.02.2024
- **Длительность:** 7:04
- **Просмотры:** 74,987

## Описание

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.me/papers

📝 The paper "Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild" is available here:
https://supir.xpixel.group/

📝 My latest paper on simulations that look almost like reality is available for free here:
https://rdcu.be/cWPfD 

Or this is the orig. Nature Physics link with clickable citations:
https://www.nature.com/articles/s41567-022-01788-5

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Alex Balfanz, Alex Haro, B Shang, Benji Rabhan, Bret Brizzee, Gaston Ingaramo, Gordon Child, Jace O'Brien, John Le, Kyle Davis, Lukas Biewald, Martin, Michael Albrecht, Michael Tedder, Owen Skarpness, Richard Putra Iskandar, Richard Sundvall, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Tybie Fitzhugh, Ueli Gallizzi.
If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers

Thumbnail background design: Felícia Zsolnai-Fehér - http://felicia.hu
Károly Zsolnai-Fehér's research works: https://cg.tuwien.ac.at/~zsolnai/
Twitter: https://twitter.com/twominutepapers

## Содержание

### [0:00](https://www.youtube.com/watch?v=POJ1w8H8OjY) Intro

This new AI system has looked at 20 million images. So what has it learned? It learned to do something meaningful with these unusable, super pixelated images. Dear Fellow Scholars, this is Two Minute Papers with Dr. Károly Zsolnai-Fehér. So, the goal here would be that a pixelated image goes in, and out comes what?

### [0:23](https://www.youtube.com/watch?v=POJ1w8H8OjY&t=23s) Super Resolution

Well, out comes an image with more details. This is what we call super resolution and this is what a previous technique was capable of. The result is fair, it is little less blurry, but I don’t feel that it has added too much detail to the image. But, a newer technique from 2023, from less than a year ago, whoa. Now we’re talking. Tons of detail just appeared! The feathers looks great, but the eye. My goodness, the eye is a bit creepy. Not a fan. And now, let’s see if the new technique can do even better…and…oh my, look at that! That looks fantastic! If you thought the feathers were good before, look at them now, and finally, the eyes are completely different, and much more realistic. And if we run the same for a person, previous techniques aren’t even able to recognize that this should be a person.

### [1:30](https://www.youtube.com/watch?v=POJ1w8H8OjY&t=90s) Car

But the new technique. That is incredible. So, what other incredible things is it capable of? Well, let’s see together through 6 examples. One. Have a look at this car. We have an unusably pixelated image here of a car, and we know it’s good, so it’s going to get the car right I am sure, but are you thinking what I am thinking? Because I am thinking license plate. Can it do anything with the license plate? Not a chance, right? Let’s see…and…whoa! Look, we got something. Now I will immediately say that there is so little information in the input image that this new data might very well be completely made up. If not, bravo! But even if it is, I love the fact that the AI knows that this is roughly the kind of information that belongs there. Really cool. Two, how about trying to remix old videos game graphics? That works too. And if we could do that in real time, we might not even need to store all of these detailed textures, just small coarse ones if we know that we can super resolve it to something way better in real time.

### [2:53](https://www.youtube.com/watch?v=POJ1w8H8OjY&t=173s) Landscapes

Three, landscapes. It not only gives us the clouds up there, but also the trees in the background in considerable detail. And if it can create so many new details, I am thinking about just drawing up a really crude draft of an image, and have the super resolution AI do the heavy lifting. That would be so good. Also, not far away at all.

### [3:19](https://www.youtube.com/watch?v=POJ1w8H8OjY&t=199s) Animals

Four, animals. Just look at that. In the area here, we get a really sharp image, we can see the fur almost down to the individual strands, and in the background, we even have a subtle depth of field effect as it goes out of focus. Fantastic. Five, human faces.

### [3:38](https://www.youtube.com/watch?v=POJ1w8H8OjY&t=218s) Human Faces

Now this one puts up a clinic on how to do this. Wow. Here, it is even better - we get a result down to the individual level of hair strands. Some of them in the foreground look a little plasticky, and the teeth are not great either, but everything else. Unbelievable. Just think about having the internet using images and videos that are compressed down to take a hundred times less space because every device will be capable of performing this kind of super resolution soon.

### [4:16](https://www.youtube.com/watch?v=POJ1w8H8OjY&t=256s) Vintage Photos

And six, vintage photos and movies. This one has tons of those signature compression artifacts, the image is also probably zoomed in. Almost unusable. I don’t think I would want to watch this video. And now, hold on to your papers Fellow Scholars, and look at this. Holy mother of papers! I am just looking and looking, and I can’t believe that this is possible. Interestingly, the paper discusses that the AI has also learned from 100k negative prompts.

### [4:45](https://www.youtube.com/watch?v=POJ1w8H8OjY&t=285s) Negative Prompts

What does that mean? This means that it was given counterexamples, badly done images to also teach it not only what to do, but also what not to do. And this is a good paper, so they also tested how much of a difference this makes. So let’s see. Goodness, it makes a huge difference. But, we are not done yet, because we haven’t even talked about my favorite feature. Text prompts. Wait, what can text prompts possibly do here? We already have an image, just repaint the image, right? Why do we need the text? You see, this is great, if we invest an additional minute to describe what is supposed to be in the scene, it creates these absolutely incredible results. Or we can tell it that look, this should be a bicycle, and now it is. But, it gets better. Two, we can even use the text prompts to bend reality itself. Yes, in this case, we can even specify that the woman’s hat should be made out of suede or denim. That’s good. And now I wonder if we can push this further? Oh yes! Yes we can! We can easily add a mustache, I can’t tell that this has been added, I don’t think anyone can, or we can even choose if our subject should be resolved to a young or an old man. How cool is that? Bending reality. What a time to be alive! And, good news, an online demo of it is either coming or will be already available by the time this video appears. So, hopefully, let the experiments begin!

---
*Источник: https://ekstraktznaniy.ru/video/12717*