# NVIDIA’s New AI: The Age of Real Time Game Making Is Here!

## Metadata

- **Channel:** Two Minute Papers
- **YouTube:** https://www.youtube.com/watch?v=FpZ_6bxx5v8
- **Date:** 21.02.2025
- **Duration:** 5:54
- **Views:** 61,663
- **Source:** https://ekstraktznaniy.ru/video/12583

## Description

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/papers

📝 Magic 1-For-1:
https://magic-141.github.io/Magic-141/
https://github.com/Open-Magic-Video/Magic-1-For-1
https://arxiv.org/abs/2502.07701v1

📝 Phantom: https://phantom-video.github.io/Phantom/

📝 Relighting paper: https://bujiazi.github.io/light-a-video.github.io/

📝 Stepfun:
https://github.com/stepfun-ai/Step-Video-T2V
https://yuewen.cn/videos
https://arxiv.org/abs/2502.10248
https://huggingface.co/stepfun-ai/stepvideo-t2v

📝 My paper on simulations that look almost like reality is available for free here:
https://rdcu.be/cWPfD 

Or this is the orig. Nature Physics link with clickable citations:
https://www.nature.com/articles/s41567-022-01788-5

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Benji Rabhan, B Shang, Christian Ahlin, Gordon Child, John Le, Juan Benet, Kyle Davis, Loyal Alchemist, Lukas Biewald, Michael Tedder, Owen Skarpness, Richard S

## Transcript

### Segment 1 (00:00 - 05:00) []

What a day! Free text-to-video AIs are popping up like crazy, where you write a short text prompt and you get a video of it. And now you are seeing a brand new system here, and I am delighted by how incredible it is. You see, the title of the paper says it can generate one-minute video clips within 1 minute. Is that true? Well, kind of. Let me explain. This might give you the impression that we are going to see one-minute-long generated videos. We are not. Instead, they probably meant to imply video generation times that are approaching real-time speed. That means generating 1 second of video footage within 1 second of time in real life, which is kind of mind-blowing.

I mean, less than a year ago I was able to try Sora at the OpenAI lab, and unfortunately I couldn't show it to you. Everything was super secret. And now, less than a year later, you are telling me that there are dozens of these text-to-video AI systems with matching quality, perhaps even better? I mean, look at the kind of variety it is capable of, and all this in real time. Wow, what a time to be alive!

Just think about it. Here is a huge pack of already existing techniques, many of which are less than a year old, and now, bam, this one is 12 times faster. That sounds insane. I mean, how do you even do that? How is that even possible?

Dear Fellow Scholars, this is Two Minute Papers with Dr. Károly Zsolnai-Fehér.

Well, here is a crazy idea: it does something that we Fellow Scholars are already doing. You see, if text-to-video takes too long, we don't want to try the same prompt 100 times to get something good. Instead, we have tons of free text-to-image AIs out there, so we write a short text prompt, create a still image quickly, and then, if we like the image, give that to a text-to-video AI. And we only have to generate one video, not 100. And given that we like the image, we will probably like the video too. And that is what this technique does: first, generate an image from text, and then make that image move. Now, they also use a sparsification step, cutting a few corners if you will, and this leads to nearly real-time text-to-video.

My understanding is that this can run on one consumer graphics card. Not a cheap one, mind you, but it can be done. And that really makes me think. I mean, real time! DeepMind's AI is able to take just one image and create these video games out of it. They are simple games, mind you, but games nonetheless. That was less than a year ago, and now you are saying that we have the capability to generate games out of one photo, in real time, at home? I have been doing this channel for almost 10 years and nearly a thousand videos, but I still have a hard time thinking that this is real. Unbelievable.

Now, not even this technique is perfect. It was trained on an unbalanced dataset that is very dense in human-centric and cinematic stuff. So this is a limitation of this particular trained model, but not the concept itself. That means an even better model can be trained. Wow, so good!

But it doesn't end there: there is another system that just came out, called Phantom, that does something incredible. Not just text-to-video, but subject-to-video. You can have a person, a place, or an object, and it will create a video about them, but here comes the twist: in a manner that preserves their identities. This was, and kind of still is, a problem even with image generator systems. Imagine trying to create a comic with a central character. You need many images, but the character keeps changing every time you generate them. This system is a bit lower in terms of visual quality; however, you can recall the same identities as many times as you want. The full research paper is available for this one too.

And we are still not done yet. Now that we can create videos efficiently, the content often comes out as we wanted, because we write the prompts for them, but sometimes the presentation is not exactly as we envisioned. But this new tool does this really amazing relighting for an input video without changing it much. If you want to look a bit more dramatic, not a problem. Put your cat in a cyberpunk world? Easy.

Now hold on to your papers, Fellow Scholars, for a final paper for today: Step-Video, if you are okay to wait a bit longer for video creation, but in return they promise higher visual quality. And I got to say, I am not sure if this is strictly better than the first one, which was 12 times faster, but

### Segment 2 (05:00 - 05:00) [5:00]

I left it here to show you that things are progressing so quickly, we can barely keep up with all the things that are offered for us for free. So many papers to read and so many models to try. This is open science and open source at its best. That is what I want. And just imagine that all of these works came out just a few days apart. Loving it. What a time to be alive! So, what do you think? What would you Fellow Scholars use this for? Let me know in the comments below.

Would you like to run your own copy of DeepSeek in the cloud cheaply, without using the official app? Yes? Then try Lambda GPU Cloud. They have so many powerful NVIDIA GPUs with tons of memory. I use them regularly too. Seriously, try it out now at lambdalabs.com/papers, or click the link in the description.
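
The transcript makes two quantitative points that a few lines of arithmetic can make concrete: "real time" means producing 1 second of footage per second of compute, and iterating on a cheap text-to-image step before animating a single chosen image costs far less than retrying full video generation. The sketch below is only an illustration of that reasoning. The function names and all timings (2 seconds per image, 60 seconds per video, 100 attempts) are hypothetical and are not taken from any of the papers mentioned.

```python
def real_time_factor(video_seconds: float, wall_clock_seconds: float) -> float:
    """Seconds of footage produced per second of compute.

    A value of 1.0 or more means real-time generation or faster.
    """
    if wall_clock_seconds <= 0:
        raise ValueError("wall_clock_seconds must be positive")
    return video_seconds / wall_clock_seconds


def direct_cost(attempts: int, video_cost: float) -> float:
    """Retry text-to-video directly: every attempt pays for a full video."""
    return attempts * video_cost


def two_stage_cost(attempts: int, image_cost: float, video_cost: float) -> float:
    """Retry the cheap text-to-image step, then animate only the chosen image."""
    return attempts * image_cost + video_cost


if __name__ == "__main__":
    # Hypothetical timings in seconds, chosen only for illustration.
    attempts, image_cost, video_cost = 100, 2.0, 60.0
    print("direct:", direct_cost(attempts, video_cost))
    print("two-stage:", two_stage_cost(attempts, image_cost, video_cost))
    print("real-time factor:", real_time_factor(1.0, 1.0))
```

Under these assumed numbers, the two-stage workflow pays for 100 images and one video instead of 100 videos, which is where the saving comes from. The real ratio depends on the actual models and hardware.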