# DeepMind Gemini 1.5 - An AI That Remembers!

## Метаданные

- **Канал:** Two Minute Papers
- **YouTube:** https://www.youtube.com/watch?v=oJVwmxTOLd8
- **Дата:** 21.02.2024
- **Длительность:** 8:33
- **Просмотры:** 139,675
- **Источник:** https://ekstraktznaniy.ru/video/12681

## Описание

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/papers

📝 The paper "Gemini 1.5: Unlocking multimodal understanding across millions of tokens of
context" is available here:
https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024/
https://storage.googleapis.com/deepmind-media/gemini/gemini_v1_5_report.pdf

📝 My paper on simulations that look almost like reality is available for free here:
https://rdcu.be/cWPfD 

Or this is the orig. Nature Physics link with clickable citations:
https://www.nature.com/articles/s41567-022-01788-5

🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible:
Alex Balfanz, Alex Haro, B Shang, Benji Rabhan, Bret Brizzee, Gaston Ingaramo, Gordon Child, Jace O'Brien, John Le, Kyle Davis, Lukas Biewald, Martin, Michael Albrecht, Michael Tedder, Owen Skarpness, Richard Putra Iskandar, Richard Sundvall, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Tybie Fitzhugh, Ueli 

## Транскрипт

### Segment 1 (00:00 - 05:00) []

yes of course I'll talk about this just a few days ago open AI released Sora their text to video AI that took the World by storm but Sora was not the only thing that was released that day deep Minds AI assistant Gemini 1. 5 also appeared and it offers something that interestingly not even the great gp4 can do dear fellow Scholars this is two minute papers with Dr car and the story of Gemini 1. 5 is a story of context Windows H what is a context window well you see we are living in the age of smart AI assistants they help you with assignments homework it is a personalized teacher a programmer your assistant for daily planning and so much more however there is something still missing in imagine that you are in a conversation with someone and the context window means how long of a conversation this person can remember if you have an assistant you expect them to remember what you just talked about minutes ago right well let's see together if that is really the case h gp4 had an 8,000 token context window so it remembers at most a few pages of a book by the end of the book it doesn't remember who even wrote it or how it started and let's not even talk about movies that is way too much data then a few months later gp4 turbo appeared we jump from 8K to 128k oh my that is an entire book we can talk about a book now fantastic now hold on to your papers fellow Scholars because from 28,000 the new Gemini Pro offers a 1 to 10 million token window wait what that is insanity now let's see what that means in practice level one now little AI here is a 400ish page transcript of the Apollo 11 the American space flight that landed on the moon now find three of the funnier moments not a problem and now a multimodal question what is this of course you fellow Scholars already know but does the AI know oh my yes one small step for man one giant leap for mankind and believe it or not this was the easiest task it was given now level two now let's give it a huge code base with up to a 100,000 lines of code now see through it and show me parts of it that have to do with character animation not a problem we can even run these animations and now it can go from engineer to teacher and explain to us how it works exactly and then back to AI engineer again and code up a little slider to control the animation I absolutely love this how many people are there who are great teachers and engineers at the same time very few loving it and now the boss level three check this out little AI forget the text that's too easy let's step it up and watch a movie together and look at that we can ask questions about a 44-minute movie and it knows exactly the moment we are asking about but it gets better I did not think it could do this but it did now here is a crew drawing what is this and when did it happen yep that's the scene this is incredible and according to Google it is going to be released soon and here comes the kicker oh this is just the pro not the ultra version remember the pro version is the one that is free for everyone to use the ultra will likely be even better now wait a minute we are experienced fellow Scholars here so we have questions we found out recently that CAD GPT remembers what does that mean well have a look this is clae 2. 1 a competing chatbot from anthropic and it had a context window that was twice as big as GPT 4 Turbo however wait look at this context length is one variable but being

### Segment 2 (05:00 - 08:00) [5:00]

able to actually recall all this information is also important and when asking questions about the document it matters where in this document the information was for instance if it was early on it might be forgotten by the time it reads until the end but CH GPT remembered much more reliably so our question is clear as day what is the price to be paid for this million token contact window is Gemini going to be as red as claw 2 was here let's see and whoa that is insanity it is not perfect it will forget a few tiny details but my goodness look at that Improvement and all this barely a few days after the initial release of Gemini Bravo and yes you're seeing correctly we have a p paper oh yes so a deeper look is coming soon subscribe and hit the Bell icon to not miss out on it now the judge is always going to be us fellow Scholars the users as soon as it is out let's run our own experiments and find out how well it works in practice but just think about it given this pace of progress we will soon have essentially almost infinite tokens for a context window and this is likely not even years away but a year away possibly just months away and then it will know not just one film it will know the entire history of films better than Tarantino well maybe not as well as Tarantino but who knows and these smart AI assistants will be lifelong partners that remember every single thing you talk to them about they remember and can recall your blood work from a decade ago and have the knowledge of a good doctor to connect the dots of what happened since they will be a lifelong teacher who is infinitely patient and gets smarter and smarter every month an assistant that has access to almost the entirety of the knowledge base of humans and just imagine all the good we could do if we had this assistant right at our fingertips and all this for free or for very little what a time to be alive and yes a closer look into Sora is also coming in a few days I can't wait to show this to you if you're looking for inexpensive Cloud gpus for AI Lambda now offers the best prices in the world for GPU Cloud compute no commitments or negotiation required just sign up and launch an instance and hold on to your papers because with the Lambda GPU Cloud you can now get on demand h100 instances and they are one of the first Cloud providers to offer publicly available on demand h100 accs did I mention they also offer persistent storage so join researchers at organizations like apple MIT and ctech in using Lambda Cloud instances workstations or servers make sure to go to lamb. com slapers to sign up for one of their amazing GPU instances today
