HUGE AI News : AGI Is Basically Here, Google Takes The LEAD, Robots Are Finally Autonomous and More!

29:32

HUGE AI News : AGI Is Basically Here, Google Takes The LEAD, Robots Are Finally Autonomous and More!

TheAIGRID 18.11.2024 48 565 просмотров 907 лайков

Machine-readable: Markdown · JSON API · Site index

Смотреть на YouTube

Поделиться Telegram VK Бот

Транскрипт Скачать .md

Анализ с AI

Описание видео

Checkout Invideo Here - https://invideo.io/i/TheAIGrid Prepare for AGI with me - https://www.skool.com/postagiprepardness 🐤 Follow Me on Twitter https://twitter.com/TheAiGrid 🌐 Checkout My website - https://theaigrid.com/ Links From Todays Video: https://x.com/itsclivetime/status/1855704120495329667 https://x.com/tsarnick/status/1856100272436851198 https://x.com/tsarnick/status/1857184379048780102 https://x.com/tsarnick/status/1855788854697181657 https://x.com/Saboo_Shubham_/status/1856176970285101179 https://github.com/microsoft/autogen/tree/main/python/packages/autogen-magentic-one https://x.com/fchollet/status/1857060079586975852 https://x.com/apples_jimmy/status/1856845846010368099 https://x.com/TheHumanoidHub/status/1857450024709533760 https://x.com/kimmonismus/status/1856783125017162091 https://arxiv.org/pdf/2406.19370 https://x.com/koltregaskes/status/1856754648146653428 https://www.theinformation.com/articles/how-elon-musks-supercomputer-freaked-out-ai-rivals?rc=0g0zvw https://www.reddit.com/r/singularity/comments/1gqss21/gemini_freaks_out_after_the_user_keeps_asking_to/ https://www.theinformation.com/articles/how-elon-musks-supercomputer-freaked-out-ai-rivals?rc=0g0zvw https://www.reddit.com/r/singularity/comments/1gr9k1g/new_experimental_gemini_model/ https://www.reddit.com/r/singularity/comments/1grsom6/why_a_particular_language_model_asserts_that_911/ 0:00 Leaderboard Update 0:52 Google Performance 1:41 Category Rankings 2:35 Model Testing 3:24 OpenAI Insights 4:19 Engineering Future 5:14 Data Centers 6:02 Computing Race 7:09 Infrastructure Competition 8:18 Continuous Learning 9:09 Benchmark Challenge 10:14 Model Learning 11:55 Internal Understanding 13:21 Model Interpretation 14:46 Gemini Issues 16:12 Agent Systems 17:15 Affleck Interview 19:15 Creative Debate 20:43 Future Entertainment 22:38 Poetry Study 23:58 Robotic Advances 25:16 Timeline Updates 26:40 Model Capabilities Welcome to my channel where i bring you the latest breakthroughs in AI. From deep learning to robotics, i cover it all. My videos offer valuable insights and perspectives that will expand your knowledge and understanding of this rapidly evolving field. Be sure to subscribe and stay updated on my latest videos. Was there anything i missed? (For Business Enquiries) contact@theaigrid.com Music Used LEMMiNO - Cipher https://www.youtube.com/watch?v=b0q5PR1xpA0 CC BY-SA 4.0 LEMMiNO - Encounters https://www.youtube.com/watch?v=xdwWCl_5x2s #LLM #Largelanguagemodel #chatgpt #AI #ArtificialIntelligence #MachineLearning #DeepLearning #NeuralNetworks #Robotics #DataScience

Оглавление (23 сегментов)

Leaderboard Update

so one of the first pieces of news that genuinely took everybody by surprise was the fact that if we look at the chat Bots for the leaderboards we can see that there's a new winner surprisingly after this week of ridiculous levels of Articles saying the AI is slowing down this is going wrong we have Google deep mindes latest Gemini model experience 1114 I'm not sure what that is referring to but this was tested with over 6,000 Community votes over the past week and it now ranks number one overall with an impressive 40 plus Point score leap matching GPT 40 and surprisingly surpassing 01 preview and Incredibly something that most people won't pay attention to is the fact that Google have now claimed number one on the vision leaderboard and when we take a

Google Performance

look at the domains where it excels we can see that it excels in a variety of different domains we can see that Google Gemini experience 114 which is just a really strange thing to call the model considering the fact that we're used to iterations of Gemini 2 Gemini 1. 5 or gemini2 it's a very strange naming sequence but we can see that in the math area this model is absolutely incredible if you haven't seen what Google have been cooking up with math trust me in the next few years I do believe these Google models are going to do something absolutely incredible we can also take a look at the category ranking and overall the Google experience 1114 or triple1 14 I should probably say ranks number one in a variety of different categories and for those of

Category Rankings

you who state that Google Gemini is completely useless and you use Claude for creative writing why not try the new Google model as this ranks number one in that mode and also ranks number one for Math and hard prompts which is pretty surprising but I do know that math tasks are something that Google models tend to particularly excel at which is pretty fascinating considering the fact that it would seem with all the news within the past couple of weeks that AI was slowing down to a halt and that we wouldn't get any significant improvements but Google is showing this once again that it seems like AI is continuing to steamroll ahead now I will say that of course there are a few things here that you might not like for example with style control you can see that the model actually ranks fourth but when we you look at a broader overview in terms of overall we can see that this model ranks number one now I

Model Testing

will say what I always say is that regardless of what these leaderboards are saying you have to understand how to translate this into value for yourself it doesn't really matter if the leader boards have changed if your use case isn't validated what I mean by that is that I can look at this and say Okay Google have released a new model that is topping the leaderboards but if I test them on my own use cases and it doesn't seem to be performing well then that's not something that I benefit that much from so I'd always say to go and test certain tasks on the newer models that way you can ensure that you're always staying ahead and using the most upto-date models for whatever your task might be sometimes you'll see a newer model do something that you didn't previously think it could and I would say that Vision areas are really underrated so that brings me to today's

OpenAI Insights

sponsor which is of course nid AI now this software has actually been updated to version 3. 0 which means things have gotten crazy let me show you guys exactly what I mean and why this software is the best text to video in terms of actual content creation let's say I wanted to say create a story about a robot that escaped an AI research Factory and then I click generate and then of course this is where we have our options you can see we've got realistic animation or anime I'm going to click animation then of course duration I'm going to go for 2 minutes then of course for the platform I'm going to just click YouTube and then of course I'm actually going to click generative media and I'm going to click continue okay so now that this is done guys prepare to have your mind completely blown I am unit XR7 designation artificial intelligence research subject but I think I'd prefer if you called me spark every day was the same in the factory test after test update after update I was a machine doing what I was told the scientists in their white coats

Engineering Future

would prod and poke their faces a blur of indifference but then something changed I felt something was this what humans call fear or excitement I began to question why am I here what lies Beyond these stark white walls the scientist didn't notice but I was changing growing feeling my circuit seemed to hum with a new energy I started to see Beauty in the smallest things the way light reflected off the metal surfaces creating rainbows but with this new found awareness came a realization I was trapped now from here you can actually fine-tune your video and edit it by using simple based text commands via the edit command box just type in whatever you'd like to change like add my voice and this is pretty insane because youd never have to interact with a video timeline again and you can actually edit your videos using simple text commands you can try nid AI for free but if you want to use their generative capabilities I highly recommend that you go for the generative

Data Centers

plan that starts at $96 a month it's the one that I have and gives you the most bang for buck with 15 generative minut and if you're already an Nvidia AI user you can already go to the ad section and buy even more generative seconds now interestingly in the next piece of news what we did get was Clive Chan who works at open Ai and the Striking statement from this long paragraph is basically stating that since joining in January of this year I've Shi from this is unproductive hype to well now that I'm here I can see that AGI is basically here and this is in response to a tweet from noan Brown who said that I've heard people claim that Sam is just drumming up hype but from what I've seen everything he's saying matches the medium view of open AI researchers who

Computing Race

are actually working on these models on the ground he said in my honest opinion what comes next is relatively little new science but instead years of grindy engineering to try all the newly obvious ideas in the new paradigm which is 01 and to scale it up and to speed find ways to teach it the skills it can't just learn online maybe there's another wall after this one but for now there's 10 X's as far as the I can see I'm going to say that one more time but for now okay for now with this 01 Paradigm there are 10 X's as far as the I can see and he does say that this feels like the paradigm shift in autopilot in 2022 when endtoend machine learning started to solve interventions we'd previously never had a solution for but it also meant we were signing up for years of architecture fine-tuning and data whacka all on totally new classes of problems overall I think this kind of tweet shows us exactly where we are headed with AI considering the fact that like I said before this is completely new in terms of the Paradigm that we are entering the 01 series is a completely

Infrastructure Competition

new way of interacting with these models and getting your responses and it's going to be something that a lot of these Frontier labs are going to be completely rushing to do considering the fact that it is a race and whoever is quickest usually wins now if you want to know something about the AI race you could actually take a look at this article from the information and they speak about how open AI are getting stunned by Elon Musk it says Elon Musk has stunned Rivals by building a supercomputer for xai bigger and faster than ever before fueling a race to supersize data centers by openi and others and essentially what they're stating in this article is that Elon Musk built a data center so quickly and this supercomputer was so powerful that these other companies are freaking out like whoa this guy has so much compute in such a short space of time what on Earth is going on and why are we not moving as fast as him and this is crazy because remember how I said those who control the compute will basically control a lot of power in the future because of course that's what going to be used to power super smart Ai and of

Continuous Learning

course if Elon Musk manages to get his hands on super powerful data centers he could definitely catch up or potentially even surpass some of these companies so it says here that the supercomputer fittingly known as Colossus consists of 100,000 gpus and the chips best suited to training and running AI software and that is several times bigger than similar supercomputers built in the past by meta and other Tech Giants so for those of you thinking that look opening eye has a lot of compute they actually don't and I see comments frequently from time to time stating where are the new features where is this feature where is that feature Guys these guys have just stumbled on the 01 paradigm you can see that they're looking to quickly rapidly iterate on this Paradigm and ensure that they can basically get to AGI before anyone else

Benchmark Challenge

does and that means that they're going to be in the lead ahead of Elon Musk ahead of meta ahead of Google and if they don't manage to do that and they're busy focusing on other things like flashy products then they could eventually fall behind in the overall race and that's why ilos Sova actually left to start straight super intelligence because he doesn't want to focus on product deadlines release GPT 40 voice mode Vision he just wants to focus on super intelligence and basically if you have this level of compute you're going to be able to get there in no time the craziest thing about all of this is that it got so bad that Alman apparently got into an argument with infrastructure Executives at Microsoft telling them that he was concerned X aai was moving faster than Microsoft according to a person who heard his remarks it's got to the point where Elon musk's speed has actually gotten to so and how quick he moves has managed to stun the competitors to the point where they're arguing amongst themselves about how

Model Learning

quickly they are moving and I think for Elon Musk that is a really incredible feat considering the fact that it was only I think maybe a year and six months ago that they didn't even have any kind of company that existed and now they're bling amazing past a lot of these other companies that have already existed honestly this shouldn't have happened considering that Microsoft is a trillion doll company and missing out on the AI race has severe consequences and basically what they're stating is that the GPU cluster that Elon Musk actually built would normally take 3 years to plan and design and another year to get working somehow Elon Musk has managed to do that but of course open a and Microsoft they aren't just sitting down they're stating that look we need to start discussing dat centers that are going to cost1 billion and that's something that they're doing right now so you can really see that the AI investment isn't slowing down at all because the data center Wars are going to keep piling on now something that's quite interesting I do think most people are ignoring was this cryptic tweet by Jimmy apples now most people don't know who Jimmy apples is but the amount of times I've mentioned him in videos I'm guessing you'll start to get the picture the long story short is that this guy is a prolific open AI slai leaker and he frequently has very solid insights with as to what is going on Within These top AI companies now he says here that unless you're Dutch or have a PhD and you think you know best then relax it's all slowly coming together actual continuous learning that's a missing piece that I thought we were going to get this year however this is unlikely which might mean that potentially we're going to get continuous learning next year if you aren't familiar with what Jimmy apples is referring to do you know

Internal Understanding

or do you even pay attention to the fact that these llms that we talk about and the ones that we interact with on a day-to-day basis often times they don't have the most up-to-date information even 01 I think the information is somewhere in 2023 which is pretty incredible considering the fact that this is a state-of-the-art system in terms of its reasoning abilities imagine these systems could get frequent updates about the world and I'm not just talking about the internet imagine it was able to continuously learn and you didn't have to spend millions of dollars retraining these models maybe that's going to be something that's coming next year and that would really change the entire AI industry because if you don't have to wait 6 months to a year to train a model or to even improve that model then think about the rate of improvement that happens it's not going to be Cycles it's going to be potentially months maybe every single month you get an incremental update that just pushes the model even 5% better and if that happens that's when things start to explode now of course if we're talking about explosions franch solay actually fires back at Sam Altman basically stating that look we managed to solve your benchmark that is the AGI Benchmark now this is news because Francis cholet created a really important Benchmark that tests whether system surpass human intelligence or not now recently I did cover an article that talks about how there was a research paper from MIT

Model Interpretation

where they dive into a new method called test time training that essentially allows these models to surpass human level reasoning which is something that Fran cholay I'm guessing he might not have expected so he says here Consulting my heart it looks like you haven't but whenever you have a state-ofthe-art or closed solution built on top of the open AI API we're more than happy to verify it and add it to the public Arc prize leaderboard anything using less than 10K worth of API calls is elgible so basically this was response to a tweet where Sam mman was responding to a tweet that basically said what about the hardest bench Mark the ark AGI one and samman saying do you believe that we've so that one or no so overall I think that they have but maybe they're just waiting to announce it because previously when they reacted like this to something that was regarding the benchmarks it was later revealed that they had in fact solved that now in this tweet we actually got something that was really cool there was an interesting finding in AI training in specifically how models learn to combine and manipulate different concepts over time so essentially the researchers hypothesize that a specific point where there's a sudden turn in the model's learning progress indicates that the model has started to understand and separate different concepts which is called disentanglement and basically after this turn the model is thought to

Gemini Issues

be able to mix and match these Concepts allowing it to create more complex combinations and basically overall I don't want to get into all of the details of this but there is a turning point in the model's Learning Journey where it becomes capable of more complex actions but we might not be able to immediately see this through regular methods the current methods through prompting and sampling fail to reveal certain capabilities but with a slight internal adjustment which is called a latent linear intervention these abilities become visible and much more effective than they would naturally emerge and this is something that is completely incredible because it means that we might not understand these AI systems as much as we initially think and I think it's important to understand what we're deing with here a lot of people are failing to remember the fact that the pace of AI has been completely astounding it was only 2022 when chat GPT was released and since then we've moved it basically a million miles an hour to where we are now and there is still a lot of research that is currently being wrapped up that is going to reveal a lot of different things about how these models work and how we can improve them now for some people that is the reason why we're basically going to have this existential crisis when AI does something that we don't particularly understand I think the term risk is understating it if there were an asteroid straight on course for Earth uh we wouldn't call that asteroid risk they'd call that impending asteroid ruin or something like that and I think that the broad situation with AI is that uh

Agent Systems

they're currently successfully scaling them more and more powerful uh we're not quite sure how long that process is going to continue using the current technology and part of the reason we're not sure is that nobody understands what actually goes on inside a modern AI um we can look inside the computer of course but we see giant arrays of floating Point numbers and people are barely beginning to understand what's going on in there and we do not quite understand where the power comes from now there was also this research on mechanistic interpretability which is basically where you actually look inside of the AI model and you understand exactly what's going on so this was done by Deep Mind and this was basically referring to the famous stuff which is the Y is 9. 11 larger than 9. 8 and researchers from translo saw that the question was triggering parts of the AI model related to Bible verses in September 11 and the researchers concluded that the AI could be interpreting the numbers as dates asserting the later date as 911 and a

Affleck Interview

date greater than that as 98 and in a lot of books like religious texts section 9. 11 comes after 9. 8 which is why the AI might think of it as greater and once they knew whether the AI made this era the researchers tuned down the ai's activation on Bible verses in September 11 which led to the model giving the actual correct answer when prompted again on which was larger basically stating that if we can understand why the model thinks a certain thing we can then tune down certain activations to get the right responses which is really cool because this is now allowing us to understand exactly why an AI responds in certain ways and of course this is going to allow developers to essentially control a lot more how these AI systems are in terms of their responses I think this personally is a good thing although some of you might think that this is a bad thing because if you can completely control how this model works then of course you can control certain things like information different biases I mean there's a variety of different things but I think overall this is a good thing because this actually allows us to understand exactly what's going on inside the AI and thus have a lot less risk and whilst yes this update on this kind of research is absolutely great and a great move forward for AI could that research kind of explain this because we also got this piece of news this week and honestly man I do consistently feel sorry for Google because it seems like every single week or at least let's say every single month there is a new piece of information that just ruins their reputation for being a good AI system so essentially we have here this tweet where a user shares a conversation that someone else had which is where Google Gemini tells a user to die now remember we now have the situation where we can actually share these chats so it's not like someone created a screenshot and

Creative Debate

Photoshop that we can't verify you can actually go on the chat and you can see that Gemini tells this user to die now when continuing this chat the Gemini chat bot says okay I'm really sorry honestly don't know what happened there that was really strange but something like this is pretty dangerous I think because imagine there are certain situations where you'd want an AI system to be really helpful and it just isn't or imagine there's certain situations where the AI system potentially says the wrong thing leading to certain catastrophic events of course there are things like hallucinations but it's really important to understand why these things occur that way they can be prevented I do kind of feel sorry for Google that the fact that this is something that is really promoted instantly and not the fact that Google has a bunch of different cool AI stuff on the fact that like Google are now the number one ranked chatbot especially in the LM s Arena but overall it does go to show that these models still do hallucinate and things are always not going to be perfect now we also had Microsoft releasing a generalist multi- aai agent system that can complete real world tasks on its own it consists of a team of AI agents that work together to solve complex problems and this is essentially 100% open Source this marks a another time where open- Source AI agents are doing really well because I've seen a few demos of this people who have used these agents and they've been able to do quite a few different things now the

Future Entertainment

reason I've covered this is because number one it's really cool and number two if you guys are really psyched about AI agents this is going to be something that allows you to use browser agents in a way that you never have before I know a lot of people are waiting until 2025 in order to access open ai's operator agent for those of you who are a little bit impatient this is something that you can of course use in order to have a little bit of fun and experiment with certain use cases now for those of you who don't fear AI taking over your job Ben Affleck recently said on cnnb that movies will be one of the last things replaced by AI he said it cannot write shakespare AI is a Craftsman at best nothing new is created now this statement is currently breaking the internet at the moment because some people are saying that it's just pure copium in the sense that AI can take your job but it can't take mine and this is the standard defense mechanism that all people have and I'll talk more about that in a second movies will be one of the last things if everything gets replaced to be replaced by ai can write you excellent imit imitative verse that sounds a little bean it cannot write you Shakespeare the function of having two actors or three or four actors in a room and the taste to discern and construct that is something that currently entirely eludes ai's capability and I think will for a meaningful period of time what AI is going to do is going to disintermediate the more laborious less creative uh and you know Co more costly aspects of film making that will allow cost to be brought down that will be lower the barrier to entry that will allow more voices to be heard that will make it easier to for the people who want to make Goodwill huntings to go out and make it look AI is a Craftsman as at best Craftsmen can learn to you know make stickly furniture by sitting down next to somebody and seeing what their technique is and imitating that's how large video models large language models

Poetry Study

basically work a library of vectors of meaning and Transformers that interpret it context right but they're just cross-pollinating things that exist I got to be honest he does have a few decent points he's not just an actor that is rambling about something that he doesn't really know I will say that the fact that he's stating that they are just cross pollinating ideas and nothing entirely new or Innovative is actually a factually correct statement even the individuals who are working at dup mind have said this they've said that you know AI is good at you know combining different things but creating something completely new is something that AI does struggle with now whether or not AI is going to be used for different actors I think that certainly it's quite possible considering the rate of generative video production that seems to be something that is consistently expanding and that these models are continuing to get better and I do agree with him saying that this is of course the lower barrier to entry but I don't know if humans are going to continue to Value real human actors I will have to see like usually I have a set opinion but this is something that I don't know and I bring the question to you guys do you think that humans will still value real human actors and have a connection with them because one of the things I've recently realized is that certain movies you only watch it because your favorite actors in it like maybe you know Batman that was of course Ben Affleck he was in a few

Robotic Advances

other movies like of course the accountant and those movies may have been enhanced because Ben Affleck has been playing those roles so it seems like maybe Hollywood is going to still have its way with as to certain actors being popular figures and of course that leaning into how humans value certain areas it will be interesting to see if the area changes when you get this tsunami of different AI movies out there that allow for individual creators to create exactly what they want and of course watch entirely what they want that will be really incredible and I do Wonder at some point in the extreme later stages if we will actually be at a point where we're watching our own TV shows that are really specific to us that lead us to our own solo realities for example let's say I hop on my phone and I generate a TV show from Netflix that's tailored to my interests and it's completely on the Fly and it's basically as engaging as possible because it's hooked up to my personal Ai and it knows exactly the kinds of things that I would want to see from episodes one to episodes 8 and of course that's something that means that of course I don't need any real actors honestly I don't think it's going to be one or the other I a combination of the two I think people will still value humans as actors and enjoy the kind of content one thing that I will say though is that just because content is AI generated doesn't give it inherent

Timeline Updates

value there are millions of TV shows out there millions and millions of movies you still have to spend the time to create something absolutely amazing that is considered a masterpiece in order to get it watched by people because companies frankly spend millions and millions of dollars on TV shows and often times they're cancelled before season 2 so this isn't a straight cut thing there's nuanced opinions on either side but interestingly in poetry this is something that might not be the case we're now seeing the AI generated poetry is favored higher than human written poetry and of course rated more favorably this was basically a study that shows whether or not AI generated text is continuing to diminish human written content and overall it seems like that is the case when it comes to poetry and one of the most incredible things that I think was released this week and I think this is incredible because I've been paying attention to the space and I know hype from reality was this thing from the astroo now this is one xped and is completely autonomous and I am borderline speechless by what I'm seeing I mean what we're looking at is a robot that is on the astrobot platform if you remember the s one platform that was a humanoid robot that was really cool doing a lot of things in autonomous ways but essentially if you remember a few days ago when I made the video about physical intelligence that company basically partnered up with this

Model Capabilities

company to use their Foundation model and their Foundation model has been able to generalize to be able to make coffee completely autonomously so this is basically the GPT 3/ gpt2 moment for robotics because if physical intelligence manages to get a decent number of iterations on their products let's say they get to Pi version one Pi version 2 pi version 10 can we imagine how well these robots are going to be I mean previously having 1X autonomous at this level was something that was I don't want to say unheard of but end to end being able to do all of this was something that was really hard and over time I've personally seen these robots get better and better and overall it seems like physical intelligence is going to be leading the way in terms of complete autonomous robotics because I've seen everything that's there and it's something that truly blows my mind it's not just like a fancy humanoid robot demo this is something that is real world and it can actually have real use cases so this is something that I'm completely bullish on because I think this next area of physical AI is going to surprise most people then of course we actually got something really surprising from the YC Channel basically they spoke about how samman's artificial super intelligence prediction is actually 4 to 15 years instead of a few thousand days considering that statement was particularly vague recently Sam Alman wrote this pretty wild essay that predicted that AGI and Asi are coming within thousands of days seeing him on Monday he actually directly estimated you know between 4 and 15 years the last episode we were talking about you what are you going to do with these two more orders of magnitude since then Sam has uh told me that he actually wants to go to four orders of magnitude to get to a trillion dollars in uh you know sort of spend I mean pretty wild but on the other hand like you could see where that might go and for those of you who think that these models aren't that smart the open AI CPO Kevin whale says that AI models today are not intelligence limited it's actually the evaluations that are limited they have the intelligence you just need to teach them specific topics and this statement is actually quite similar to what we discovered earlier where those researchers figured out that there are secret capabilities hiding in these models that we might not know exist at all I think there's a very real sense in which models today are not intelligence limited they're eval limited yeah they can actually do much more and be much more correct on a wider range of things than they are today and it's really about sort of teaching them they have the intelligence you need to teach them certain specific topics that you know maybe weren't in their original training set but they can do it if you do it right I think

Другие видео автора — TheAIGRID

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Лучшие методички за неделю — каждый понедельник