AI News : GPT5 Will Be Better Than We Think , Gemini 2 China Takes The Lead , , Googles New Robots..

20:01

AI News : GPT5 Will Be Better Than We Think , Gemini 2 China Takes The Lead , , Googles New Robots..

TheAIGRID 21.07.2024 45 897 просмотров 895 лайков

Machine-readable: Markdown · JSON API · Site index

Смотреть на YouTube

Поделиться Telegram VK Бот

Транскрипт Скачать .md

Анализ с AI

Описание видео

Prepare for AGI with me - https://www.skool.com/postagiprepardness 🐤 Follow Me on Twitter https://twitter.com/TheAiGrid 🌐 Checkout My website - https://theaigrid.com/ 00:00 Introduction and AI development outlook 00:43 Continued AI improvements and scale 01:56 Microsoft CTO on AI progress 02:40 Potential of future AI models (Gemini 2, GPT-5) 03:33 Gemini 2 Test model in Chatbot Arena 04:42 Impressive image location capabilities 05:21 OpenAI's text-to-speech API in playground 05:42 Major risks of AI persuasion and propaganda 07:59 AI's potential for opinion manipulation 09:07 OpenAI's careful approach to system releases 10:12 China's progress in AI development 11:48 SenseTime's AI model demonstration 12:51 Google's Gemini 1.5 Pro with robots 14:48 Elon Musk on new Optimus design 15:15 Neuralink and Optimus integration possibilities 17:57 Hyper-personalization of media content 19:31 Potential future of personalized reality Links From Todays Video: https://x.com/tsarnick/status/1810846219499175965 https://x.com/OpenAIDevs/status/1811094275859566848 https://x.com/tsarnick/status/1810787231290024236 https://x.com/tsarnick/status/1810416352110506089 https://x.com/GoogleDeepMind/status/1811401363336901029 https://x.com/theaienterprise/status/1810359724321452096 https://x.com/TheHumanoidHub/status/1811119813315678573 Welcome to my channel where i bring you the latest breakthroughs in AI. From deep learning to robotics, i cover it all. My videos offer valuable insights and perspectives that will expand your knowledge and understanding of this rapidly evolving field. Be sure to subscribe and stay updated on my latest videos. Was there anything i missed? (For Business Enquiries) contact@theaigrid.com #LLM #Largelanguagemodel #chatgpt #AI #ArtificialIntelligence #MachineLearning #DeepLearning #NeuralNetworks #Robotics #DataScience

Оглавление (17 сегментов)

Introduction and AI development outlook

so welcome back to the AI grid today we're going to take a look at some of the top AI stories one of them being the fact that AI is going to continue getting better currently there has been a recent sentiment regarding AI technology many people think that we are towards the top of the Gardner hype cycle meaning that we are about to experience the trough of delusion currently apparently we are at the peak of inflated expectations and this is where we are headed now I think that this is mildly true in the sense that currently where we are at now there seems okay and I put an emphasis on seems to be a lack of AI development but you have to remember all of the insane model releases that we've gotten this year and we also have to remember the

Continued AI improvements and scale

fact that there are two key things that we haven't done yet number one we haven't exhausted all of our available compute yet the models that we've recently trained they've only been trained up to $100 million while that seems to be an insane number and yes it absolutely is a staggering amount of investment for a single AI system we have to realize that there is still a billion dollar training runs and10 billion draining runs that could be commencing within the next couple of months or even years which means that the amount of returns that we are going to get due to the scaling laws is very likely to continue which means that despite people thinking there is this trough of delusion and that hype is about to die down I would argue that this is not the case you have to remember that AI is very much so going to be a technology that impacts pretty much everything think we're not at diminishing marginal Returns on scale up um and like I try to I try to you know help people understand like you know there is an exponential here and like the unfortunate thing is you only get to sample it every couple of years because it just takes a while to build superc computers and then train models on top of them uh and so the

Microsoft CTO on AI progress

next sample is coming uh and like I you know I can't tell you when and I can't predict exactly how good it's going to be but it will uh almost certainly be better at like the things that are brittle right now where you're like oh my God like this is a little too expensive or it's a little too fragile for me to use like all of that gets better like it'll get cheaper and like you know things will become less fragile and then like more complicated things will become possible like that is the story of each generation of these models as we've scaled up that is Microsoft's CTO Kevin Scott confirming what I do believe about the AI space now I do want to say that this is no surprise have people recently

Potential of future AI models (Gemini 2, GPT-5)

forgotten what you're able to do with claw 3 artifacts the model is only marginally better than GPT 4 and in some case worse than GPT 40 in terms of the user preferences we can see that CLA 3. 5 Sonic being able to do what it's able to do with code is completely gamechanging and this isn't claw 3. 5 Opus which are the next iteration of models so I truly don't believe that people are going to realize what is going to happen on the second iteration of model releases things like Gemini 2 claw 3. 5 Opus GPT 5 these systems are truly going to obliterate what we currently have and what our perception is of what AI is currently capable of and in speaking about future models like Gemini 2 there is actually a recent tweet about a new model that appeared on the chatbot Arena

Gemini 2 Test model in Chatbot Arena

if you're not familiar with the chatbot Arena essentially that is a place where leading Frontier Labs collect early data on their models in the wild about how capable their systems are what you'll do is you'll put a question to a model and then what will happen is two models will respond now sometimes what you will get is you'll get a test model you can see right here that this is titled Gemini test we can see that this means that they are testing some form of Gemini now some people have speculated that this is probably going to be a exclusive Gemini Vision model but some others have stated that this is Gemini 2. 0 now if you remember Gemini 2. 0 did already start training a couple of months ago so considering the fact that we know Google are behind open aai in terms of product releases and in terms of their development and if we know that Google have recently taken a much more aggressive approach in terms of releasing their software it could very much be that they are testing Gemini 2 in the chatbot Arena to see how it performs compared to certain models and how it performs in certain benchmarks

Impressive image location capabilities

now with this example we saw that someone managed to put a picture in that they took on their phone and then of course the system Gemini managed to get exact locations simply from a personal image now the reason that this is rather impressive is because of two things either one the model just realized exactly where it was based on looking at the image and that's absolutely outstanding for it to know where you are just based on what it's seen or number two it was able to use some of its data from Google and it was able to geolocate where this person was which is also pretty cool in a small piece of information for open AI developers we

OpenAI's text-to-speech API in playground

actually did get the text to speech API in the playground so if you've ever wanted to use chat gpt's text to speech feature where you've asked a question and then it manages to talk back to you this is something that I've played around with it's actually pretty cool I just wanted to include this in the video because I know that some people might want to know that this is a feature

Major risks of AI persuasion and propaganda

major risks around I do think there are major risks around persuasion um where you know you could persuade people very strongly to do specific things um you could control people and I think that's incredibly scary um to control Society to go in a specific Direction with the current systems uh you know they're very capable of persuasion and influencing your uh way of thinking and your beliefs um so and this is something that we've been studying for a while and I do believe it's a real issue with it gets majorly exacerbated um so especially in the past year we've been very focused on how to help uh election Integrity um and there's there are a few things that we are doing so number one we're trying to prevent abuse as much as possible and so that includes improving the accuracy of detection political information detection and understanding what's going on in the platform and taking quick action when that happens so that's the second thing is reducing political bias so you might have seen that chbt was you know criticized for being overly liberal um that was Elon you're too W right well uh there were a few other voices but you know the point is that it wasn't intentional and um we work really hard to reduce the political bios in the Model Behavior and will continue to do this um and also the hallucinations um and then the third thing is we want to point people to the correct information when they're looking for where they should be voting or voting information so this is a topic that isn't as spoken about as much when it comes to AI development because it's one that tends to fall under the radar because it's quite hard to detect if an AI system is actively changing your opinion because the problem is that usually what AI does try to do is it tries to always give a fair and balanced opinion for example if you ask an AI its

AI's potential for opinion manipulation

opinion on something it usually just says either you could do a or B but it's always up to the person so with that being said it's very hard to understand when these systems are trying to persuade you now I do think there was something that they said that was rather important if you're trying to predict the kinds of dates to where AI systems are going to be released one of the things they did also talk about was the fact that persuasive AI systems they wouldn't want those systems around to maintain election integrity of course right now we are at a very intense period of time because former president was recently under an assassination attempt which is completely remarkable the fact that was even able to occur and with that if open AI were to release the future system GPT 5 around this time it could certainly put open AI in a precarious position to where certain laws and regulations are passed that impact their future development if you haven't been paying attention to open AI releases the past releases of their previous AI systems were under heavy scrutiny and heavy criticism for example when GPT 4

OpenAI's careful approach to system releases

was released it was only a month or two after that people were saying look we need to slow down because this model is way too advanced and GPT 5 is literally just you know it's just way too much in addition to that when Sora was released there was also a huge outcry from many individuals saying that you guys need to literally and this is verbatim a tweet by someone people were literally sating that you need to go and kill yourself now I think that open AI have to be very careful about how they release these systems because if they aren't they're going to be in a worse position because as these systems get more capable they can do a lot more things and impact Society even further which brings us to the question how on Earth are they going to release truly capable systems in the wild if they have the potential for superhuman persuasion that is a question that I will leave to open AI now once again it might be seeming that China has taken the lead again now this is something that comes of no surprise to me because I've been paying attention to every area of AI development and I know that China are rushing ahead in terms of the kind of things they are developing one of the

China's progress in AI development

things that they've been continuing to focus on is of course their Frontier models and recently we can see here that it seems that they've managed to catch up to GPT 40 and even apparently surpassing clae 3. 5 Sonet now I would like for these models to at least be in some area where we can truly test them I know that yes it is very hard to get access to Chinese models such as cling and of course sense time but one thing that I can't help but thinking is that many companies are brushing out these models and managing to do just marginally better on the benchmarks which leads me to believe that it might just be a exaggeration of the model's capabilities because the benchmarks are so close in terms of the differences between what the models can do it's very hard to analyze whether or not these results are actually true we can see that the average score is actually tied with GPT 40 and of course on the mm star the mmu the math Vista apparently sense time 5. 5 manages to surpass all previous iterations for example we know just how good claw 3. 5 son is like I'm astounded at how smart this model is I pretty much use it on a day-to-day basis and the difference between it and GPT 40 surprisingly is night and day which means that if this model right here is claiming to be even better than GPT 40 and 3. 5 Sonet with such a remarkable difference I truly would like to see some kind of tests or some kind of videos that can confirm this now what was also interesting was we did see this small video demo I'll leave a

SenseTime's AI model demonstration

link to this in the description and of course I did actually try to translate this but it just didn't sound right because the Echoes and the translation just completely sounded off but the long story short is that there was a video live demonstration of the sense time 40 or sense Time 5 which is basically their version of chat gbt talking to you can see it's got a camera it's got a voice mode and it has an AI system that manages to talk and reply with realtime accuracy now I'm not sure how good this is it's quite hard like I said before to get access to these systems but one thing that really does surprise me is the rate of development of these Frontier models it would be interesting to get someone who could potentially go over there and test these models and do a 1 by one/ side by side comparison with The Western models but until then I guess we're going to have to take their word for it on how capable these models are and this likely won't be the last company that manages to develop models they this good I 1. 5 Pro long context window was recently tested and integrated with robots they said a

Google's Gemini 1.5 Pro with robots

limited context makes it a challenge for many AI models to recoil environments powered with Gemini 1. 5 Pros 1 million token context length our robots can use human instructions video tours and Common Sense reasoning to successfully find their way around the space they took the robots to specific areas in a real world setting highlighting key places to recall such as Louis's desk or temporary desk area then they were asked to lead us to these locations hi robot let's go to a tour so this is uh Lou's here we have uh some open separate desks for everyone oh hey robot I'm louw can you take me to a temporary desk okay thinking with Gemini please give me a minute let's go this temp desk is reserved for you oh thank you how nice so it's clear as these models get more and more capable they will be integrated into robots and we're slowly starting to see the evolution of how that may occur there was also another video demo in which they showed how they provided more multimodal instructions such as map sketches on a whiteboard audio requests referencing places from the tour and visual cues like a box of toys hey robot can you take me somewhere to draw things okay thinking with Gemini please give me go congratulations you've reached the goal you can now draw on the Whiteboard thanks robot hey robot can you take me somewhere to draw things okay thinking with Gemini please give me a minute let's go congratulations you've reached the

Elon Musk on new Optimus design

goal you can now draw on the Whiteboard thanks robot in more robot news apparently the new Optimus design will be complete later this year and it is something special so I wonder if we're going to get an image of what that actually looks like considering the recent developments in robotics and AI there was also this clip here which shows an interesting evolution in terms of what many people might think about regarding Robotics and AI essentially if

Neuralink and Optimus integration possibilities

you remember we have neuralink which is a truly capable system that allows people to talk SLC control systems with their thoughts and here Elon Musk and his team are basically talking about how you could potentially use neuralink to control certain parts of Optimus generally for anything that's got a Bluetooth interface including potentially an yes yeah you yes you could communicate with Optimus uh yep absolutely Optimus will we Al also be able to talk to Optimus but like why not just beam it but you could just yeah instead of talking just you could just beam it directly or if someone has lost the use of speech then they can still communicate to an Optimus uh they can communicate telepathically to Optimus or Bluetooth and um and so even if someone has you know completely less the ability to speak they could still uh control Optimus or their computer or phone I mean also like if you have an Optimus and you have a nurly map the brain signal to control of the physical armor of the robot and that's a very meaningful thing like if you're you know folks that have injury one of the biggest requests is to be able to scratch something that quite annoying um and if you have a scratch on your face you like you can't fall asleep until you scratch it uh you know it's very convenient to be able to move something physically towards you similar things like eating food you know if you need somebody very hard to have dinner with friends in a way that a normal social experience and so if you can feed yourself pick up a fork and actually eat a pizza of chicken on your own uh you know that's a big deal saves a lot of interaction with caretakers and other people in your life that rely on to take I think an exciting possibility long term also is to say um if you take parts of the optim Optimus humanoid robot and you combine that with a neural link let's say somebody has lost their arms or legs uh well we could actually attach an Optimus arm or Optimus legs uh and uh do a neuralink implant so that the motor commands from your brain that go would go uh to your biological arms now go to your robot arms or robot legs um and again you you'd have basically cybernetic superpowers see so the latency from the nurlink to your hand would probably be slightly faster than it is just to go to your physical hand so you can imagine like if you're a piano player or anything that requires extremely fast know hand movements that you could actually have a pretty imbalanced right-and robotic arm control versus left-and physical Arm Control that's one of them yeah like I said this just kind of a cyberpunk deox in a future where you have cybernetic upgrades that are actually better than your biological another interesting topic that I recently discussed with someone was the hyper personalization of media content due to AI may lead to a fracturing of our once shared reality and essentially everyone living in different universes you had product placements back in the day where everything was all the Marvel movies had Coca-Cola cans in them and now you're going to have Marvel movies that have a unique product based on all of the data that they've collected off of you off your cell phone or whatever

Hyper-personalization of media content

else and that fractures reality just a little bit and then you have all of these subsequent fractures of reality where that shares if uh a shared reality is a limited commodity all of the sudden that's kind of the fundamental that that's the scariest thing at all where you can go talk to somebody across the street and they no longer live in the same universe as you because they have consumed video and media that has nothing to do with what you've seen and so that City on the back of AI is kind of baked into the way that the tools are going to be used you have Finly enough Elon Musk responded to this saying that you are not wrong and I think that this is a lot further off than people think not to say that this is not going to happen but I mean you have to think about how crazy and personalized our feeds already are for example all of our home feeds right now are remarkably different whilst yes some of your friends might have similar interests you might be into similar things I wouldn't say that there is one feed that is exactly the same as everyone else's everyone's you know what everyone sees is based on their different algorithm and preferences and what they recently watched what they watched before what they don't like and with that being said you know are we going to be living in a different reality where with certain social media apps we might have entire worlds that are just truly hyper personalized now I do think there will always be a sense of shared reality on some extent because there's always going to be things that happen in real life that people want to know about for example the Trump

Potential future of personalized reality

assassination right now there's a global it outage I do think that these are going to be things that people are always you know talking about and connected about we all still live in the same world but I do Wonder at what point you know there's going to be that drift part to where people are just in their own worlds watching their own TV shows and everything's just unique for them it's going to be a weird future and I think this is going to be probably something that happens in perhaps 20 years maybe or in fact now that I say 20 years it sounds so crazy but maybe like 10 years I would say

Другие видео автора — TheAIGRID

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Лучшие методички за неделю — каждый понедельник