# OpenAI's NEW MULTIMODAL GPT-4o Just SHOCKED The ENTIRE INDUSTRY!

## Метаданные

- **Канал:** TheAIGRID
- **YouTube:** https://www.youtube.com/watch?v=DrKkKLEditU
- **Дата:** 13.05.2024
- **Длительность:** 19:38
- **Просмотры:** 81,638

## Описание

OpenAI's NEW MULTIMODAL GPT-4 o Just SHOCKED The ENTIRE INDUSTRY!

How To Not Be Replaced By AGI https://youtu.be/AiDR2aMye5M
Stay Up To Date With AI Job Market - https://www.youtube.com/@UCSPkiRjFYpz-8DY-aF_1wRg 
AI Tutorials - https://www.youtube.com/@TheAIGRIDAcademy/ 

🐤 Follow Me on Twitter https://twitter.com/TheAiGrid
🌐 Checkout My website - https://theaigrid.com/

Links From Todays Video:


Welcome to my channel where i bring you the latest breakthroughs in AI. From deep learning to robotics, i cover it all. My videos offer valuable insights and perspectives that will expand your knowledge and understanding of this rapidly evolving field. Be sure to subscribe and stay updated on my latest videos.

Was there anything i missed?

(For Business Enquiries)  contact@theaigrid.com

#LLM #Largelanguagemodel #chatgpt
#AI
#ArtificialIntelligence
#MachineLearning
#DeepLearning
#NeuralNetworks
#Robotics
#DataScience

## Содержание

### [0:00](https://www.youtube.com/watch?v=DrKkKLEditU) Intro

open AI arguably just released the most impressive demo of 2024 revealing their AI system which is pretty much an end to end NE Network that can do any kind of input and any kind of output and this is just truly remarkable but without wasting any more time take a look at GPT 40 and today we're also bringing the

### [0:19](https://www.youtube.com/watch?v=DrKkKLEditU&t=19s) GPT4o

desktop app to chat GPT because we want you to be able to use it wherever you are as you can see it's easy it's simple it integrates very easily in your workflow along with it we have also refreshed the UI we know that these models get more and more complex but we want the experience of interaction to actually become more natural easy and for you not to focus on the UI at all but just focus on the collaboration which H gbt and now the big news today we are releasing our us Flagship model provides gp4 level intelligence but it is much faster and it improves on its capabilities across text vision and audio for the past couple of years we've been very focused on improving the intelligence of these models and they've gotten pretty good but this is the first time that we are really making a huge step forward when it comes to the ease of use and this is incredibly important because we're looking at the future of interaction between ourselves and the machines and we think that gbd4 is really shifting that Paradigm into the future of collaboration where this interaction becomes much more natural and Far easier but you know making this happen is actually quite complex because when we interact with one another there's a lot of stuff that we take for granted you know the ease of our dialogue when we interrupt one another the background noises the multiple voices in a conversation or you know understanding the tone of voice all of these things are actually quite complex for these models and until now with voice mode we had three models that come together to deliver this experience you have transcription intelligence and then text to speech all comes together in orchestration to believer voice mode this also brings a lot of latency to the experience and it really breaks that immersion in the collaboration with ch gbt but now with gbt 40 this all happens natively gbt 40 reasons across voice text and vision and with these incredible efficiencies it also allows us to bring the GPT 4 class intelligence to our free users this is something that we've been trying to do for many months and we're very excited to finally bring GPD 40 to all of our users we have these Advanced tools that are only available to our paid users at least until now with the efficiencies of 40 we can bring these tools to everyone so starting today you can use gpts and the GPT store so far we've had more than a million users create amazing experiences with gpts these are custom chat gbts for specific use cases they're available in the store and now our Builders have a much bigger audience where you know University professors can create content for their students or podcasters listeners and you can also use Vision so now you can upload um screenshots photos documents containing both text and images and you can start conversations with Chad gbt about all of this content you can also use memory where it makes CH GPT far more useful and helpful because now it has a sense of continuity across of all your conversations and you can use browse where you can search for Real Time information in your conversation and Advanced Data analysis where you can upload charts or any information and it will analyze this information it will give you answers and so on lastly we've also improved on the quality and speed in 50 different languages for chbt and this is very important because we want to be able to bring this experience to as many people out there as possible so we're very excited to bring GPD 40 to all of our free users out there and for the paid users they will continue to have up to five times the capacity limits of our free users but gbt 40 is not only available in ch GPT we're also bringing it to the API so our developers can start building today with GPT 40 and making amazing AI applications deploying them at scale 40 is available at 2x faster 50% cheaper and five times higher rate limits compared to gbd4 Turbo but you know as we bring these Technologies into the world it's quite challenging to figure out how to do so in a way that's both useful and also safe and gbd 40 presents new ch challenges for us when it comes to safety because we're dealing with realtime audio real time vision and our team has been hard at work figuring out how to build in mitigations against misuse hey I'm Mark so one of the key

### [6:13](https://www.youtube.com/watch?v=DrKkKLEditU&t=373s) Live Demo

capabilities we're really excited to share with you today is realtime conversational speech let's just get a demo fired up so I'm taking out a phone if you are wondering about this wire it's so we have consistent internet and if you see there's this little icon on the bottom right of the chat GPT app and this will open up GPT for's audio capabilities hey chat GPT I'm Mark how are you oh Mark I'm doing great thanks for asking how about you hey so I'm on stage right now I'm doing a live demo and frankly I'm feeling a little bit nervous can you help me calm my nerves a little bit oh you're doing a live demo right now that's awesome just take a deep breath and remember you're the expert I like that suggestion let me try a couple deep breaths can you give me feedback on my breaths okay here I go whoa slow a bit there mark you're not a vacuum cleaner breathe in count of four okay uh let me try again so I'm going to breathe in deeply and then breathe out for four and then exhale slowly okay I'll try again breathing in and breathe out that's it hi how do you feel I feel a lot better thank you so much so Mark you've been working on these capabilities for a while now can you tell us a bit how it's different from voice mode right so if you've used our voice mode experience before you'll notice a couple key differences first you know you can now interrupt the model you don't have to wait for it to finish your turn before you can start speaking and you know you can just butt in whatever you want second the model is real time responsiveness so that means that you don't have this awkward two to three second lag before you wait for the model to give a response and finally the model picks up on emotion right when I was breathing super hard there it could tell and it knew hey you might want to calm down a little bit you're kind of running a little bit too fast so you know it really has that capability across the board to perceive your emotion not only that though the model is able to generate voice in a variety of different emotive Styles and it really has a wide dynamic range so I'm going to fire up another demo for that yeah hey Chach PT how are you doing I'm doing fantastic thanks for asking how about you pretty good what's up so my friend U Barrett here he's been having trouble sleeping lately and uh I want you to tell him a bedtime story about robots and love oh I got you covered gather round Barrett once upon a time in a world not too different from ours there was a robot named bite was a curious robot always exploring the story but I want a little bit more emotion in your voice drama got it let's add some drama once upon a time in a world not too different from ours there was a robot named no CHT I really want maximal emotion like maximal expressiveness much more than you were doing before understood let's amplify the drama once upon a time in a world not too different from ours there was a robot named bite you do this SE a robotic voice now initiating dramatic robotic voice once upon a time in a world not too different from ours there was a robot named bite was a curious robot always exploring new circuit bar here calm down a little bit can you end the story um but do it in a singing voice and so bite found another robot friend and they live circly and Ever After thank you so much I definitely feel a lot more relaxed you've just seen the voice capabilities but we also want to show you the vision capabilities as the model can also see the whole world around us yeah absolutely like as you can see you know you can chat with chat GPT in real time but you can also interact with it um with video as well okay let me boot up chat gbt hey chat BT hello there how's it going it's going really well today I'd really like your help solving a math problem I'm all ears what math problem can I help you tackle today so I'm going to write down a linear equation on a sheet of paper and I'll show you and then I love your help working me through it but importantly don't tell me the solution just help give me hints along the way got it oh okay I see it no I didn't show you yet just give me help along the way one second whoops I got too excited I'm ready when you are so chat GPT what equation did I write down there ah I see it now you wrote down 3x + 1 equals 4 yep exactly so what's the first step I should take to try to solve this the first step is to get all the terms with X on one side and the constants on the other side so what do you think we should do with that plus one okay I'm going to try to subtract one from both sides and then I'll see what I get great idea perfect now what do you get when you subtract one from both sides okay so chat GPT this is what I ended up with how does this look spoton now you've isolated the term with X on one side and it looks like 3x equal 3 what do you think we should do next to solve for x I'm not sure can you give me another hint of course you have 3x and you want to find the value of x think about what operation would undo multiplication is it subtraction close but think of the opposite of multiplication because 3x means 3 * X okay I think it's division let me try to divide both sides by three and let me see what I get Bingo go ahead and divide both sides by three okay so chat GPT this is what I ended up with how does this look it looks perfect you've solved it and x equals one nicely done how do you feel about solving linear equations now yeah honestly i' I'm not pretty confident I can solve linear equations but why would you ever need to use this in the real world I'm really skeptical that's a fair question linear equations pop up in a lot of everyday situations even if we don't realize it for example calculating expenses planning travel cooking and even in business for profit and loss calculations it's basically a way to solve problems we you need to find an unknown Value Plus wow okay you know I have a lot of unknown values in my life I'm totally convinced we I'm going to learn math now I love that attitude Matt can definitely help solve some of life's mysteries whenever you're ready to dive into more math I'm here for you anything else you'd like to tackle today so chbt I really love that you know taught the value of math to my friend Mark and I wrote one last thing I love if you could take a look at of course I'd love to see what you wrote show it to me whenever you're ready okay so this is what I wrote down what do you see a I see I love chat CHT that's so sweet of you yeah well I really appreciate all the help so yeah as we can see um yeah wow that's quite the outfit you've got on yeah as we can see you know we can chat in real time to chat GPT and not only that we can interact with it with uh video as well and I swear like if you look at Barrett's GPT history it's all equations like this now that you know you can do linear equations can you show us something more complex I think so okay next let's try something uh coding related so let me take out um a computer for this okay so I'm going to open the chbt

### [14:06](https://www.youtube.com/watch?v=DrKkKLEditU&t=846s) Code Demo

desktop app like miror was talking about before okay and to give a bit of background what's going on so here we have um a computer and on the screen we have some code and then the chat gbt voice app is on the right so chat gbt will be able to hear me but it can't see anything on the screen so I'm going to highlight the code command see it and then that will send it to chat GPT and then I'm going to talk about the code to chat GPT hey there how's it going yeah it's going really well I was wondering if you could help me with a coding problem today of course I'd love to help you out what's the coding problem you're dealing with okay I'm going to share with you some code one second sure thing take your time okay so I just shared some code with you could you give me a really brief one- sentence description of what's going on in the code this code fetches daily weather data for a specific location and time period Smooths the temperature data using a rolling average annotates a significant weather event on the resulting plot and then displays the plot with the average minimum and maximum temperatures over the year awesome thanks so much and in the code there's this function Fu could you describe to me what the plot would look like if I had versus didn't have this function sure the function Fu XY takes in a data frame X containing temperature data and a window size y for calculating the rolling mean it Smooths the average temperature minimum temperature and maximum temperature columns by applying a rolling mean over the specified window with the fu function the plot would show smoother temperature lines because the rolling mean reduces the noise or fluctuations in the data awesome this makes thanks so much so yeah I definitely will want to apply this function and so I'm going to uh generate the plot and then show it to you give me one second absolutely take your time I'm excited to see the plot when you're ready yeah so far chat gbt has only been able to see the code and now I'm going to run it and then I'll use the vision capabilities of the chat PT desktop app so it can actually see everything going on the screen as well okay chat PT I'm sharing with you the plot now I'm wondering if you can give me a really brief one- sentence overview of what you see the plot displays smooth average minimum and maximum temperatures throughout 2018 with a notable annotation marking a big rainfall event in late September fantastic I have another question for you um which months do you see the hottest temperatures and roughly what temperatures do those correspond to the hottest temperatures occur around July and August the maximum temperature during these month months is roughly between 25° and 30° 77° F to 86° F what's going on with the Y AIS is this in Celsius or Fahrenheit in Celsius the temperatures are labeled accordingly such as average temperature minimum temperature and maximum temperature yeah so as we can see you know not only can chat PT help me solve very easy linear um algebra equations that I need help with it can also interact with you know code bases and see the outputs of plots and everything like this going on a computer so a few minutes ago we asked the live audience on X to submit few requests for what they would like us to try out here so I will take a couple of prompts okay bot gascar wants to know if gbd4 is capable of real time translation Mark you want to try this one sure yeah let's do it Italian so we can English Italian try to do English Italian sure let's do it hey Chachi P how are you I'm doing fantastic thanks for asking how about you how's your day going I'm uh doing great so I would like you to function as a translator I have a friend here who only speaks Italian and I only speak English and uh every time you hear English I want you to translate it to Italian and if you hear Italian I want you to translate it back to English is that good Mike she wonders if whales could talk what would they tell us um they might ask uh how do we solve linear equations certainly yes great looks like it works all right so the next one is from burit John 78 who asks can you tell what you're feeling just by looking at your face Barrett you want to give this one to try absolutely let's try it out hey chat chpt hey there what's up how can I brighten your day today okay yeah so I'm going to show you um a selfie of what I look like and then I'd like you to try to see what emotions I'm feeling based on how I'm looking sounds like a fun challenge go ahead and show me that selfie and I'll put my emotional detective hat on Okay so here's me so what kind of emotions do you think I'm feeling H it seems like I'm looking at a picture of a wooden surface oh you know what that was the thing I sent you before don't worry I'm not actually a table um okay so take a take another look uh that makes more sense a there we go it looks like you're feeling pretty happy and cheerful with a big smile and maybe even a touch of excitement whatever's going on it seems like you're in a great mood care to share the source of those Good Vibes yeah no uh the reason I'm in a really good mood is because we were doing a presentation showcasing how useful and amazing you are

---
*Источник: https://ekstraktznaniy.ru/video/14321*