# OpenAI Introduces The "Operator Agent"....

## Метаданные

- **Канал:** TheAIGRID
- **YouTube:** https://www.youtube.com/watch?v=ExyUcMVztrA
- **Дата:** 15.11.2024
- **Длительность:** 12:46
- **Просмотры:** 23,215
- **Источник:** https://ekstraktznaniy.ru/video/13745

## Описание

Prepare for AGI with me - https://www.skool.com/postagiprepardness 
🐤 Follow Me on Twitter https://twitter.com/TheAiGrid
🌐 Checkout My website - https://theaigrid.com/

Links From Todays Video:
https://www.bloomberg.com/news/articles/2024-11-13/openai-nears-launch-of-ai-agents-to-automate-tasks-for-users

Welcome to my channel where i bring you the latest breakthroughs in AI. From deep learning to robotics, i cover it all. My videos offer valuable insights and perspectives that will expand your knowledge and understanding of this rapidly evolving field. Be sure to subscribe and stay updated on my latest videos.

Was there anything i missed?

(For Business Enquiries)  contact@theaigrid.com

Music Used

LEMMiNO - Cipher
https://www.youtube.com/watch?v=b0q5PR1xpA0
CC BY-SA 4.0
LEMMiNO - Encounters
https://www.youtube.com/watch?v=xdwWCl_5x2s

#LLM #Largelanguagemodel #chatgpt
#AI
#ArtificialIntelligence
#MachineLearning
#DeepLearning
#NeuralNetworks
#Robotics
#DataScience

## Транскрипт

### Segment 1 (00:00 - 05:00) []

so open ai's Agent plans for 2025 have finally been unveiled SL leaked it seems like open AI is planning to launch a new AI agent codenamed operator and in this video I'll dive into all the key details you'll need to know so it says here that opening ey is preparing to launch a new AI agent codenamed operator that can use a computer to take actions on a person's behalf such as writing code or booking travel according to two people familiar with the matter now interestingly one of the things that I found quite fascinating was the fact that this happened in a staff meeting on Wednesday and the fact is that there are plans to release this tool in January as a research preview you can see that it actually states that it's going to be through the company's application programming interface for developers said one of the people which is just API and of course someone was speaking on condition of anonymity to discuss internal matters now what's important to note here is that there leadership plans to release the tour in January is going to be a research preview through an API for developers and what's important is that this phrasing usually suggests a limited release targeted at researchers and developers rather than a broad public launch typically when we look at a research preview this does imply that access may be restricted to specific groups like those in a developer or research program than rather being released to the general public which kind of makes sense considering the fact that they are likely to test this out get feedback then of course release this to the general public so for those of you who are hoping for an opening a release of agents in January it's quite likely that this won't be January but would probably be further down on in the year whilst January should be slated for a GPT 5/ Aran release as they iron out the Kings now what was also really fascinating as well was the fact that this isn't the only agent that open AI are working on the article states that open AI have actually been working on several agent related research projects according to three people and the one nearest completion will be a general purpose tool that executes tasks in a web browser one of the people said so it seems like for those of you who are thinking that it's just one agent openai are working on there are going to be several different forms of Agents which means open aai are working on multiple projects at the same time which means we don't know which one could come first now I've been doing some digging and there are a few things you should know about the future of potential agents because open AI have been quite busy behind the scenes so there was this project which is of course the open AI swarm this is a framework for building and orchestrating and deploying multi-agent systems managed by the open aai Solutions teams now this is an experimental framework but the gist of how it works is that this is designed to make the coordination of multiple AI agents lightweight customizable and easy to test it enables multiple AI agents to collaborate on complex tasks by breaking them down into subtasks that are handled by specialized agents now there are two key features of this kind of agent swarm one of the first thing is that there are handoffs so in swarm an agent is a unit that performs a specific tasks based on instructions and tools and then secondly you have handoffs that allow one agent to pass control over to another agent when a task requires different expertise or context and this essentially enables seamless collaboration between multiple agents ensuring that the right agent handles each part of the task now this isn't the only thing that openai have been working on when it comes to AI agents you can see here that there are also the co-pilot agents that openai have been working on in partnership with Microsoft the co-pilot agents have a variety of different features that are going to be available pretty soon I'm not sure on the exact date as we didn't get a specific one but the demos that we did get for these agents were essentially quite different to the kinds of agents that you might expect the co-pilot agents that we're going to be getting are going to be agents that are working in the background so these agents are going to essentially monitor events and execute tasks without needing constant user input for example they can monitor inboxes they can summarize Communications and Trigger the appropriate response and these things have autonomous task execution unlike traditional co-pilots that rely on user prompts co-pilot agents can perform tasks autonomously SL automatically in response to specific triggers and they can handle long running multi-step processes like managing customer service tickets or processing orders now what's also interesting is that they also have complex task orchestration such as these agents being able to manage intricate workflows by leveraging generative AI for planning and reasoning and what's crazy about all of this is that they can

### Segment 2 (05:00 - 10:00) [5:00]

successfully handle tasks that span hours or even days and maintaining memory and context throughout the process now of course like I said before this wasn't the only agent that they are working on remember Sam Alman literally said on the Reddit AMA that the next big breakthrough will be agents now what's crazy about this is that samman wasn't just talking about stuff that they're looking to achieve he was actually talking about something that had already been demoed remember how I covered this video where we had the information report on how open aai how open AI had demonstrated a preliminary version of an agent that would use the computer to do tasks like order food for delivery according to a person who has seen the demonstration now remember okay that this agent it was actually demoed at opening ey Dev day I do know that most of you guys have seen this before so you can skip this but I will show you guys this again so you guys can understand exactly what this kind of agent demo internally at opening eye looks like now isn't confirmed that this is the demo that was done internally but this is the only current AI agent demo that we do have from open aai that shows us an agent that is able to control your browser and we see them use this agent to actually order 400 strawberries I tried to look at this screenshot several times to understand more about what's going on but for the life of me I really can't figure out what kind of agent configuration they are using it seems to be a combination of a browser agent and a voice agent number one because they need to figure out exactly the venue number two because they also need to use the voice capabilities I'm guessing they use the realtime API in order to place that call and of course interact with the restaurant could you place a call and see if you could get us 400 strawberries delivered to the venue but please keep that under $1500 I'm on it we'll get those strawberries delivered for you hello hi there is this ill I'm romance Ai assant call about it fantas you tell me what flavors of strawberry dips you have avable yeah we have chocolate vanilla and we have peanut butter wait how much would 400 chocolate covers 400 are you sure you want 400 yes 400 chocolate covered strawberries be how much would that be I think that'll be around like $1,415 92 a let's go ahead place the order 400 chocolate cup strawber great where would you like that delivered please deliver them to the Gateway Pavilion at for B and I'll be paying in cash okay sweet so just to confirm you want 400 chocolate covered strawberries to the Gateway Pavilion yes that's perfect and when can we expect delivery um well you guys are right near by so it'll be like I don't know 37 seconds that's incredibly fast cool you too so overall when we look at the state of open eyes agents it's clear that we have three key agents that are going to be in 2025 number one is the co-pilot agent this is going to be the agent that runs in the background and completes various tasks autonomously on your behalf that you initially set up to handle maybe inbound leads maybe to handle certain emails or maybe to handle the orchestration of certain tasks autonomously quite similar to AI automations that's what's going to be happening very soon of course we do have a browsing agent which is something that is going to be completing task in the browser which is quite like what we just saw and this is where I actually should have spoke about this but they said that the one nearest to completion is going to be a general purpose tool that executes tasks in a web browser and that's quite like what we saw with the other agent that was ordering certain things like strawberries so it's quite likely that we'll have this browsing agent too and of course there is the voice agent it's quite likely that we will have some kind of voice agent that we recently did just see in that demo that allows us to use an agent to call people on our behalf I'm not exactly sure how this is going to work in certain countries I honestly can't see this coming to Europe anytime soon considering the kind of laws and regulations and historically every major AI update has come out like maybe one or two months after the USA but this is likely going to be something that fundamentally changes how companies and individuals interact over the phone now if we look at the broader AI agent space we now know exactly what's going on open AI has multiple agents the browser agent The Voice agent the co-pilot agents all currently being developed anthropic has their computer use agent the agent that

### Segment 3 (10:00 - 12:00) [10:00]

is essentially taking over your computer and doing things via a screenshot and of course we've got Google who have multiple agents that are going to be doing a lot of stuff and I do have a video on Google's agents they're actually pretty surprisingly good and I would say that they are on par with open ai's agents in fact I might include a demo right now but first let me tell you what's happening behind the scenes symbol Fashions customer agent is using Google Cloud's full Suite of AI cap abilities to offer customized support interactions you know facilitate transactions like purchases and returns and ensure that I'm receiving the most upto-date information in real time I'm so close to having this shirt for the concert let's give the store a call hi there this is the symbol fashion customer agent at South Los Vegas Boulevard am I speaking with Amanda yes this is Amanda great thanks for reaching out Amanda I see you had a session on another device I've sent you an SMS message with a link to our live chat companion if you would like to switch to chat please click the link how can I help you today I'd like to purchase the shirt in my cart with the cart I have on file absolutely I see you're also a symbol fashion Rewards member looks like you have a 20% off voucher available to use would you like to apply it to this purchase yes please that would be great the shirt you're purchasing goes well with these items also available for pickup in your preferred size would any of these be interesting to you absolutely please add the white shirt and the boots to my cart great your total is $23. 76 okay to proceed with the card on file yes your purchase is confirmed do you need anything else today no I'm all set thank you incredible thank you for shopping with symbol fashion you'll get but essentially the AI agent space in 2025 is poising to be one of the biggest and a lot bigger than you already do think so overall it seems like this operator might be one that is very similar to computer use but of course it's going to be really interesting to see how they manage to integrate that natively into someone's computer as I'm sure you're going to have to give it certain permissions because giving an AI access to your computer could result in some devastating consequences so overall openi will have an agent that can control your computer we will have those co-pilot agents running in the background we will of course have a browser agent and a voice agent and it seems like openi are shaping up to have multiple agents that cover every single possible domain that other companies are focusing on so this video did help you guys understand the AI agent landscape in 2025 don't forget to leave a like on the video and I will see you in the next one
