I Paid 200 $ for the first ChatGPT Agent. Does It Actually Work? (OpenAI Operator)
19:26

I Paid 200 $ for the first ChatGPT Agent. Does It Actually Work? (OpenAI Operator)

The AI Advantage 23.01.2025 106 524 просмотров 2 207 лайков обн. 18.02.2026
Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
Today we have a first look at OpenAI's brand new Operator in action. What does this do, does it even work and is it worth the money? Links: https://operator.chatgpt.com/ https://openai.com/index/introducing-operator/ #openai #operator #chatgptoperator Free AI Resources: 🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter 🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0 👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/ 🐦 Twitter: https://twitter.com/TheAIAdvantage 📸 Instagram: https://www.instagram.com/ai.advantage/ Premium Options: 🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community 🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Оглавление (4 сегментов)

  1. 0:00 Segment 1 (00:00 - 05:00) 954 сл.
  2. 5:00 Segment 2 (05:00 - 10:00) 887 сл.
  3. 10:00 Segment 3 (10:00 - 15:00) 889 сл.
  4. 15:00 Segment 4 (15:00 - 19:00) 842 сл.
0:00

Segment 1 (00:00 - 05:00)

all right so it happened the spontaneous release out of open AI with the product that I personally have been waiting for the most in a long time and I know a lot of people shar's feelings because this is the agentic cat gbt version that so many people have been excited for have been waiting for I personally see this as the future of chat GPT and really the direction that they're going to be developing everything towards for now it's a future that is only available to a selected few people namely people in the US on the 200 do Pro Plan but no worries within a few months they promis to bring this to All Team subscriptions and I believe that eventually with all the competitors just like we saw with the thinking models I think even before that we're going to get operator products out of other AI Labs that work at a fraction of the price they're just the very first one to do it but that is an exciting opportunity to actually give this a spin and this video we'll be going through this and we'll be running two of our very first operations within this new product you just access it like so I am connect to a VPN to the US because I'm sitting in Portugal and I am on a Pro Plan right here so again those are the two prerequisites if you match those then you can use operator for yourself today now there's a few things I want to point out here that I think are interesting like you can schedule some of these and you can run free tasks at the same time and there's some competitor products that I want to sort of compare this to but we're going to do all of that after we run our initial operations because I think I like I could be explaining this for a few minutes but the best thing we can do is just show you because what this does is it remote controls your mouse and keyboard and that's what we're just going to do right here okay first example we're going to go easy on it I'm going to say Airbnb and as you can see it comes with these partnered apps there's more of them I'll show you that in a second but if you pick that it's going to be using the Airbnb site that it has been like pre-trained on and it works especially well on it so to say and I'm going to be booking um night in some place with CV and Lisbon how about that so book uh one night stay for two people in Lisbon for the for uh January 24th oops and yeah we're going to have a look at this in practice so this should be really interesting I think last thing that I might add here is maybe some filtration criteria so um maybe I'll add with a c View and then I'll say under $300 I guess okay let's send this let's see how it operates this is going to be unedited because I want to really see and show you how this performs on the first try full disclaimer I gave this one quick shot before I begin just to see if it's all working if it's operating I didn't let it finish so this is literally a first look and uh unfiltered one and at that so A few more interesting facts here because one of my favorite things that included here is actually its ability to um save tasks okay so if you have something that you want to be doing every week like ordering groceries in a specific way you can um you can save that and basically it can run once a week at the same time or you get a little preset for it okay we can do that in a second show you that secondly you can run free of these operations at the same time so if I just go to create a new one maybe let's do that right away and how about this one the second one we're going to do more free form okay this used the Airbnb app that it had but because operator is using um browser in the cloud essentially and just clicking you know buttons and using the keyboard for you we can actually give it a task with any website here okay so let me do that let me formulate that little prompt with something that I've actually recently done I thought of this in advance I think we're going to do um Reserve uh table for uh two people at TNA I think this is what it's called esperansa I think it's named like this in Lisbon for January um 25th let's say um through the fork that's a website that I Ed to reserve a table there recently and again I could run this as you can see here in the top left it's running this one operation now it's run running two operations at the same time it's like cloning yourself no like literally this one um this other instance of it is running and trying to book me the Airbnb accommodation while this one is running a search to reserve a table so let's have a look at this complete free form browser use you see my hands over here I'm not doing anything this is just operator doing its thing and if this works correctly it should change the location to Lisbon and then it should um pick the correct restaurant and book it and I think yeah
5:00

Segment 2 (05:00 - 10:00)

and we can watch it do that maybe we can open up a second tab see how it's going with Airbnb so this is really interesting because this is just the first step into this direction and at this point I want to point out that yes we've seen competitors do similar things here okay this is not like a first it's not like they like invented this but if you look at some of the statistics some of the benchmarks that I actually like found super refreshing here because these are just benchmarks I believe they were um somewhere here or were they in the launch video yeah I suppose they were this thing looks like look it looks like this thing performs Head and Shoulders above all of the competition um and let's see if the practical application actually proves that because really that's all I care about here benchmarks all right fine but will it get will this get it right let me tell you I spent about was it maybe 15 maybe 20 hours with mopic computer use which is essentially this from mopic okay it's a Linux machine you install it through Docker and then you basically have like a virtual computer that mopic CLA can remote control that thing could not get one thing done reliably okay I'm serious like there was not one task that you could run twice that I was able to find that actually got the job done if one of these two tasks work that's already better than computer use I can tell you that okay so it was an interesting experiment from aeropic but in my books not really usable and I don't know anybody that has been using it successfully so let's see if operator is different here what I really care about is if this works so look location waterfront that's good it's asking me a question okay so uh should I proceed with applying these filters to view the available options yes so as you can see this is not fully autonomous yet but this one is still running and you can see all the steps Happening Here you could also rewind see how it was doing what it clicked when it scrolled all of this but we'll just jump back to the live view where I could also take control here right this is something they showed in their live stream I could let it stop and take manual control but I want to just let this run autonomously okay this is looking good and what about the second operation with Airbnb here also looking good look at that okay there's some translation popup let's see if it can deal with that yep it closed it successfully this does have a SE view okay you can see um the ocean here well to be fair it's the Lisbon River but it's sort of considered C and also like the advertise it at C so this counts this works so now it asked me a question I found a listing in Lisbon with a Riverview that fits it even says Riverview aha okay smart okay it's a two-bedroom apartment with a balcony overlooking the tiure river priced at 74 okay so with taxes and cleaning fee 13 should I proceed with the booking me just say yes that looks really good okay so this is what I expect it the one with the application where it has direct access to Airbnb worked right it's already out here booking it found the place successfully I think this is the one where I'm curious uh it requires an email address to proceed with the reservation um okay let me see what if we did like a thing here um where like I don't mind I can you know hm so I don't have a fork login so I guess what I could do here I could just click take control okay it's Operator browser it's not screenshotting and it's all private what if I actually went here and let me just quickly switch to the cam log in with my Google account as I would right and then let's see if it actually works um so okay successfully pasted my email here we'll just have to have a few seconds of patience with me here but luckily I had the login information saved over here it wants twostep verification from my phone from the YouTube app let me show you no problem okay yeah yeah I'm in the US now Google no worries okay and I confirmed it and now I logged in manually okay and now well there you go eorp this is my profile and I could basically say finish up and say finish the booking rerun control to operator let's see this is Operation by the way that I literally did myself less than a week ago so you know I could save a bunch of tasks like this in my favorite restaurants with the login information as a part of the prompt and you know if I have a burner Google account like that I don't mind giving chat GPT operator my password in the prompt preset it could do of this by itself and just book my restaurants for
10:00

Segment 3 (10:00 - 15:00)

me this is amazing so while it does it I just briefly want to touch on the model that this is possible with because this is not good old chat GPT 40 it's a essentially A specialized version um from a new model called computer using agent which is essentially gp40 with vision that is trained on a bunch of computer usage okay so they showed it a bunch of examples of people interacting and doing these tasks that's why it works really well with things like Airbnb because they just showed it you know thousands millions of examples of people using their BM website that's why it works really reliably and it has you know it's one of the preset apps but from what I can see it successfully picked what did I want January 25th the fork two people like this is looking good it's still working but the only thing I had to do is log in with Google and this brand new what is it called computer using agent model got this right let me tell you there's no way on Earth that any of the competing products would have gotten two requests like this right in a row and from everything I'm seeing this is looking very good so far here I'm basically at the checkout so I just you know I need to take control and log in with my airb B account which I'm you know not going to do right now because we did already One log in I showed you how that works and the second one is still working let's see what is it doing reloading p page to finalize refreshing page like this is very promising everything else I've seen not so promising okay and on the benchmarks it just performs like 50% higher than all the competition on these agentic benchmarks they only have two agentic benchmarks so far but ladies and gentlemen let me tell you this is like the start of a new era like I know I have a tendency to overhype some of these things and I get like excited by them but like this is literally what a lot of people have in their mind when they think of AI they don't think of this assistant where you have to precisely communicate what you need and then like follow up prompts to get some info on it they think of like hey buy my groceries book my restaurants you know do my work for me and take the work off me that's what people mean that's what people come in when they open that's the expectation they come in with and right here it's doing that successfully isn't this unbelievable yeah I mean this is great look this is some you know restaurant booking sites that is not a part of open eyes program it's in Lisbon that they didn't train it on specifically right it had cookies in the beginning I bet right and it just works so I could just take control here and confirm or let's try and do this programmatically actually as the magnum opus of this video Let's just say confirm return control to operator and let's see if we can book this table I can let you know in the next video how the dinner went there on Saturday but thinking finalizing reservation click confirm your booking is confirmed wow okay so that worked very first try worked second try with their app worked what more can I say so guys quite unbelievable in my opinion because I've played a lot with these products and they always just fail and it's always just a pain okay but this worked now I could take this task and if I like going to this restaurant I could just say save task and I could say save this sort of like a prompt preset this is a gentic preset so I could save all my favorite restaurant I could give it the login information here right I could even give it the precise URL to the booking page of it to make it even faster in this case I'm just going to leave it like so I'm going to say save and then here we are save tasks and if I open up a brand new dialogue I have my custom save task down here and like I don't want to overhype this really but like I book this restaurant regularly and do you think the next time I'll be booking this restaurant I'll just be I'll be doing this manually heck no well just go to my chat GPT press this button and then you know go do something and come back and confirm that's way easier like I think chat GPT was so successful because for the use cases that really apply to all of humanity it just kills it it's like you know improve the grammar of my text improve you know the structure give me some suggestions change the tone to this like all of those basic writing use cases they're just such no-brainers or like here's a few keywords or here's what I like here's my rough email draft now turn it into something polished that stuff helps so much and just applies to
15:00

Segment 4 (15:00 - 19:00)

everybody I'm not talking about these like super limited use cases that apply to a few people chat GPT got its success because everybody can find the use case for it that works and I believe that if everybody had access to this for free at this point in time I think this is a superior way to booking a table at this concrete restaurant than going through the whole process and Google login myself and look I don't mind giving it my Google login info on this account it's just a burner Google account and I think this is promising now let's talk about the future of this to round it out okay because as you can see there's a lot of options here we can look through some of these and I think there's also a lot of custom options so at this point I would say if you want to run something Concrete in operator and you're not in the US and you don't have an account in the next 24 hours from upload I'll be do I'll be going for the comments here and I'll be running your guys prompt so if you want me to see if a prompt works and if it succeeds leave a comment below I'll take it I throw it into here and then I'll reply with the result if it worked or if it got stuck okay I think that's a good way to do it other than that look it can like aggregate news I guess there was only one thing that um I would like to correct that I said in this video which is I believe it can save these presets for you but it cannot automatically run them yet right in chat GPT we got the tasks where you can schedule things um up here and then they run regularly it's just a question of time until operator gets this obviously the road map for this will be very interesting because there's many more things we can get so to round out my thoughts on this look this clearly performs better than anything we've seen in this category up until now this is clearly as they stated just a research preview that is going out to the smallest group they have the pro users once it ships to the team users I expect there be to more presets more partnered apps more of maybe you know the little bugs that might appear if I run 50 examples through this and not two will be ironed out there will be more Integrations you will I can totally see how you will be eventually be able to save your login information to like your Google account and your Airbnb account within chat GPT uh they'll find some secure solution for that I'm sure and then on the other hand there's going to be an entire open source movement of people following opening eyes lead just like they did with o1 like we're going to upload music news episode uh Friday tomorrow and like basically you know the Chinese copied like the idea behind 01 and did open source version that is like a 100 times cheaper now that you can use locally I expect the same to happen for this it will just take some time so it's very early it's very unique I think my intuition on this is more useful than all of like AI video and all of you know gpts and actions and whatever else we've seen come out of these companies over the past year or two for the broad masses I'm not saying there's no use cases for that for certain individuals like in video production but for the masses I think that this is the most useful thing since chat GPT and I think that's a big deal it might take some time until this trickles down to everybody right now $200 probably not worth this you know operator and assistant service for most people nevertheless I'm super excited if you want me to test out your use case leave a comment below and I'll spend the next few days looking into this deeply we'll research it we'll compile all the use cases that we find I'm sure there's some interesting thing things you can do here Beyond booking a table and I honestly this is this makes me wna this makes me want to play with this stuff and yeah should be an exciting Fe future this is the worst that will ever be and it's already worth okay so thank you for so much for watching this was an unedited video on open eye operator First Look What A Time To Be Alive really like H I think this is the sweet spot where AI really does make our life easier what's coming in the next few years I don't know but right now I'm here to use this and to teach you how to use it yourself see you soon

Ещё от The AI Advantage

Ctrl+V

Экстракт Знаний в Telegram

Транскрипты, идеи, методички — всё самое полезное из лучших YouTube-каналов.

Подписаться