ChatGPT o3 Mini - Best Model In The World & It's FREE
12:22

ChatGPT o3 Mini - Best Model In The World & It's FREE

The AI Advantage 31.01.2025 45 230 просмотров 1 468 лайков обн. 18.02.2026
Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
o3-mini is here and it's not clear who should be using this. I analyzed all the benchmarks and prepared some recommendations. Links: https://openai.com/index/openai-o3-mini/ https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf https://openai.com/index/introducing-chatgpt-pro/ The Use Case challenge: https://community.myaiadvantage.com/c/public-challenge/ The video where I share what I use these for: https://www.youtube.com/watch?v=M0Rl2oNLPX0&t=726s&ab_channel=TheAIAdvantage Chapters: 0:00 Why this Matters? 1:45 Smartest Model in The World 4:58 Which Model Should You Use? 9:27 What Can You Use These For? #ai #o3-mini Free AI Resources: 🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter 🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0 👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/ 🐦 Twitter: https://x.com/IgorPogany 📸 Instagram: https://www.instagram.com/ai.advantage/ Premium Options: 🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community 🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Оглавление (4 сегментов)

  1. 0:00 Why this Matters? 371 сл.
  2. 1:45 Smartest Model in The World 608 сл.
  3. 4:58 Which Model Should You Use? 881 сл.
  4. 9:27 What Can You Use These For? 573 сл.
0:00

Why this Matters?

all right so open I just released all free mini the smartest the best model that we have ever seen according to all the benchmarks in this video I'm going to show you how to use it which we're just going to do right here if you're in a prade plan you simply go into here and you can switch to o3e mini but as you can see this is not exactly intuitive matter of fact I would argue it's quite the opposite you have 01 Pro you have o fre mini you have o mini High then you have deep seek which essentially does the same thing what should you be using and what decisions do you make here depending on your budget so in this video I want to answer two questions well kind of free questions the first one which one should you be using if you're on a free plan if you're not willing to spend a single dollar what is the smartest AI model that you could be using today second question what should you be using if you're on an unlimited budget and you just don't care what you spend give me the best model in the world I want that and then thirdly what do you spend and or like which model do you pick if you're on a $20 budget and you're on a plus plan or similar we going to answer all of these and we're going to do it based on benchmarks this video is not going to be about running some random test cases or doing oneof prompts or something like that we do that in separate content here we're simply going to look at all the benchmarks that came with the release because the focus of these thinking models as you might know is really more um science driven more mathematical and coding focused use cases so those are really hard to judge one by one we're going to rely upon the benchmarks here Vibes are different story okay so datadriven recommendations on what to use what is the best model in the world today for you to use let's go all right very simple um I prepared an
1:45

Smartest Model in The World

overview for you okay with pen and paper here I compiled all the benchmarks I'm going to show them to you in about 15 seconds I just want to spend 10 seconds complaining by how messy this releases if you try to look into this yourself it's mess like seriously not just the naming I'm not just talking about the fact that we have 01 O3 mini whatever O3 mini High um sure like I I'll cut them some slack for that because you know like deeps only has one model and they named it R1 so that's simpler but the benchmarks that they released here Jesus like the chat GPT Pro block post doesn't even include all the benchmarks that are um included with the O3 mini block post so that's kind of messy but then also the code forces benchmarks they're just here they're expressed in Elo values and percentiles in the world so then I had to go in and I had to like look up what each ELO means in terms of percentile and I had to translate them so we could compare apples to oranges U Apples to Apples because this is apples to oranges long story short here is everything that I came up with for you in one sheet and let me make my recommendations based on this before I do this quick explanation of what you can see over here on the screen uh basically you can see all the models that matter the most right now you can see 01 cat gpt's old reasoning model then you can see 01 Pro the one that is only available to the $200 Pro Plan then you see o free mini on the low setting the one that they just released then you have o free mini medium by the way low medium and high refers to how much reasoning the model is doing okay so if it spends more time reasoning more time thinking about something it just performs better across all the benchmarks and does better but as you can see it also takes more time matter of fact I couldn't even test o mini low in chat GPT because it's not available it's that's a API thing and all fre mini load their blog post says uh it's not available what you get on the free plan is all free mini medium okay so we have o mini medium then high and then we have deep seek R1 which you know the entire world is talking about now and a lot of people are using and we're comparing all of the benchmarks that have been published here to each other in yellow with this wonderful Sharpie marker that I got from California last year I circled the highest values on here okay so as you can see o free mini High across the board really amazing okay that's the one we want to be looking at and at the bottom this speed number this is a purely subjective Benchmark that I created here over the past hour okay so take it with like not just a grain of salt but a truckload of salt but I think it's worth something I basically went ahead and I ran uh the same prompt here through 01 Pro three times through 03 mini three times all of these three times and I averaged it out and these are the results I got in seconds okay so as you can see the old 01 was the fastest and o one pro as expected is by far the slowest here okay so those are
4:58

Which Model Should You Use?

the results now let's get to the recommendation based on these results based on the data what should you be using if you have an unlimited budget if you don't care what you spend you just want the best model out there well you would probably be on a 01 on a cat gbt Pro Plan but there's no real point in doing that anymore because as you can see o free mini on the high setting actually outperforms chat GPD Pro on these benchmarks now there's one caveat to this which I do want to highlight and that's the fact that the blog post that was published with chat gbt these numbers on for example like 01 they don't line up with the new 03 block post that's why I'm saying this one this is a little confusing and the 01 Pro evaluation I'll cave at this because look at that competition math 78 for 01 and then if I go to the 03 mini um competition math same Benchmark right all of a sudden 01 has 83 again uh over here it was 78 and now it's 83 I don't know what's up with that but basically I took these numbers for 01 Pro these are the only benchmarks I could find on the internet and from open Ai and if I compare them all free mini on the high setting just wins across the board so there's no real point to pay for the $200 plan if the only thing you care about is the smartest model out there there's a lot of other points uh you know no rate limits for example because o fre mini pro is rate limited I'll put it in the top uh comment in the description this they keep changing this um I I'll put it I'll pin it in the top comment that's what I'm trying say and the other models um underperform it so as you can see no point for the $200 plan if you want operator if you want no rate limits if you want unlimited advanced voice mode unlimited Sora all these wonderful things that may be well worth it I think especially operator is for many people for the smartest model you don't need it okay so then let's move on over to the second question of this video which model should you be using if you want to spend $ on a Premium plan or if you are spending $20 on a Premium plan well as you can see all free mini on the high setting has the highest performance so that's the one you want to be using you get better benchmarking results across the board than deepsea R1 and you get a better speed right but this is in seconds so lower is better so over here uh you can see in this row I like this um 60 seconds is faster than 95 seconds so that's better and then all the values are higher than the Deep seek R1 value so that's the point of this release right like stop using deep seek we have a better model now that's what open ey did here okay so that's a recommendation for the paid users now what about the free users because this is the most important one I think because it's going to be affecting the largest amount of people worldwide what if you're not willing to spend any money on an AI assistant and you just want the best model out there well I think and according to these numbers um I would say that's also the why open I released it that you should be using all free mini in the free plan as everybody has that now they released it to the public they would have probably never done this if deep seek didn't happen last week but all free mini is on the medium setting and as you can see we're comparing um 80 to 80 right on math science 77 to 72 all3 mini medium wins code 96 to 96 it's a tie and on software engineering bench actually that's the one where deep seek R1 performs better so you know if you care about that one specific benchmark then you should be using deeps R1 but the time it took only 32 seconds for it to generate the result on average deep seek took an average of 95 seconds three times as much and due to that um it's you know Superior performance on most benchmarks and a way better time all free mini on the free chat GPT plan is what you want to be using moving forward if you want the best and smartest AI plus chat GPT has obviously more tooling custom instructions and the file attachments and stuff like that so it's just the superior product in that sense too um and if you want the smartest model o free mini on the medium setting or in the interface um this one right here o fre mini is the best model you could be using today so basically chat GPT to Packa crown and that's what happened here so I hope this helped I have one final point and the final point
9:27

What Can You Use These For?

is many people keep asking IG like okay we have these new reasoning models like they seem really potent they are really potent I'm using them amazing what do I use them for though does this open up some new use cases because at the end of the day like I try to make this focus a channel focused on all the AI tools for consumers and how you can make your life better with them and there is specific things that these models do better than the previous generation of AI models as many people say these days you know the GPT 40's and the Sonet 3. 5s and then the llamas Etc what they do really well is they they're good at planning they're good at bigger picture thinking we're not going to talk about that in this video but I'm going to refer you to two resources that I already created matter of fact one is an interactive and open and fully free community challenge that we're running right now that I want to briefly show you I think this is like one of the most amazing things we do here at the AI advantage in the public area of our community we basically uh have this space right here so you can freely access this if you just um you don't even need an account you can just look at this entire area look I'll just close all of this and down here you have the public Challenge and the public challenge basically asks asked people all of genery viewers of this channel like what is your best use case for 01 or now 03 or deep seek doesn't matter and then multiple community members shareed their best use case here there's some really amazing ones look at that like um splitting bills and having a financial advisor or turning 01 or deep seek into your personalized science research lab we looked at this one in the last video really amazing with downloadable files to run this all and a discussion underneath of people or using it or revolutionizing HR with these new models all of these different approaches and more for you to view and I'll be doing a public Challenge on February the 3 a public stream I wanted to say and on February the 3 we're going to be looking at all of the results here and we're going to be talking more in depth about the use cases plus there is one more video on the channel where I went a little more um in depth on all1 um I'll just pull this up right here um I believe it's this one I'll link it in the description below there's a video here that is all about um using or fre okay I'll just use it uh I'll just link it below so there you go use cases from the community separate video where I share all my thoughts it's really good at translating rewriting and business activities that require more planning those are basically the free big use cases for me if you're a paid user use o free mini high if you're a free user use chat GPT O3 mini and that's going to give you the best model in the world all right I hope this was helpful bit of a confusing release and I hope this brought you some clarity enjoy

Ещё от The AI Advantage

Ctrl+V

Экстракт Знаний в Telegram

Транскрипты, идеи, методички — всё самое полезное из лучших YouTube-каналов.

Подписаться