OpenAI o1 for Agents & More AI Use Cases
18:53

The AI Advantage 13.09.2024 30,396 views 924 likes updated 18.02.2026
Video description
To try everything Brilliant has to offer—free—for a full 30 days, visit https://brilliant.org/TheAIAdvantage/ . You’ll also get 20% off an annual premium subscription. OpenAI's new o1 models are now available to 100% of ChatGPT Plus and Teams users! I'll show you what you can expect from o1 and how you can use it yourself. I'll also tell you everything you need to know about the new Replit Agents, Google's NotebookLM upgrades, Adobe Firefly Video and more. This is a huge week for AI news you can actually use! Links: https://platform.openai.com/docs/guides/reasoning https://replit.com/~ https://x.com/cognition_labs/status/1834292718174077014/photo/1 [https://x.com/HBCoop_/status/1833318152513442250](https://x.com/HBCoop_/status/1833318152513442250?ref_src=twsrc%5Etfw%7Ctwcamp%5Etweetembed%7Ctwterm%5E1833318152513442250%7Ctwgr%5E39cfb2ceda5ba9db6363c9437e318a3144187b44%7Ctwcon%5Es1_c10&ref_url=https%3A%2F%2Fwww.notion.so%2Faiadvantage%2Fda89498c2bb743159c595730ef68b527%3Fv%3Dec9750f876384086867b914b103dd11cp%3D4c61737aebe5469ba7811de98980d75bpm%3Ds) https://x.com/pelaseyed/status/1832059120058257711 https://x.com/karankaria_/status/1833591604738691482 https://x.com/mckaywrigley/status/1832497488336707762 https://x.com/laughlinrigby/status/1833503891041771747 https://console.anthropic.com/settings/workspaces https://illuminate.google.com/home https://notebooklm.google.com/ https://www.google.com/photos/about/#ask-photos https://www.reddit.com/r/OpenAI/comments/1fcl4yb/tommy_the_dog_ai_stop_motion_cartoon/?utm_source=embedv2&utm_medium=post_embed&utm_content=whitespace&embed_host_url=https://embed.notion.co/api/iframe https://news.adobe.com/news/news-details/2024/Adobe-Previews-Major-Advancements-to-its-Upcoming-Firefly-Video-Model/default.aspx Chapters: 0:00 What’s New? 
0:55 OpenAI o1 4:29 Replit Agent 8:49 Brilliant 10:16 Illuminate & NotebookLM 13:29 Ask Photos 15:29 Riddle Me This 15:59 Anthropic Workspaces 16:57 Minimax Stop Motion 17:45 AI Video Editing #ai #news #openai #o1 This video is sponsored by Brilliant. Free AI Resources: 🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter 🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0 👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/ 🐦 Twitter: https://twitter.com/TheAIAdvantage 📸 Instagram: https://www.instagram.com/ai.advantage/ Premium Options: 🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community 🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Contents (10 segments)

  1. 0:00 What’s New? 203 words
  2. 0:55 OpenAI o1 851 words
  3. 4:29 Replit Agent 919 words
  4. 8:49 Brilliant 346 words
  5. 10:16 Illuminate & NotebookLM 750 words
  6. 13:29 Ask Photos 468 words
  7. 15:29 Riddle Me This 133 words
  8. 15:59 Anthropic Workspaces 219 words
  9. 16:57 Minimax Stop Motion 173 words
  10. 17:45 AI Video Editing 264 words
0:00

What’s New?

So this week's releases have gotten a lot of attention across the internet, and I think it's for good reason, because this is the first time we've made a meaningful step towards this agentic future that we keep talking about. It's a future where AI apps don't just assist you but actually take over some of the thinking and decision making for you and then deliver you the results. Between OpenAI's brand new model o1, which has multi-step reasoning built in, and applications like Replit Agent coming onto the market that think through software architecture before writing any code (and they do that even without using the brand new OpenAI model), we're really entering a new era of what can be done with AI tools as a consumer. And then we also have innovations out of Google, like NotebookLM being able to curate all your notes into an audio podcast with one click. I particularly like this feature. We'll talk more about this and so many more AI use cases that showed up over the course of the last week in this week's episode of AI News You Can Use. Let's get into it. All right, so first up
0:55

OpenAI o1

we need to talk about the brand new model out of OpenAI, o1. Instead of doing an in-depth discussion here, I will make this more use-case focused: how can you put this to work, and what kind of use cases can you expect coming out of this over the next weeks? Because that's really the point of the show. It's AI news you can use, not AI news gossip and theories. If you want a more in-depth look at OpenAI o1, I created a standalone video for that, talking about all the details, running a few basic logic tests and weird translations, and giving you tips on how to prompt it. But now I want to look at how to get more out of it.

One thing I do have to mention is that this is locked behind the paywall, so you need a Teams or Plus subscription to have access to this, and you only get 30 messages a week on this one and 50 on o1-mini. This is what goes on in the background: rather than you giving a prompt and it giving an answer, you give a prompt, it thinks through multiple steps, and then it gives you an answer. Okay, so this was a 15-second summary of the whole thing. But how could you actually use this, and what kind of tips do I have? Well, it's still super early and people are just figuring out how to use this.

But first things first: I thought it was fascinating that the video that I made about o1 has some of the highest quality comments I ever got on YouTube. Now, I wonder if that was due to the fact that I actually recorded this in a raw format. This whole video was one take, so maybe people were more inclined to give their genuine thoughts. But there are really good thoughts in here, and I want to highlight some of them. First of all, it's Prasan highlighting here that he's an AI engineer and the biggest difference in this model is the logical reasoning. Okay, that's well known. But then he highlights that the biggest difference here is that this model can build an entire piece of software rather than just giving you some code that will be part of a piece of software, and that's really the direction this is going in. And not just that: it will also consider the business implications and adjust accordingly. And then he hints that this will be coming over the next months.

That's exactly the theme of this video too, because the next segment, little preview, is going to be about Replit Agent, which came out last week, and that's exactly the whole point of Replit Agent. They didn't even have o1, and they already built this multi-step reasoning, agentic type of workflow where they used LLMs to design entire architectures and applications rather than just one piece of code. And now it will be working even better. More on that in a second, but I think this top comment is spot on, and this is an interesting thing we should keep an eye on. I also like this comment: "just intuitively seeking innovative solutions for inventing and system improvements." I think all of these high-level thought processes seem like the right direction, and we'll get more concrete in the next segment with Replit on what this can actually do.

But I'm going to leave you with one last tip. Considering the fact that most people have access to this through their ChatGPT Plus subscription, and considering the fact that you only have 30 messages on the preview model and 50 messages on the o1-mini model, these get used up really fast, especially as you start trying this new model and seeing how it performs on some of your common tasks. Oh no, I just used up one message, my bad. I just want to highlight the fact that you don't have to use this right away. You can, and probably should, start most of your conversations inside GPT-4o, and then if you want to, you can always switch to o1-preview and regenerate the answer. This would be especially relevant with follow-up questions where you're not satisfied with the answers, when you read the 4o response and think, "Hey, this is wrong, I wish GPT put more effort into this." Well, that's where you would go in and regenerate with o1-preview or even o1-mini.

This is also an interesting fact: many people on Twitter are already reporting that o1-mini actually works better for them, so don't dismiss it just because of the name. It seems to be performing even better than the preview model in some use cases. Again, I'll be pulling in all the use cases and presenting them to you in a separate video. It's just a bit too early for that right now, because this is just different from any other AI model we've received so far. Okay, and linking to GPT-4o, I want
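The tip from this segment (start in GPT-4o, and only spend a capped o1 message when the 4o answer falls short) can be sketched as a small budget tracker. This is an illustration only: the weekly caps of 30 and 50 are the figures quoted in the video and may have changed since, and `ModelBudget` and `pick_model` are hypothetical names, not part of any OpenAI tool or API.

```python
from dataclasses import dataclass, field

# Weekly message caps as quoted in the video (assumption: they may have changed).
WEEKLY_LIMITS = {"o1-preview": 30, "o1-mini": 50}


@dataclass
class ModelBudget:
    """Tracks how many capped o1 messages have been spent this week."""

    used: dict = field(default_factory=lambda: {name: 0 for name in WEEKLY_LIMITS})

    def remaining(self, model: str) -> int:
        """Messages left this week for a capped model."""
        return WEEKLY_LIMITS[model] - self.used[model]

    def pick_model(self, satisfied_with_4o: bool, prefer: str = "o1-preview") -> str:
        """Stay on GPT-4o unless its answer fell short and o1 budget remains."""
        if satisfied_with_4o:
            return "gpt-4o"
        # Try the preferred reasoning model first, then the other capped model.
        candidates = (prefer, *[m for m in WEEKLY_LIMITS if m != prefer])
        for model in candidates:
            if self.remaining(model) > 0:
                self.used[model] += 1  # regenerating costs one capped message
                return model
        return "gpt-4o"  # both caps exhausted, so fall back to GPT-4o


budget = ModelBudget()
print(budget.pick_model(satisfied_with_4o=True))   # no capped message spent
print(budget.pick_model(satisfied_with_4o=False))  # spends one o1-preview message
print(budget.remaining("o1-preview"))
```

Passing `prefer="o1-mini"` flips the order, which matches the observation in the video that o1-mini works better for some people.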
4:29

Replit Agent

to talk about Replit Agent. This is something that came out late last week, so it came out right after our recording cutoff. One could argue that by now Replit Agent is not as relevant anymore, because o1 came out and it's better at multi-step reasoning and code architecture and generation. But to anybody saying that, I would point out this graph. These are benchmarking results from Cognition Labs, the company behind Devin. If you remember, Devin came out around six months ago with this incredible demo of a code generation agent, where you just told it what kind of app you want to build and Devin figured out all the steps, compiled all the code, and built you a full website by itself. The demos were really impressive, but we never got access to it.

And while we still don't have Devin, Cognition published this graph where they show off the capabilities of Devin, their software agent, with various AI models in the background. So this is GPT-4o; this was OpenAI's best model. I personally would love to see Claude 3.5 Sonnet in this spot, because that was state of the art in terms of code generation; it wasn't GPT-4o. But it's okay, for this comparison I suppose it works. Then they switched out the model behind Devin from GPT-4o to o1-mini and instantly jumped by almost 50% in performance. This is their internal evaluation, so take it with a grain of salt. But then when they switched to the main model, the performance doubled: 26 to 52%, essentially. Additionally, it should be noted that all of these three benchmarks have been measured on a base model of Devin, not the production-ready model. As outlined here, they further fine-tune their model with their proprietary data, resulting in a further jump in performance.

But the point here is this: Devin performed twice as well with o1 versus GPT-4o, and Replit Agent is essentially an accessible version of Devin that works today. And while as of today, Friday, September 13th, o1 is not accessible for Replit Agent, the CEO has already been tweeting about o1, and my guess is that it's just a question of days until they implement it and their app becomes so much better at what it already does. Okay, that was a really long intro to actually show you what it does, but I thought this context was necessary to make clear that all the use cases we're about to look at are going to become even better a few days from now.

So what can Replit Agent actually build? We went out onto the internet and looked at various examples. These are the ones that caught my interest. Most of the things that Replit Agent builds, you will see, are more like internal tools rather than production-ready applications. So here, for example, we have a color palette extractor built by Heather Cooper on X. In minutes, she built a tool where you upload an image and it gives you the full color palette, and you can add extra tools. So this would be a good example of an internal tool. If you need something like this, there's no need to use an external website; you can quickly start building a library of your own tools that do exactly what you need.

Just like this two-hour live stream from Karan here, which shows him building an app that maps all the gluten-free restaurants onto a map. Look, this is the published thing; you can actually go in and use it. There you go: where are all the gluten-free restaurants in Manhattan? Here they are. I mean, let's be real, this is not going to change the world, but this also shouldn't take more than an hour or two to build, right?

And on that same note, Replit actually has a phone app where you can build things on the go. McKay Wrigley here, by the way one of the great follows on X when it comes to agentic workflows in relation to code generation, built a Stripe coupon generator on his phone in under five minutes. And look, he just types in his email and gets a custom coupon code. This got me looking at it thinking, wow, I should build my own for our own products that all run on Stripe. I think this one is actually super impressive because it's done on the phone and it's fully functional, linked to a Stripe account already.

And then the last use case I want to highlight here is building a character chatbot website. If you have interesting prompts inside of ChatGPT, you could easily set up a website like this where you could access either different characters or your different prompts in a web interface like this. Again, all of these projects only take a few hours max, and this is legitimately just the beginning, because with o1 in the background, clearly all of this will perform so much better, and therefore all of these little niche applications slash internal tools are going to expand in terms of complexity and what you can do with these apps. So I'm super excited for the future of this category, with a combination of agentic workflows and models like o1 that manage to actually scale reasoning just by using more compute power. That this worked is really unexpected, and the future is very bright for this category of applications. So one
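As a quick check on the "doubled" claim in this segment, the arithmetic can be worked through using only the two scores quoted in the video (26% and 52%). It is a sketch to separate percentage points from relative improvement, not a reproduction of Cognition's evaluation, and the function names are made up for illustration.

```python
def absolute_gain(before: float, after: float) -> float:
    """Improvement in percentage points (simple difference of two scores)."""
    return after - before


def relative_gain(before: float, after: float) -> float:
    """Improvement relative to the baseline score, expressed in percent."""
    return (after - before) / before * 100


# Scores as quoted in the video (rounded): GPT-4o vs. the main o1 model.
gpt4o_score = 26.0
o1_score = 52.0

print(absolute_gain(gpt4o_score, o1_score))  # 26.0 percentage points
print(relative_gain(gpt4o_score, o1_score))  # 100.0 percent, i.e. doubled
```

So "doubled" means a relative gain of 100%, while the score itself rose by 26 percentage points.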
8:49

Brilliant

of my personal favorite tools in AI right now is OpenAI's Code Interpreter or Anthropic's Artifacts. I use and recommend it regularly, but what I find when I do that is that certain people get a lot of value from it and others none at all. That's because it's one of these tools where you need to know where you're taking your questioning before you engage it. In this case, you need a little bit of a background in data analysis, or at least a basic understanding of why that job even exists. So yeah, this channel is really good at showing you what tools and techniques exist out there, but at the end of the day, a lot of these use cases are going to be hidden behind some base knowledge of a specific subject. And one of the best resources to up your understanding of a specific subject is the sponsor of today's video, brilliant.org. Especially in the STEM subjects, Brilliant offers hundreds of lessons, and all of them go hands-on with the concepts, helping you learn in an interactive way. So if you want to get better at using a tool like Code Interpreter, you could and should check out their data analysis learning path. It starts with the Exploring Data Visually course, which shows you what can be done with traditional data analysis tools, and then it goes on to explore various concepts and techniques within data analysis. Even if you don't apply any of them, you gain a little bit of understanding of what Code Interpreter could be doing for you, so the next time you're inside of ChatGPT or Claude, you know what to ask for. So if you want to check out this learning path or one of many other courses, head on over to brilliant.org/TheAIAdvantage. You'll get a free month, and if you decide to upgrade, 20% off the annual membership. All right, and now back to more AI news that you can use. Okay, so next up, there's two
10:16

Illuminate & NotebookLM

apps that come out of Google that you should probably be aware of, because I think these are going to be implemented into your devices and computers eventually, but right now they're available as standalone apps. I'm talking about Google's Illuminate and NotebookLM. As you can see, both of these are in the experimental phase, and I'm going to start by talking about Illuminate. What does it do? You give it a paper and it turns it into a podcast, and it's the best tool at doing that I've seen so far. This is finally coming off the waitlist. Dozens of members in our community are reporting that they just got access to this. I personally am still on the waitlist, but even Google came out with an announcement that they're releasing Illuminate, so you can generate your very own podcasts based on academic papers.

Now, I'll note that they must have some black magic going on under the hood here, because generally speaking, LLMs are not the best at summarizing long and technical papers. They just leave out certain details that might be crucial. But Illuminate seems to get it right, and if you're off the waitlist, you can finally generate your own podcasts. Here's a little sample of what that would sound like: "Let's unpack a paper titled 'Attention Is All You Need.' What's the core idea here?" "Well, the big idea in this paper is that we can build a really effective sequence transduction model." And yeah, my guess is that by the time this video is up, everybody should have access to this. They're rolling this out super quickly, just like they did with NotebookLM. That was also first on a waitlist, and now everybody has access to it.

I just quickly wanted to resurface NotebookLM and highlight it, because they did add a new feature here, and basically it's what Illuminate does to papers, but for all your sources. If you're not familiar with this app, NotebookLM is basically a research environment, sort of. You add various sources here on the left side: you can add links, paste text, add documents, up to 50 sources. Then you can talk to all that and have sort of a ChatGPT clone that is based on all of the knowledge that you add in the sources. There's a few extra features here, but essentially this is a very powerful way to research documents.

I also want to highlight one interesting fact: within the AI Advantage Community, NotebookLM has actually been the one tool that has been most discussed out of all of our guides. First of all, because the guide is really good; shout out to Ruka for doing an amazing job in researching and putting that together. But secondly, because people seem to find this really useful, especially when it comes to wrapping your head around brand new topics. You can just go in here, add various links you find, add various text snippets and articles you find, and then you can use an LLM on top of all of that. It's a really good way to interact with documents. And now you can turn anything that you upload, from PDFs to slides or charts, into a little audio summary, all with the power of AI. This product is obviously powered by Google's Gemini 1.5 Pro, which currently is the king in terms of context, 2 million tokens in total, and it can even take video. That's why something like NotebookLM works really well, because you can add all different types of sources that it will then use.

Okay, to show this off in action, there's this introduction book right here, and I could just start typing and asking questions like I would with a chatbot, or I can actually go to this Notebook Guide button, and here you can generate an audio overview. These are really the standout features that I was talking about, because look, you have suggested follow-up questions, you have a full summary, and if you want to listen to a podcast-like conversation referring to everything inside of your notebook, well, you can do that right here. That's so much easier to listen to than just a monotonous voice reading out the summary. I really like this feature, and again, if you haven't tried NotebookLM yet, give it a shot. You might be surprised. All
13:29

Ask Photos

right, next up I want to talk about a smartphone feature that is either in your phone already or coming in the next week, so this is definitely AI news that you can use. It's something that I personally have been hoping for for a while: the ability to search through every single photo or video you have on your device by its content, not just by the name or metadata like the location. Now, if you look for "Shani dancing in a red dress," it finds those pictures, and the same thing goes for action within a video. This little clip comes from the latest Apple presentation. They're shipping this to their brand new phones, but Google Photos has also announced this and is rolling it out already. Essentially, you can look inside of every picture and video on your phone and search it that way.

I can't tell you how many times I've been in a situation where I was looking for a specific video clip, but it was from four years ago and there's hundreds of videos on my phone. Especially when you're talking to people, it's annoying to have to go through all of that. You just want to find the moment that the conversation might be happening around, and this solves it. So this is just the beginning. It might take a few more weeks before this ships to you, depending on your hardware, but I suppose with Google Photos everybody could get this soon. All you have to do is sign up for the waitlist, which is over here, and then you can have access to this feature independent of your device.

The privacy concerns here? Ah, sticky one, right? Big tech company, access to all your photos and videos. I'll leave that up to you. I have to be fair and say at this point that I think Apple is the only player in this game that so far has made at least the promise of something that actually solves privacy. Their Private Cloud Compute, which is independently verifiable and built on custom hardware to make the entire process safe, is the only thing I've seen in this industry so far that promises what we all really want, and that is privacy with our data. I think it's just common sense not to want to give all of your pictures and videos to a big tech company just like that. But then, as per usual, I'm not here to judge. I'm just here to give you my takes and what's new in this space. Let's be honest: privacy concerns aside, this by itself is a pretty magical feature that I hope to see integrated everywhere in a safe manner soon. Okay, next up, this is a super
15:29

Riddle Me This

quick one, but I just like this little interface, this little website. You just go to riddlemethis.xyz, you type in a username, and then you have to guess what the prompt for this is. We've seen this before out of Google's Test Kitchen, but this one updates live, and you basically have to guess the prompt behind these AI-generated images. I like the fact that this is live generated. The prompt for this one is just "hot dog." No real use for this, except this might be a fun little game to play with friends. Me and the team just had some fun testing this while researching all of these tools, so I wanted to share it. Let's move on to the next one. All right, next
15:59

Anthropic Workspaces

up we have an announcement out of Anthropic, and they keep shipping new and highly requested features. This one is on the dev side, but I really like it because it allows you to create different workspaces for different projects. The feature is very similar to the OpenAI Projects feature that came out recently: you can create new projects, name them, and then it allows you to fine-tune models, create assistants, and do different types of testing and work inside of this project. Now we have the same thing with Anthropic Workspaces. Simply go ahead, add a workspace, pick a color, and there you go: we have a nice little test workspace, and it has a standalone set of API keys. So, for example, if I'm going to be doing tutorials in the future, I'll have a separate test workspace for those API keys to make sure I can track the API usage separately from my automations or projects that I'm experimenting with regularly. So essentially this is just a way to organize your API keys, which is very welcome because I've been using the Anthropic API more and more recently. We've been cooking up a few interesting workflows that I'll be sharing with the full AI Advantage email list relatively soon. Okay, and here I
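Since each workspace has its own set of API keys, one simple way to keep tutorial traffic apart from automation traffic in code is to store one key per workspace in an environment variable and look it up by workspace name. The sketch below assumes a hypothetical naming scheme (`ANTHROPIC_API_KEY_<WORKSPACE>`); neither the scheme nor the `api_key_for` helper comes from Anthropic.

```python
import os
from typing import Mapping, Optional


def api_key_for(workspace: str, env: Optional[Mapping[str, str]] = None) -> str:
    """Return the API key stored for a workspace.

    Assumes one environment variable per workspace, named
    ANTHROPIC_API_KEY_<WORKSPACE> (a hypothetical convention for this sketch).
    """
    env = os.environ if env is None else env
    # Normalize "test-workspace" or "test workspace" to TEST_WORKSPACE.
    suffix = workspace.strip().upper().replace("-", "_").replace(" ", "_")
    var_name = f"ANTHROPIC_API_KEY_{suffix}"
    if var_name not in env:
        raise KeyError(f"No API key configured for workspace {workspace!r}; set {var_name}")
    return env[var_name]


# Example with a fake environment so no real key is needed.
fake_env = {
    "ANTHROPIC_API_KEY_TUTORIALS": "key-for-tutorials",
    "ANTHROPIC_API_KEY_AUTOMATIONS": "key-for-automations",
}
print(api_key_for("tutorials", fake_env))
print(api_key_for("automations", fake_env))
```

The returned key could then be passed to whichever client you use; the official `anthropic` Python SDK, for instance, accepts an `api_key` argument when constructing its client. Usage is then tracked under the workspace the key belongs to.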
16:57

Minimax Stop Motion

want to spend a minute on AI video generators and what changed. First of all, an interesting use case came out for Minimax, the free AI video generator out of China that we featured last week, and it's stop motion. They seem to have trained this on a lot of stop motion animation, which has people creating these little stop motion sets. It's quite interesting, and there's many examples of this across Reddit, but it seems like it performs better at these little stop motion animations than any other model out there. So if you wanted to create a clip of Superman saving a cat from a tree, well, Minimax is your go-to model. Here, as you can see, there's more and more examples of this. But yeah, I would say this actually works. But look, these are the limited capabilities of AI video today. I think it's way more interesting in the AI video category to actually look at the things that are coming, and that's the point I'll end on today,
17:45

AI Video Editing

because big things are coming, and by big things I mean actual use cases that integrate into your workflow, stuff that we would love to use in our video production workflow. I can't even imagine how documentary filmmakers, event videographers, or even commercial or narrative filmmakers will be using features like this coming up from Adobe, because this is what the future of video looks like. You're going to be editing videos, and if you're missing a clip, well, there's going to be things like Generative Extend, where these AI video generators will be used to extend the clip that you already have. Color correction, visual effects, extra B-roll: all of these are things that actually make sense in a video production workflow. And whereas now we're in a super experimental phase of AI video, I think this is really going to be the tipping point where it's just going to become a staple for every filmmaker to be aware of these tools and know how to use them. But to be fair, that is sort of the story of AI right now. All of these tools are being developed right in front of our eyes, and this show keeps track of how these developments go. If something shows up that you could actually be putting to work, I'll feature it on the show, which is uploaded every single Friday. That's all I've got for today. There's a lot to try out this week. Don't forget to have some fun while exploring it all, and I'll see you next week.
