OpenAI FINALLY Added Video to ChatGPT & More AI Use Cases
19:55

OpenAI FINALLY Added Video to ChatGPT & More AI Use Cases

The AI Advantage 13.12.2024 34 309 просмотров 978 лайков обн. 18.02.2026
Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
Start using the Lyzr AI studio to build and deploy your own custom AI agents today! 👉 https://hubs.ly/Q02_6Tsc0 The 12 Days of OpenAI are in full swing! There were a ton of releases in the first few days, including Sora, Live Video in Advanced Voice Mode, updates to Canvas. This week we also got AI updates from Google, Midjourney, Grok and more. We'll break it all down for you in today's video. Links: https://help.openai.com/en/articles/10271060-12-days-of-openai-release-updates https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/ https://www.midjourney.com/updates https://devin.ai/ https://x.com/i/grok https://countless.dev/ https://huggingface.co/spaces/JeffreyXiang/TRELLIS https://www.info.writewithpearl.com/ Chapters: 00:00 What’s New! 00:37 12 Days Of OpenAI 01:00 Sora Release 02:10 ChatGPT Canvas Releases 03:53 ChatGPT x Apple Inteligence 04:18 Advanced Voice Mode Video Option 07:04 Lyzr Agent Studio 09:00 Gemini 2.0 10:01 Midjourney Canvas 11:18 Devin Finally Launching! 13:58 Grok Updates 16:04 Countless.dev 16:50 TRELLIS Image to 3D 17:56 Pearl AI Journal #ai #openai #chatgpt This video is sponsored by Lyzr AI. Free AI Resources: 🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter 🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0 👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/ 🐦 Twitter: https://twitter.com/TheAIAdvantage 📸 Instagram: https://www.instagram.com/ai.advantage/ Premium Options: 🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community 🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Оглавление (14 сегментов)

  1. 0:00 What’s New! 150 сл.
  2. 0:37 12 Days Of OpenAI 72 сл.
  3. 1:00 Sora Release 284 сл.
  4. 2:10 ChatGPT Canvas Releases 416 сл.
  5. 3:53 ChatGPT x Apple Inteligence 113 сл.
  6. 4:18 Advanced Voice Mode Video Option 665 сл.
  7. 7:04 Lyzr Agent Studio 404 сл.
  8. 9:00 Gemini 2.0 252 сл.
  9. 10:01 Midjourney Canvas 262 сл.
  10. 11:18 Devin Finally Launching! 623 сл.
  11. 13:58 Grok Updates 488 сл.
  12. 16:04 Countless.dev 167 сл.
  13. 16:50 TRELLIS Image to 3D 264 сл.
  14. 17:56 Pearl AI Journal 507 сл.
0:00

What’s New!

wow what a week for AI we had more innovation in this week than probably in the past two months combined at least when it comes to the biggest players releasing new tools on the one hand we have open AI continuing their 12 days of open AI Christmas with some new announcements that we'll be reviewing here but then on the other hand Google made their massive swing with Google Gemini 2. 0 I can see your desktop and your camera also we had the Sora release and Devon AI finally dropped at $500 a month all this and so much more in this week's episode of AI news you can use the show that reviews all the Practical AI tools that have been released this week filtered for the ones that you can actually put to work today let's get into it starting with 12 days of opening
0:37

12 Days Of OpenAI

eyes so I'm recording this on thday night so we had four days of releases so far on Monday we had the Sora release on Tuesday we had chat GPT canvas rolling out to free users and upgrades being made to it on Wednesday we had the Apple intelligence integration which was a little underwhelming and first day we had them ship the camera and desktop recording access of the advanced voice
1:00

Sora Release

mode big so let's talk through these chronologically starting with Monday's announcement the Sora release let me tell you the reception on this release was not good at all a lot of people are mad at the price tag because as I pointed out in a dedicated video on the S release so if you want to learn more and check out the video we'll link it on screen right now but the main issue there is as I pointed out in my release video the $20 plan is just sort of a preview you 50 Generations per month with watermarks you don't get to make full HD files and not just the limitation but also the capabilities limited because you cannot do image to video of humans meaning that if you really want to use Sora it costs $200 which is absolutely a crazy price tag and yeah it is a unlimited plan so if you use unlimited Generations it can be a great price if you're a company that puts this to work but as I've said many times most people don't need AI video yet plus they were facing a lot of issues with signups like heck I still didn't get to create an account and I'm paying for the pro subscription $200 a month and I wasn't even able to use Sora yet bit of a mess there I still stand behind kind opinion that they nailed the platform and the video generated while maybe not being state-ofthe-art is really good okay again if you want more details on this check out the separate video that I made just about the Sora release moving on to Tuesday we had chat
2:10

ChatGPT Canvas Releases

GPD canvas releases and not just that they also overhaul the interface a little bit so if you heading over to my computer you can see that there's a new button here where you can use various tools and these really have been piling up not everybody was aware of all the things hiding in here I mean there's a Code interpreter behind all of this that people don't even know about I guess that is down here the point is that they now upgraded the canvas and not just that they gave free access to all users even without a paid account which I think is fantastic CU canvas is one of my favorite features in here but they also added the ability for chat GPT to run python code meaning if it writes a little script or if it uses some package it can actually run the code and if there's bugs they pop up in the terminal and you can debug right from this web interface no need to copy paste this into an IDE that would need to run the code for anybody working with python this is absolutely amazing and while it doesn't match something like r artifact instead of claudet that can also run HTML and therefore build complete websites like dashboards or portfolio websites and many more this is definitely a step in the right direction if you haven't tried canvas yet well now you can do it on the free plan too and I would say this especially if you're a student the ability for chat to go in and to comment on things like so and then you can just apply these edits to the comment by the way this also works if you're doing it for creative writing like essays or research papers is one of the most powerful ways to work with this because if I write an essay about penguin and canvas here and I want to edit something about it while I don't need to copy paste in something like word and then bring it back into here no I can make my edits right in here and maybe even more importantly I can use the AI assistant I have here with chat GPT surgically by selecting certain parts and prompting on top of it rather than the entire text which is more akin to hitting the entire text over the head with a mace or something like that so those are canvas
3:53

ChatGPT x Apple Inteligence

updates next day we got these Apple intelligence launch and this was more of an announcement that hey this is available in the iPhone now and if you're in the US you can make your Siri called chat GPT I saw one comment on this video which essentially said that this announcement could have been an email and I kind of agree the ditch of how this works and how you can upload files through Siri now but yeah I would just say if you're not a Siri user already just use the chat GPT desktop app for this and we had this for months so not the biggest of news and then
4:18

Advanced Voice Mode Video Option

today on firstday we've got something that has been long overdue the ability for advanced voice mode on the phone to actually see through the camera let's check out a clip here and then once it connects I'll share my video hey chat how's it going I'm doing great thanks for asking I see you're wearing a Santa hat I am and do you see what I have in front of me yes I see a coffee set up with a kettle and a dripper are you planning to make some coffee I'd love to do you think you could walk me through the steps sure I'd love to first place a filter in the dripper and rinse it with hot water to eliminate any papery taste and I think this is pretty amazing especially in the light of the fact that yesterday Google announced Gemini 2. 0 with similar capabilities and they showed off a preview of their project Astra which is essentially this it's a voice assistant that can use your camera so while Google did release their new model that can see your screen and use your camera they have not released this full application this assistant that can use your camera and open AI took less than 24 hours to actually update their advanced voice mode where the voice assistant by the way is just way better I mean in the Google video I mentioned that maybe the interruption on Google's product works a bit better but every everything else the quality of the voices all the styles that it can emulate it's just better about advanced voice mode and if you're on a plus teams or Pro Plan in the US this should be rolling out to you as soon as possible as this just came out a few hours ago this has not rolled out to me or anyone on the team yet but trust me we will be following up with coverage of this either in next week's news you can use or a separate video cuz this is a really amazing feature that everybody has been waiting for it's a true assistant that can actually see and it works for your phone which just gets my mind spinning this obviously opens up a whole new bag of use cases that are possible here very cool and then in addition I guess I should mention they also have a Santa mode which all chat users have now that immediately rolled out to everybody bring a little bit of the Christmas spirit into the chbt app and that's it from open a for now we'll be following this as there is six more days of 12 days of openi most of the things that were expected have been released by now so I wonder what next week holds I guess you'll just have to tune into next week's episode of AI news you can use and I quickly just want to point out that today we're actually celebrating the oneyear anniversary of AI newsic can use I started this show live last December 9th and have not missed a single Friday and at this point I just want to say a big thank you to the entire team that works on these videos Robert Lucas for editing this and Daniel and ariadna helping with the research and preparation for all of these every single one of these episodes takes somewhere from 30 to 40 hours of Labor if you add up all the different people working on it and we've been doing it for a year straight every single Friday we didn't miss a single one we didn't upload on Saturday a single time I think that's quite something and thank you to all the viewers of this for tuning in regularly this wouldn't be possible without you and with that being said let's move on to the next piece of news that you can use one really exciting
7:04

Lyzr Agent Studio

thing we're seeing in AI happen right now is this rise of AI agents and people throw this term around a lot so just quickly want to clarify what a AI agent really is according to me it's a program or a system built with an llm in the background that carries out actions on its own it's really not more complicated than that and obviously if executed well agent like this can be very useful as it can take over your work and this is where I want to introduce you to lizard agents to Studio created by the team at lizard AI the sponsor of today's video this platform takes AI agents to a whole new level by making it easy for anyone yes even non-technical users to build and deploy custom AI agents in just minutes lizard agent studio is an all-in-one workspace for building AI solutions that are custom tailored to your exact needs so for example you could automate customer support optimize existing workflows or even build tools for HR and finance departments within your company and one interesting thing is that these agents don't just solve a task they also evolve over time as they get used and this is really what sets lizard apart from a lot of the competition these agents evolve as they are used and they learn all have focus on safe and responsible AI lizard agent studio is built on the lizard agent framework the only agent framework that natively incorporates safe and responsible AI into the core agent architecture it even includes modules where you can toggle these features on or off based on your needs so if you're just getting started you can use these pre-built agents for various use cases like social media automation knowledge search sales Outreach newsletters and competitor analysis or you can even build your own custom Agents from scratch all without needing to write any code at all whether you're a developer looking for full stack control or a decision maker trying to scale Enterprise automation lizard gives you the flexibility to achieve your goals securely and effectively so go check out studio. lizard. from the link in the description below and start building your very first agent for free today a big thank you to lizard for sponsoring this video and let's get back to the next piece of AI men that you can use
9:00

Gemini 2.0

okay next up we have a bunch of announcements coming out of Google and actually they had so many announcements and most of them you can't even use today with one exception which is the new model that can actually see your camera see your screen and you can interact with it anyway there was so much there that I had to create a dedicated video just on the topic so I'll put a link up on screen right now and there's also a link in the description but the gist of it is what I pointed out here there's a brand new model I can see your screen us your camera it interacts in real time it can input audio and video in real time and output audio in response onor that extremely low latency just like advanced voice mode but hey this thing sees your desktop so it can assist you and then Google built a bunch of these experiences on top of that which they gave different names like project Astra and project Mariner that either uses your phone as an assistant or takes control of your browser to get things done all based on this new Gemini 2. 0 flash model again if you want all the details check out that video it's actually quite interesting and this experience of sharing your desktop and working with the assistant is available today and we try it out in that video okay on to the next story which is mid
10:01

Midjourney Canvas

Journey canvas and this just got announced and it's just a new thing that I quickly wanted to show you here and it's basically this new way of working with M Journey the focus of this is entirely on world creation so characters locations atmosphere they call this Patchwork and it uses the power of the mid Journey generator and the power of consistent styles with the style reference feature for you to generate a plethora of assets all within one board so here right here we have Amara vihan a resilient and resourceful historian obsessed with unraveling the mysteries of simul Kum Hollow and then it just generates a bunch of images of her to represent that so for example if you're writing a book and you need visuals or you're creating a comic strip this is a fantastic way to tell a story to build a world with multiple characters and it's all coherent right you set your one style reference and everything happens in that style same thing for locations look a dilapidated Opera I can't even read that let's move on in the video right here Z finds the weird abandoned theater in the woods and now it combines the character with the location you created previously all bringing it together making it easier to tell a story I find this quite interesting as AI video is moving forward and we're getting features like storyboard inside of Sora mid journey is also moving forward and you can try this out today if you have a ma Journey subscription
11:18

Devin Finally Launching!

that is okay next up we have Devon finally launching if you remember this story this went Giga viral on its first announcement and today it's finally available I think the big headline that everyone is talking about is hey it costs $500 a month which is crazy to most people it's probably the most expensive AI tool that we have seen yet on a subscription basis I mean heck one week ago CET gbt Pro with $200 now Deon with 500 with these subscriptions they're at pricing most of the world so that presents a question is this even worth it cuz with chat Pro that's a whole separate discussion that we could hold but for most people it's just not worth it 01 Pro doesn't deliver much value to virtually everybody that is not a coder and Devon pro has so much competition in the $20 range that why should you use this well let's talk about it and I have to say I'm saying this from the perspective of somebody who did not purchase access to this and hasn't played with this yet just because I personally don't really see the value and I'm also not clear yet on how I would even test this in a context that makes sense for this channel because what Devon essentially does is it connects to your slack and it acts as a junior developer that you hired and you can text it inside of slack and let it do various things it will figure out how to do them by itself and let you know that it saw the various messages but at the end of the day you're relying upon an AI agent to do things for you and the state of AI agents today is such that you cannot fully rely on them even if you achieve 80 90% reliability imagine an employee that completes his task eight out of 10 times and two times comes back with something that is completely disjointed from reality that's not an employee you're going to keep around and that's why in the development space idees like curser have seen such massive popularity because they give you the ability to enhance your existing skills they don't promise something that's currently impossible in my opinion which is doing all the tasks from A to Z but Deon has exactly this promise and that's why it also comes with this high price tag which obviously if that would work a junior year developer for $500 would make sense for a lot of people but from everything I've seen on the internet and that's why also this launch seems to be a bit underwhelming and it's not getting so much hype so far I'm just reporting what I'm seeing here it's not perfect it's just built on the state-of-the-art llms which also are not perfect and no matter how well you orchestrate these agents in the background which Deon seems to be doing extremely well still going to be a little inconsistent and you're going to send the agent out to do a task and it's going to come back 15 minutes and meanwhile you're just hoping that it gets it done correctly so interesting release and I love this category and I really love the vision they're going after it just feels like the state of llms currently only supports a workflow that enhances human ability and doesn't fully replace as a junior developer on most tasks but again that's just me echoing the opinions of what I've seen on the internet and maybe reflecting on some of the experiences I had with some of the Devon competitors that are currently available all right next up
13:58

Grok Updates

let's talk Rock announcements I always like what they do if you're following the channel you might know that I'm kind of a fan of their philosophy which is hey keep the AI truthful and let grown-ups make their own decisions I like that but what they did here well I don't know maybe it's a step in the wrong direction they released their own image generator okay and before this you might know that they integrated the flux 1. 1 model which was state-of-the-art and that's why Gro was extremely good at generating images quite unhinged ones in some cases now they replaced this image generator with their own one which they called Aurora now unfortunately for gro we actually went ahead and tested this versus some of the other state-of-the-art image generators as we usually do and here's some of the results so this is what moury puts out from this logo prompt this is what flux 1. 1 Ultra puts out and this is what Aurora puts out clearly M journey and flux are just better then we have a different portrait photography prompt mid Journey flux Aurora over here I guess it's okay I still think flux is best here here's a cinematic still prompt again I think flux and Ma Journey win although it's closer than I would have imag imagined drone shots look at that it's not bad it's just not the best and then on text I was surprised look at that it actually did Super well Aurora kind of killed it on this one it might be even my favorite I mean okay this the artistic value of these maoury outputs here are insane but if you just want that image not album cover or movie poster this is perfectly fine and here's one more test prompt that we usually run of a superhero character on top of a moving train and yeah just moury Nails it here mour is so good at these this is arguably the worst result for Aurora but overall not so bad but definitely not state-of-the-art so instead of Gro you can access this now which goes along nicely with a second announcement which is the fact that grock is now widely available to everybody even to free user before this it was only for premium users now as mentioned before it is not a s tier model it's just inferior in certain ways like prompt adherence writing style quality code generation quality it's kind of just a bit worse across the board except of the fact that it can pull together data from Twitter and now everybody can access that and I'm sure they're going to keep iterating on this over time s go new image generator in grocket and it's freely available to all Twitter users up to 10 prompts every 2 hours and 10 image Generations every 2 hours okay next up
16:04

Countless.dev

this one will be super quick but it's a brand new website that I ran into it's called countless dode and it ranks all of the various models based on whatever you want actually check this out so if you want to see which model has the longest output LMS you could just rank it by that and voila you can see right here Jamba 1. 5 and Google vertex Ai and mistol cod stroll Mamba black mamba can actually put out a quarter million tokens impressive you could do the same thing for input length where Gemini 1. 5 Pro clearly wins with 2 million tokens of inputs look at that you also have a pricing calculator here and you could compare different models to each other just a free website that I thought was kind of cool and if you ever ask yourself which model does a specific thing best this might be a good bookmark and now let's move on to the next story
16:50

TRELLIS Image to 3D

trellis the most popular hugging face space of this week and we'll cover this quickly but essentially this is a brand new image to 3D generator that performs pretty well with seen many of these pop up across the past year or two but as somebody who doesn't work in 3D a whole lot I can only judge my subjective Impressions but this is not bad look at that honestly when I saw this I just thought to myself wow this is probably the highest quality I have seen yet you can input custom images but I just wanted to include this to show you that from what I can see these are getting better and these models live in between the two other options that people who build games or work with 3D have one of them is just modeling from scratch which takes forever compared to doing something like this where you can have a dragon in seconds or the other end is buying prefabs which can be pricey and if you want to customize you'll have to edit this anyway this kind of falls in the middle you get something custom immediately and then yeah you might need to clean it up improve it but with this image to 3D models getting this good it should be too long until we see this implemented in some consumer friendlier way I don't know leave a comment below if you know of a use case for image to 3D I'd be interested to see something like that in practice okay next up we
17:56

Pearl AI Journal

have this little tool this is completely free also not sponsored I would always mention that but I just thought this was sort of cool it's an AI journaling app where you can log in for free and just kind of give this a shot so I'm going to do exactly that right now and just journal for a second and see how this goes Daniel on the team that has been researching this told me that this really positively surprised him and that AI power journaling experience might be the little push that even he needed as he decided to start journaling recently but wasn't able to maintain the habit but as per usual with a new habit sometimes it's a bit tricky to get into the routine of it and here you have a AI to reflect on it so let's have a look let me describe how I feel today okay so there are some of my thoughts about this day this was a very long work day with all of these announcements Wednesdays are always busy AI news you can use always gets it done and see you can just say reflect and it will come back with a question here how does spending such a long day focused on AI and internal systems affect your overall well-being and work life balance well there is no balance it's all work as AI goes crazy uh okay let's reflect on that one too would you like another question how do you plan to recharge and take care of yourself after such a okay interesting so I guess now this is sort of acting like a therapists kind of but then also up here it analyzes what the mood of the day was and when you come back on the next day you can kind of keep track of how you were feeling how you were doing you can get suggestions you can have a little conversation in here oh look it even says anxiety I think it might actually be right I didn't even consider that I will go for the walk in the morning and read in the park for a bit reflect on that how do you feel your productivity and creativity are affected by such long intense work sessions let focus on AI and internal systems that's a good question so there you go so this is just a different way of journaling you can try this out you can come back a few days in a row see how this goals to cont track your moves like this I think it's sort of interesting to have this creative collaboration with the AI here I don't know it's completely free thought this was interesting wanted to feature it all right that's all we got this week don't forget to check out the Gemini and the Sora release videos that we uploaded this week and with that being said I'll see you next week and I hope you have a wonderful day

Ещё от The AI Advantage

Ctrl+V

Экстракт Знаний в Telegram

Транскрипты, идеи, методички — всё самое полезное из лучших YouTube-каналов.

Подписаться