This Free AI Is Smarter Than Most Humans
16:25

The AI Advantage · 03.01.2025 · 16,492 views · 604 likes · updated 18.02.2026
Video description
Test drive your ideal AI PC including the new AMD Ryzen Pro today! 👉 https://ryzen.pro/en/laptop/?utm_medium=social&utm_source=youtube&utm_campaign=q3_comm_influencerryzencpu_36pi&utm_content=theaiadvantage

We're back from the holiday break with some big news and releases in the AI world, including a new model from China that rivals OpenAI's o1, new interactive podcasts from NotebookLM and ElevenLabs, and more!

Links:
https://chat.deepseek.com/
https://blog.google/products/gemini/google-gemini-deep-research/
https://qwenlm.github.io/blog/qvq-72b-preview/
https://x.com/midjourney/status/1871630465532297489
https://x.com/MushtaqBilalPhD/status/1874131430491775124
https://elevenlabs.io/blog/genfm-podcasts-in-projects
https://huggingface.co/spaces/hkchengrex/MMAudio

Chapters:
00:00 What’s New?
00:38 3 New Reasoning Models
05:41 AMD Ryzen AI
07:44 Midjourney Sref Codes of 2024
08:29 NotebookLM Interactive Mode
09:10 ElevenLabs GenFM
13:21 MMAudio

#ai

This video is sponsored by AMD.

Free AI Resources:
🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter
🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0
👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/
🐦 Twitter: https://twitter.com/TheAIAdvantage
📸 Instagram: https://www.instagram.com/ai.advantage/

Premium Options:
🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community
🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Contents (7 segments)

  1. 0:00 What’s New? 148 words
  2. 0:38 3 New Reasoning Models 1202 words
  3. 5:41 AMD Ryzen AI 447 words
  4. 7:44 Midjourney Sref Codes of 2024 186 words
  5. 8:29 NotebookLM Interactive Mode 162 words
  6. 9:10 ElevenLabs GenFM 1010 words
  7. 13:21 MMAudio 664 words
0:00

What’s New?

The first episode of AI News You Can Use in 2025. We had a little Christmas break: not many releases, and I wanted to give the team a little break too. But now we're back at it, covering everything that came out over the course of the past two weeks. We'll be starting with a look at the various reasoning models that came out. I mean, heck, now we have Chinese competition to o1, and Google also released a reasoning model. Then, later in the video, we'll be exploring a way to create custom podcasts with your very own voice in a super intuitive way. This should be a fun episode of AI News You Can Use, the show that looks at all the AI releases of the past week and filters them for just the stuff that you can use right now. Let's get into it. All
0:38

3 New Reasoning Models

right, so to kick things off, I want to talk about these various reasoning models and some new open-source competitors that came out over the course of the past two weeks. There are a few releases in this category, but I'm going to compress this into one segment. Namely, we're going to talk about DeepSeek V3, the new open-source king in terms of LLMs, leading all the leaderboards and even beating models like Sonnet and GPT-4o while being fully open source. Then we have Google Gemini's Deep Research; this is their take on o1. And similarly, we have a new Alibaba model called QVQ. This is the Chinese counterpart to o1, with the slight twist that they focus on reasoning over visual inputs, not just text inputs.

The reason I want to talk about these together is that they are all obviously trying to catch up with what OpenAI released with o1, and now the o3 preview that we got in December. As you might know, all ChatGPT Plus subscribers now have access to o1, and on the new Pro tier you get o1 Pro. But the problem with demoing and testing these things is that, yeah, they're really good at reasoning over specialized problems, and most people don't have complex coding questions or specific scientific inquiries as a part of their routine. Therefore these models are not as useful to them as they might initially seem.

Now that we're getting a plethora of models, I had an idea of how we could tackle this as a community and answer the question of "What could I use this for? What are other people using this for?" That idea is running another public challenge over the course of January. The stream for that is already scheduled; I'm going to be hosting it this coming Monday, but you can already check out the challenge, and I'll be doing these once a month. In other words, I will be asking people to submit various use cases for these models, like o1, QVQ-72B, or Gemini Deep Research. The most useful ones will get prizes, and the entire community gets to see what people are actually using this stuff for. I think overall that's just a great idea.

Look, I wish I could go in here and give you specific examples of how to use this, or tell you whether QVQ is better than o1, or how Gemini Deep Research relates to o1 or o1 Pro. But at the end of the day, a lot of the outputs are just very similar in a lot of the tasks that I use these for day-to-day, and it seems kind of meaningless to just throw random logic problems at it here in the video. That's good for benchmarking, but what I care about here on this channel is how to use the technology in your life.

With that being said, there are a few details about these releases that I'll quickly summarize for you. This Chinese model, QVQ-72B-Preview, is a reasoning model that focuses on the vision aspect, so this is something OpenAI hasn't done yet. Now, as you might know, this is something that's actually included in the o1 product, as you can give it images and it can interact with them. But the difference is that this model is actually freely usable. You just have to attribute that it has been built with Qwen, but you get the full model including the weights, and they give you the license to use this for whatever projects you want to build with it.

Then we have Gemini's Deep Research, which we tested here for you. Again, the conclusion is that the differences between these models are just very use-case specific, and we'll have to crowdsource this within the community, as I said, with the challenge, to really learn what people are using this for. I can tell you that on the basic coding tasks I've been doing with o1 and o1 Pro, and this is purely based on my intuition and usage of these tools, all of these competitors don't even come close to o1, especially o1 Pro. When I have coding problems, I throw them into o1 Pro and it just figures them out. It works like a charm. When I tested some of those on Google's Deep Research, or ran them through Sonnet 3.5 or DeepSeek V3, the consistency of the results just isn't as high.

Is o1 Pro worth the money? No, probably not. For most people, absolutely not. But is it a better model than anything that we've seen from the competition? Here I'll go out on a limb and say yeah, it does feel like that. But even if we accept that as a fact, honestly, o1 completely suffices for pretty much any task you will be throwing at this; o1 Pro is probably sort of a luxury product. So in summary: o1 is fantastic, and I would recommend you start using it and testing it with some prompts where you think you might not even need it. o1 Pro is, simply put, a luxury good that is probably unnecessary, and Gemini Deep Research and QVQ are just slightly weaker versions of this. Again, I'm not speaking from benchmarks. I'm just speaking based on what I saw, how I use these tools, and what I've seen expressed within our community.

That leaves us with the last model here that is brand new, and that is DeepSeek V3. Now we have a fully open-source model that performs better than GPT-4o and better than Sonnet 3.5 on some benchmarks, with the big difference that this is not just fully open source: they also provide you with this website, chat.deepseek.com, where you can freely use it. So you get 4o-like quality for free, plus you could use this model for anything you want. You also have this option here to turn on the reasoning capabilities, turning this into sort of a watered-down version of o1. Now, on this one I can very confidently say that it is actually way worse than o1. This doesn't even come close. Gemini Deep Research sort of does, I would say, but those are all paid products. This is completely free: you can just go to this website and use the reasoning model or the base language model, and that just works. At this point all these models just do great, but you'll probably know that if you're consuming this video right now.

And there you go, that's all the news on the latest model releases that happened over Christmas and the last week of 2024. Again, check out the upcoming stream and the challenge we'll be running to see what this community that views these videos is actually using these models for, because they are very useful, but always for very specific situations. I can't wait to hear from maybe even you on how you have used these to
5:41

AMD Ryzen AI

enhance your life.

Lately I've seen a lot of interest in running AI locally, whether it's for privacy, security, or just knowing that you always have the AI available even if something is wrong with OpenAI servers, like during the recent Sora launch or the outages in December. But to take advantage of the latest local LLMs, you do need a proper setup. If you're looking to upgrade your rig or are in the market for a new laptop, let me introduce you to the Ryzen Pro from AMD, the sponsor of today's video.

Now, the great thing here is that the AMD Ryzen Pro checks off all the boxes when it comes to AI readiness. It is optimized to run AI locally, it has enterprise-grade security and top-of-the-line graphics, and it provides all-day battery life. Predictably, my favorite thing about this is the built-in AI readiness. They're truly designed from the ground up with AI as the end goal. They've got a combination of a dedicated NPU, GPU, and CPU that lets you run AI-powered apps directly on your PC without needing to offload stuff to other devices or the cloud.

But even apart from the AI aspect, the Ryzen Pro processor is fantastic at anything else you might want from a processor, like security or battery life. There's a built-in AMD Secure Processor that verifies code before it runs, helping ensure that your data stays safe, and that comes along with many other security features like AMD Memory Guard. Ryzen Pro processors can give you up to 29 hours of battery life in a laptop. That is a lot of time: you can work or play all day and then some. Plus, they also make life easier for IT teams that are all on the same processor, with features designed to minimize disruption and reduce support costs. So whether you want to use AI applications for yourself or as a team, the Ryzen Pro helps you keep things running smoothly.

And here's the best part, check this out: they're actually letting you try out this processor inside of a laptop for free. Yeah, you heard that right. They will lend you a laptop for free so you can see if this is something for you, no strings attached. So if you're ready to rethink your workplace with the AMD Ryzen Pro, hit up the free loaner offer from the link in the video description below. A big thank you to AMD for sponsoring today's video, and now let's move on to the next piece of AI news that you can use.

Okay, this is just a
7:44

Midjourney Sref Codes of 2024

super quick feature, but Midjourney shared some of these sref codes. If you're not familiar, you can think of these as a very specific style that is captured inside of a preset, which you can access by using one of these sref codes. In the end, these are literally the six most used codes of the year, so these are the styles that people have been using the most inside of Midjourney. If you like any one of these six, you can simply include it at the end of your Midjourney prompts. That is behind a paywall, but I thought this was a wonderful thing for them to share with the public, because these sref codes are truly powerful. We have a set that we use for the AI Advantage branding, and if you ever want to create graphics that look consistent, this is a great way to do it. Here you can simply use one of these codes, and now you're at least aware that these are the popular codes that people have been using all
8:29

NotebookLM Interactive Mode

year.

Okay, on to the next story, which is one that I'm quite passionate about myself, because I really like NotebookLM. They're extending it with a new feature. We're not going to be focusing on that because I don't think it's that big of a deal, but it's basically this interactive mode they're adding so you can participate in the podcast that NotebookLM will generate for you. If you're not familiar, it's an app where you add a bunch of sources, and then it summarizes them in quite an advanced manner and turns that into a human-sounding podcast. Now you can join the conversation: you turn on the mic and you talk to the two podcast hosts. You can think of it as continuing a ChatGPT conversation; you're just doing it with a voice interface. So I quickly wanted to point that out, but it perfectly transitions into the main story that I wanted to highlight here, which is Eleven
9:10

ElevenLabs GenFM

Labs shipping like crazy. At this point I would like to make a little disclaimer: ElevenLabs has never sponsored this channel whatsoever, but they keep shipping so many features, and the quality of these products is so high, that I just love featuring them. And that's what we do in this show. Just to reiterate something that I've often said and that is very important to me: every single thing that we've done at the AI Advantage that has ever been sponsored, we clearly communicated. That's extremely important to me, and you can hold me accountable to that.

With that being said, here's the new Projects feature, which is essentially NotebookLM but with way more customization. You can create these podcasts, but you get control over the dialogue and control of the voices, and you can use your own custom voices. Let's have a brief look at this together. If I log into my ElevenLabs account and go into the application, this thing has been expanding. I remember a year ago there was basically just voice creation. Now you can do things like this, and this is the feature I'm referring to. They call it GenFM, and it comes under this orange icon that says "Create a podcast". I'm just going to take a random Wikipedia article here, the article on New Year's, and put it in as the input URL. In my headphones I should hear these voices:

"A single rose can be my garden."

And then Jessica:

"If you spend your whole life waiting for the storm, you'll never..."

So how about we change this up: we keep Chris, but Jessica we're going to switch out for my very own custom voice, which sounds like this:

"As we are liberated from our own fear, our presence automatically liberates others."

Okay, and one little pro tip when using some of these custom voices, since we've played around with all of these: you want to actually go to the Turbo model, and if you don't need multilingual, just go with Turbo v2. That just has the highest quality and makes way more of a difference than some of these quality differences that are hidden behind the more expensive plans. Standard will do for now, and I'll simply say "Generate", and it will look at this article, come up with a script, and do the voices by itself.

Now here's the kicker: once it's done, I will be able to actually edit the text, which is something that you can't do in NotebookLM. They have this customize feature where you can prompt and change the whole thing, but you don't get a text editor interface where you really get to dial in what the podcast hosts say. And of course you don't get the ability to use a custom voice, like my own voice that I trained into this.

And there you go, that took about 3 minutes. Let's listen:

"Today we're exploring how cultures around the world mark the passage of time in ways you might never have imagined."

"Well, that's certainly an intriguing way to kick off our discussion. I'm curious to hear more about these diverse traditions."

In this case you might want to play with the stability here. This is something that NotebookLM does really well: it just legitimately sounds human because you get the interaction and they do the little pauses. You can get a little bit of that with these sliders:

"Well, that's certainly an intriguing way to kick off our discussion. I'm curious to hear more about these diverse traditions."

As you can see, the pacing there sounds more natural than before. And that's basically it. You can go ahead and edit this transcript, and once you're happy with it, you can convert it into a project. In this case I'm just going to shorten it so it happens faster, say "convert project", and then it's automatically going to render our audio file for us. Ah, so that was pretty fast. Let's have a quick listen:

"And in some areas, Easter was considered the start of the new year. You can imagine how confusing that must have been."

"Hm, yeah, I can see how that would lead to some serious calendar chaos. So when did other parts of the world start celebrating on January 1st?"

Pretty good, and that's my voice right there. So that's definitely something you can't do inside of NotebookLM, and now you know how to do it too.

Now, I wanted to highlight this because it's an interesting use of multiple tools that we already have; they just combined them in a unique way. That's what a lot of useful AI tools are like: they're not reinventing the wheel, they're just combining different building blocks in an unexpected manner. I think when we look back at 2024, NotebookLM was the app that surprised everybody by how sticky and useful it was. People just love that product and keep returning to it, and now you can have a custom version of it.

Also, by the way, I uploaded a video recently where I showed you how to build an even more customized version of a meeting summarizer that is similar to this: sort of like a custom NotebookLM, but really custom, even more than this. Different people are going to have different needs, but if you want to learn how to do something like this yourself, self-hosted, without any technical knowledge required, you might want to check out that video. If you want a small amount of control, ElevenLabs it is. And if you don't need the control at all and you just want an audio file of two people talking about a topic, just use NotebookLM.

All right, next up, very
13:21

MMAudio

briefly: this just popped up on my radar, and I wanted to introduce it because it shows a new category of tool that I think we're going to see a lot of over the coming year. As you can see, a lot of these tools and techniques are starting to merge, and that is really the underlying trend here. Think about o1: it's something that people discovered through prompting. It's chain-of-thought prompting, "just think step by step", and then they embedded that into the model on a deeper level by actually training it with those examples and that prompting approach in mind. Now, surely that's a gross oversimplification, but the basic idea is: you had a large language model, then you had a prompting method, and when you merge those you get something better. Same thing with ElevenLabs, which we just looked at: you have a custom voice, you have an LLM that generates text, and the LLM can also generate podcasts. What if you use the custom voices to voice the podcasts? You just get something brand new.

Because all of these AI tools are built on transformers, they're really good at transforming one thing into another, and I think a big theme of this year is going to be multiple transformations being combined. That's exactly what this GitHub repo right here, called MMAudio, does. It's basically a video-to-audio synthesizer. I know most people won't be using this; I just want to open your minds to the possibility of these transformations happening in every direction. Everybody has seen text-to-text by now, right? That's just basic ChatGPT. Pretty much everybody is also aware of all the text-to-video progress, and image or text-to-audio. And right here is the open-source version of something that you might refer to as video-to-audio. Plus, it also syncs it at the same time. So you give it a video file, from Sora here for example, and it comes up with what it should sound like.

Okay, yes, I don't know about those AI videos, but I think you see the point. And again, I'm not bringing this up because you will be using this in your next project. I mean, you certainly can; it's out there, and the link is in the description below. But I just want you to be aware that you can transform any form of media into any other form now, and people will be combining this. Really, all these agents that we keep touching on again and again are just very specific ways to transform one piece of information into another, and then obviously there's a lot of orchestration that happens there. Just like here, where it generates the audio and then, so to say, orchestrates it so that it syncs with the video, agents also just transform one form of input into an output, and they just need to coordinate for it to be productive. Understanding that in 2025 every piece of media can somehow be transformed into something else might just open your eyes to solving problems or to understanding some of these tools more in depth, and that might give you an advantage over a person who maybe used ChatGPT for 30 seconds and called it a day.

With that being said, I'm genuinely excited about going into 2025 together with you, exploring this landscape and covering everything that matters. As per usual, we'll be mixing in tutorials, because this channel is all about the use cases and education around them, and I'm just grateful to have you along for the ride. With that being said, check out the challenge that I pointed out. I can't wait to see what this community is using o1 and the other reasoning models for. And with that, I hope you'll have a wonderful day.
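
The "multiple transformations being combined" idea from this segment can be sketched in a few lines of Python. This is only an illustration, not how ElevenLabs GenFM or MMAudio work internally: the three step functions (`fetch_article`, `write_script`, `voice_script`) are hypothetical stand-ins that return labeled strings where a real pipeline would call a scraper, an LLM, and a text-to-speech model.

```python
from functools import reduce
from typing import Callable, List

# A "transformation" here is just a function from one string to another.
Step = Callable[[str], str]


def compose(steps: List[Step]) -> Step:
    """Chain steps left to right: each step's output feeds the next step."""
    def run(value: str) -> str:
        return reduce(lambda acc, step: step(acc), steps, value)
    return run


# Hypothetical stand-ins (assumptions, not real APIs): each one only wraps
# its input in a label so the order of the transformations stays visible.
def fetch_article(url: str) -> str:
    return f"article({url})"


def write_script(article: str) -> str:
    return f"script({article})"


def voice_script(script: str) -> str:
    return f"audio({script})"


# URL -> article text -> two-host script -> voiced audio
podcast_pipeline = compose([fetch_article, write_script, voice_script])

if __name__ == "__main__":
    print(podcast_pipeline("wikipedia-article-url"))
    # prints: audio(script(article(wikipedia-article-url)))
```

Swapping one step for another (say, a different voice or a different script writer) changes the result without touching the rest of the chain, which is the "building blocks" point made above.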
