Advanced Voice Tricks & More AI Use Cases
21:12

Advanced Voice Tricks & More AI Use Cases

The AI Advantage 27.09.2024 31 123 просмотров 945 лайков обн. 18.02.2026
Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
Use code AIADVANTAGE10 to get 10% off your Hostinger plan! 👉 https://hostinger.com/aiadvantage10 Today I'll showcase some of the most impressive capabilities of OpenAI's new Advanced Voice Mode, plus I'll tell you everything you need to know about the new Gemini and Llama models, and much more. Links: https://youtu.be/mJ1rWch5Ekw https://x.com/sama/status/1838864011321872407 https://developers.googleblog.com/en/updated-gemini-models-reduced-15-pro-pricing-increased-rate-limits-and-more/ https://blog.google/technology/ai/notebooklm-audio-video-sources/ https://huggingface.co/spaces/lamm-mit/PDF2Audio https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/ https://x.com/LeonardoAi_/status/1837122201944047941 https://x.com/tripoai/status/1836790302876524615 https://www.reddit.com/r/StableDiffusion/comments/1focbhe/invoke_50_massive_update_introducing_a_new_canvas/ https://www.invoke.com/pricing https://klingai.com/release-notes https://docs.qingque.cn/d/home/eZQDvlYrDMyE9lOforCeWA4KP Chapters: 00:00 Whats New? 00:22 OpenAI’s Advance Voice Mode 05:58 Hostinger 07:32 Two New Gemini Models 08:23 Llama 3.2 10:12 Meta AI announcements 11:26 PDF2Audio 14:55 Tripo AI v2.0 16:51 Invoke 5.0 17:52 Leonardo Upscaler 19:54 Kling 1.5 and Motion Brush #ai #openai #chatgpt Free AI Resources: 🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter 🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0 👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/ 🐦 Twitter: https://twitter.com/TheAIAdvantage 📸 Instagram: https://www.instagram.com/ai.advantage/ Premium Options: 🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community 🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Оглавление (11 сегментов)

  1. 0:00 Whats New? 89 сл.
  2. 0:22 OpenAI’s Advance Voice Mode 1110 сл.
  3. 5:58 Hostinger 387 сл.
  4. 7:32 Two New Gemini Models 184 сл.
  5. 8:23 Llama 3.2 400 сл.
  6. 10:12 Meta AI announcements 275 сл.
  7. 11:26 PDF2Audio 757 сл.
  8. 14:55 Tripo AI v2.0 397 сл.
  9. 16:51 Invoke 5.0 248 сл.
  10. 17:52 Leonardo Upscaler 485 сл.
  11. 19:54 Kling 1.5 and Motion Brush 299 сл.
0:00

Whats New?

okay let me tell you practical AI releases have been heating up recently this week we got open ai's Voice Assistant upgrades to various products like Google's notebook LM and their very best models meta released a bunch of AI features and a new version of their llama models and so much more me and my team researched all of it tested a lot of it and in this week's episode of aus you can use we serve it on a silver platter to any screen of your choice
0:22

OpenAI’s Advance Voice Mode

let's begin all right so the big story of this week obviously was open AI releasing the advanced voice mode now a few updates since my release video that I recorded put it right away first things first the roll out is complete it actually took them less than 24 hours to get this into every plus and team subscribers phones that's really amazing except that people in the EU UK and a few more countries didn't get access due to regulations it was actually interesting that Sam for the very first time tweeted about this fact and he mentioned that these jurisdictions require additional external review meaning we'll get the future it's just a question of time but I myself I'm in Europe and check out my phone right here hey what's up not much just here to chat that's advanced voice mode and I shared the step-by-step way on how to access this if you're from Europe and want to get it it's simple you uninstall the app you connect to a VPN doesn't matter which one a lot of phones have this built in already just search your settings then you reinstall the App log back in and voila you have advanced voice mode on your phone so that's a nice little workaround I want to point out one more thing in the video that uploaded right away I gave you a bunch of presets matter of fact I gave you over 15 different prompts that you can run in there all of them take different angles and assume that you have a certain profession and then there's different role playing or brainstorming scenarios that you could engage in and my hope is that these prompts will spark your creativity and show you what's possible here now I did make one mistake though I made the video as soon as it released I didn't have it on my phone yet I said that you simply need to copy them from the video description right here I will actually copy them directly from the product that we developed with 33,000 prompts for the voice assistant in total so for example here with the roast comedian and if you want to generate fresh material you would just copy it from your very own copy of the notion template heading over to cat GPT and the correct thing is not actually posting it as a message here you go to the custom instructions which you can access in the settings personalization customize chat GPT and here in the first box you just paste this preset save and now you set up your voice assistant to act as a rast comedian that will help you prepare fresh new materials AK brand new jokes you just have to make sure to be in a brand new chat and now if you engage with the voice assistant all right let's begin awesome give me a current event or pop culture topic and I'll cook up some roast angles for you yeah what about Sam ultman now actually owning a part of open AI oh that's interesting Sam mman well I guess when it comes to AI Sam's all in he's got more investment in Bots than a Sci-Fi villain now he's not just calling the shots he's owning them too oh brother this guy stinks okay I mean fair enough can you be a little more ruthless and comedic got it so Sam mman now owns a piece of open AI looks like someone's taking owning the future a bit too literally this guy so deep into AI soon he'll be arguing with his toaster about who controls the bread okay I don't know about those jokes but I think you get the point this is the way you set it up there's a bunch of presets I gave you to show you the possibilities and we created an entire product which is all about showing you what's possible for a grand total of 1,600 professions this is how you correctly set it up just wanted to put that correction out there beyond that the voice assistant has been well probably the most entertaining release in AI ever I would even State because it has one standout feature and that's imitating voices now it's lobotomized enough so that it can't address specific names my name is Jeff so if you name celebrities and just say speak in their style it won't do it but you could broadly describe the characteristics of That Celebrity and then often it will work or you just go with something simple like in this case I want you to roast Marvel for their degrading movie quality but do it in the voice of a Russian mobster I can't do specific voices but imagine this with a thick Russian accent Marvel movies they used to be like rare caviar now they are more like water down bed served in Every Corner Cafe no yes my friend I think you get the points you could even tell it to imitate your very own accent and it will do that all right now I want you to roast the Apple for the lack of innovation over the past years and I will have my cigarette in between listening to your yapping go ahead do it in my accent too oh no no no oh no my French exit was so bad that it actually fought I was French I don't know what to think of that but as you can see you can make it switch voices it's multilingual Styles it's incredible and with all of the prompts for inspiration from the first video the opportunities here are truly endless so go out and enjoy it I just wanted to tease that we're already preparing a big video with various use cases where we go a little deeper into different use cases for various professions so look forward to that on the YouTube channel and until then if you don't have the plus plan you can actually go ahead and use Gemini's alternative it's not as good but you can get a one Monon free trial and also get a taste of The Voice Assistant even grock rolled out some voice features now when I updated my app it now has voice input it can speak yet so next to that and meta AI announcements that also integrated the voice feature into meta AI also not available in Europe we just got all of these different voice assistants throughout the past weeks open AI being the best one by far all
5:58

Hostinger

right so with the next tool I'll show you how to build one of the most important things you can have in 2024 your own websites that is clean yet professional and to do this let me introduce you to hostinger the sponsor of today's video hostinger makes it easy to build and host all kinds of websites that's a great platform for anybody new to website building especially with all the AI features that they recently added they even have this AI website builder that I'll show you now all I need to do is give it a few descriptive words and VOA you have a website so once you sign up for the link in the description below you will be faced with this welcome page if you decide to go with a business plan it even comes with a free domain so you could just go here and look for yep I could just get the website Advantage right here but now let's get into the promise step by step of building the actual website I'll just click set up here with the business web hosting I will be done in no time I'm doing this for myself or my business I'll create a website I'll go with the hosting a website builder because this is the simplest alternative right here I'll secure the domain for myself and less than a minute later I can start creating my website with AI I'll make this a portfolio page for this imaginary project which is AI website gallery and I'll keep this really simple but the more detail you give it the more it will incorporate and that's it I'll hand this off to the AI and there it is a clean and professional website with our various projects all I have to do now is change out all the data like the pictures or the text but this is absolutely beautiful functional I can just go to edit and change all of the media just like so they even help you with the simple checklist let's be real could this be any easier if you want to use this yourself check out hostinger first link in the video's description and now let's move on to the next piece of aanus that you can use okay
7:32

Two New Gemini Models

next up we have some major improvements from Gemini's models namely 1. 5 Pro and Flash this is sort of like GPT 40 and this one is like GPT 40 mini so these are all API improvements but the 1. 5 Pro API has been really good I talked about Google's AI Studio before it has the biggest context window the model is really capable and you can upload videos to it straight up it's the only model that can do that now it has 50% reduced prices both models have higher rate limits two times faster output and lower latency this is going to play really well with the voice mode which they need to upgrade further to catch up with open AI but nevertheless these are some major upgrades and if you check out all the benchmarks although they didn't release new models Gemini 1. 5 Pro improved so much since May this is kind of ridiculous Google is really catching up they're not making big waves like open AI with every single release but these products are getting super solid and
8:23

Llama 3.2

fast and next up talking about new llms we got to talk about the brand new models from meta namely lree. 2 this is the first time meta is open sourcing multimodal llms meaning they can also process images but it doesn't end there because with this family of models they open sourced a bunch more than just Vision models there's a bunch of them that are really small meant for on device usage on phones and because these are open source people can build with them now but some of them are really specialized towards specific use cases I'm talking about these super small 1B and fre billion parameter models that are specialized for one thing and meta just doesn't care they open sourcing it all right here so people can start building apps on top of this or simply include one step in their workflow enabling workflows for developers that are just not feasible if you're calling something like the big chat GPT models that have no way of running locally as you can see in these little gifs all of the generation happens on device because it's local and also as mentioned these include Vision models so if you're following the show you would have noticed that over the past months actually a lot of vision models came out some of the best ones came out of China but now we have llama beating some real contenders like GPT 40 mini or claw-free hiu that's their smallest model but again I cannot overstate the fact that these models you can download and use whereas gbt 4 mini you can't you need to pay for the API and go to the Internet this you could just run locally all you need to do is flat this information and then you can download the models or they're also available over on hugging phase so look I realized for most viewers of this show this is not something they'll be doing right away but just know that meta is not stopping on its conquest of leveling the playing field for all players which in my opinion is really good because now we have a open-source alternative that anyone can build on that is as capable as GPT 40 Mini Plus developers can easily build things like mobile writing assistance without needing to enter the wall Garden of something like apple
10:12

Meta AI announcements

intelligence but I just quickly want to touch on meta's AI announcements there have been some incredible things in there I mean obviously the Orion glasses were mindblowing I guess most of us are in Tech so if you haven't seen this you need to check this out this is probably the future Computing platform especially if you pair it with multimodal AI but my point here is a different one some of these meta AI announcements are exactly what I've been preaching for a while and it might be helpful to contextualize all the content in this weekly show because we didn't see anything revolutionary out of the meta AI announcements there's photo editing with prompts inside of their meta AI app there's a voice assistant there's AI avatars and they're adding voice dubbing to their shorts meaning you can upload in one language and then it automatically translates into others it's not perfect arguably there's even better tools out there right now if you were to do it manually but the point is all of these innovations that we look at week by week eventually will be gobbled up by these Giants and implemented into their applications and as they already have the distribution with Instagram Facebook WhatsApp and messenger they don't need to be first they let everything shake out and then adopt the best features into their apps like they did with the meta AI announcements I mean heck just a week ago we looked at an open source model that does photo editing just like this new feature and soon it's going to be available inside of Instagram okay so
11:26

PDF2Audio

I have to show you this app that I was really excited to find because we talked about notebook LM before most other people actually talked about it I'm kind of excited about the fact that we featured an episode of news you can use with the brand new voice updates and then about four to 5 days after upload a lot of the internet caught onto it and I won't take the credit for that our community disc discovered it in there was a lot of conversations around notebook LM for a few months already concretely the educator sub Community was using it all the time of their students or to research different projects my point being that notebook LM is still really good and it doesn't seem to be stopping anytime soon because right now they shipped two new updates one of which I was asking for since the beginning it's the ability to include YouTube links into your sources a quick recap of notebook LM is that on the left side there's a bunch of sources you can add different articles different PDFs and now also YouTube videos that it then transcribes and takes into its knowledge base then at the bottom you have a chatbot which can suggest prompts and those prompts run on the entire knowledge base of your notebook so if you're researching a new topic you just throw everything in there and then you talk to the entire knowledge base and the standout feature that caught everyone of g a few weeks ago was the audio overview which generated the audio podcast that just sounded really good really human so you could listen to audio summary as presented in a podcast conversation between two human sounding voices now these audio overviews are accessible via public URL so you can share them more easily and you can add YouTube videos to your sources which is massive cuz let's be real YouTube is one of the best sources for information in our generation and now you can throw the YouTube videos in there and generate these podcast summaries and this week here's another piece of AI news you can use that you could actually build something interesting on this feature caught so much popularity inside of notebook LM that a brand new open source alternative to it popped up this week it's called PDF to audio and it does exactly that you can upload PDFs and it turns it into audio now the interesting thing here is you can use a variety of models if your API key supports 01 preview you could even use 01 preview to process all the text from the PDFs and generate a summary that way but at the end of the day you can pick multiple speaker voices and create audio podcasts similar to the ones in Notebook element how similar well we wondered too and that's why we went in and tested this right away so let me put my headphones on and have a listen we did this based on Sam alman's blog post here's the version from PDF to audio today we dive into a fascinating topic the intelligence age it's a term that sounds futuristic doesn't it the intelligence age is as thrilling as it sounds essentially it's a term that signifies our era where artificial intelligence often abbreviated as AI is a major Force shaping economies societies and even personal Lifestyles so you get the point pretty straightforward with standards Texas speech quality that we're used to and here's the notebook LM audio comparison we're going to unpack Sam alman's latest on AI and the future intelligence age and it's a fascinating read bold predictions some really nuanced arguments we're talking AI tutors for your kids AI managing your Healthcare even reshaping what work means so there you go I would say it's pretty clear that notebook LM is better but this thing is open source meaning you could build this into your own applications or build something completely new from it I don't know I was really excited to find this because in my mind the point of this show is really getting you ahead of the curve so you can use some of these opportunities to your own Advantage just like I did by discovering cat GPT on the day of its release all it takes is being slightly ahead of the her and you can potentially build your entire life upon it all right in the creative Department
14:55

Tripo AI v2.0

this week there have been some interesting upgrades because first of all we got the state-ofthe-art 3D model generator and while this is not really my area of expertise it's been really interesting to watch developments in this space and this new model tripo 2. 0 is supposed to be state-of-the-art so what me and the team went ahead and did is we tested tripo 2. 0 versus tripo 1. 0 versus stable diffusions 3D model generator I have all the results of the testing right here I'm seeing this for the first time myself and I'm just curious to see how far 3D model generation with AI has come over the past year or two so let's just start with triple 1. 4 again these are the same input images okay this is very crude let's look at triple 2. 0 oh wow massive difference in detail right okay what about stable Fast 3D stable diffusions 3D generator wow hold up this is way better am I right okay what about typography this is triple 1. 4 that's okay triple 2. 0 that looks way more like wood no doubt comparing it to stable diffusion well okay that's just weird maybe a food truck stable diffusion versus tripo 2. 0 yeah these details are matched hold up this generator is actually really good I mean sure the text is not perfect but this might be usable in the game whereas the stable diffusion results are you know they tried I guess yeah and 1. 4 is not even comparable this is not even close I didn't expect this to be this big of a jump okay let's do something hard like this steampunk Warrior triple 1. 4 versus Triple 2. 0 right here wow the level of detail so look is certainly not perfect but I think this really crosses a line where I would say this is good enough if you put this into a living world a game animation whatever I wouldn't mind this little ghost should be pretty simple to generate so let's have a look yeah it's clearly it's just better in every single case and I have to say just as a beginner looking at this it's an impressive tool and you can try it out yourself for free on the tripo 3d. a links in the description as per usual
16:51

Invoke 5.0

all right next up I want to show you this AI image edit you can think of it as a canvas that uses AI to generate different aspects of this and this to is not new but in the 5. 0 version they introduced this canvas where you can really create things from scratch then match them with others I don't know I personally learned Photoshop around 14 years ago when I was 16 I remember taking a linda. com course and then also infecting other classmates with my enthusiasm for the learning I believe three other guys in my class took the same course and then all of us were photoshopping things into each other and something like this we could have only dreamed of back then look at that you just generate a tree you cut it out you insert it into Scene It Blends it all together so this is the announcement demo but you have all the AI Tools in one you have layering and again this canvas interface is brand new in here and it all started with this simple little mask that then AI turned into this with precise control if I was a designer I'd be probably learning this right now I just have to note that you can get started for free but this does cost money but what a great interface I didn't know of this tool before and I wanted to show it to you right here okay
17:52

Leonardo Upscaler

next up we have a upscaling upgrades to Leonardo a you might know them there a suite of tools and now they also included an upscaler just like many other apps and I was curious because Leonardo AI is a pretty good deal I think at $12 a month you get a suite of tools that now also has an upscaler when you compare it to some other tools like in our case we use magnific I still think it's state-ofthe-art but it cost around $40 a month and we only use it for the upscaling so I was curious if Leonardo AI could do the same thing so we went ahead and tested a bunch of images in this brand new upscale of The Have and as you can see here from the original image this is the Leonardo result and this is the magnific result and I have to say right way I just prefer magnific if you're fine with good results then Leonardo is fine but magnific is just better isn't it but some of these other tools that have 16x upscaling not just 2x are just better there's more detail now you could use something like leonado to get to the 16x resolution but you would need to run it and then run it again and as you might know these AI UPS scalers introduce new details that's how they work they hallucinate up what should be in between the pixels with AI and that way they can really blow up the image to incredibly large proportions from something that originally should not have all that resolution I mean look at these books this is the original image going to Leonardo over to magnific these are pretty comparable a few more a cat in a hats in Leonardo wow look at this one this is really good and then magnific again slightly more detailed but I think you can see the pattern here it's pretty good and I think this can be said for most upscaler these days every single app seems to integrate a simple AI upscaler just like Leonardo did here and they're really good they add a little bit of detail you get a bigger resolution but as you can see in this image it can backfire sometimes and you have to play with the settings on something like magnific that will blow it up so much actually on this elephant example I might prefer Leonardo same goes for this portrait what's the conclusion here AI ups scalers are amazing if you or your marketing department are not using them in your workflows you definitely should be this is not just image sharpening that we had for decades now this invents new pixels and it can be really useful especially if you pair it with other eii Generations that often have the downsides of being a smaller resolution
19:54

Kling 1.5 and Motion Brush

in AI video lands there's a super quick update for you cling one of the top video generators has a brand new model cling 1. 5 it outputs at 1080p HD and there's a few changes but honestly all of these video updates lately feel marginal there's an entire page that I'll leave below where they really nicely Illustrated all the changes here as you can see it performs way better on this good old Will Smith eating noodles type of test so that's great but you don't get the motion brush and camera control in this new model ah by the way that's also a new thing they now added the motion brush making this the first one of the top three video generators to actually add selective animation with the motion brush this has been one of the best features of the original Runway if you want to learn more about these they put together this guide with a lot of detail I'll also link this below this thing is freely available includes a bunch of prompts and examples and it's all in English any AI video enthusiasts you'll love this should be really helpful and they even have cats with Hats so I don't know what's going on over in China but they're getting something right so I'm not sure I put this together and if they watch the a Advantage but if they do hello cling guide Rider nice to meet you and that's pretty much everything we got for today check out the AI Advantage newsletter in the description below to get a free template with a bunch of beginner prompts now also including voice prompts and as per usual I'll see you in the next episode of AI newsic can use next Friday

Ещё от The AI Advantage

Ctrl+V

Экстракт Знаний в Telegram

Транскрипты, идеи, методички — всё самое полезное из лучших YouTube-каналов.

Подписаться