AI Just Took an INSANE Leap Forward!
22:24

AI Just Took an INSANE Leap Forward!

The AI Advantage 19.04.2024 22 505 просмотров 823 лайков обн. 18.02.2026
Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
You can start using Grammarly for free today! To get 20% off Grammarly Premium visit: https://www.grammarly.com/theaiadvantage This was one of the craziest weeks ever for AI news and new releases. We just got open source LLMs that actually rival GPT-4, a new realistic text-to-speech model you can use for free, and way more. Join me to explore it all in depth! Links: https://huggingface.co/spaces/parler-tts/parler_tts_mini https://github.com/huggingface/parler-tts https://ai.meta.com/blog/meta-llama-3/ https://www.meta.ai/c/37e7d1a3-86fe-48fd-8d22-3f07865198a9 https://openai.com/research/gpt-4 https://about.fb.com/news/2024/04/meta-ai-assistant-built-with-llama-3/ https://twitter.com/OpenAIDevs/status/1780640119890047475 https://platform.openai.com/playground https://www.udio.com/ https://suno.com/explore https://www.reka.ai/ https://twitter.com/WizardLM_AI/status/1780101465950105775 https://huggingface.co/alpindale/WizardLM-2-8x22B https://arxiv.org/pdf/2304.12244.pdf https://huggingface.co/spaces/TencentARC/InstantMesh https://arxiv.org/pdf/2404.05014.pdf https://huggingface.co/spaces/BestWishYsh/MagicTime https://twitter.com/0xTyllen/status/1779972538745106446 https://www.microsoft.com/en-us/research/project/vasa-1 Prompts: Write me a standup routine about [prompt]. Only include the dialogue (without the speakers names) and add laughter in square brackets when appropriate. Chapters: 0:00 What’s New? 0:50 Parler-TTS 2:23 Llama 3 7:06 OpenAI Assistant API Upgrade 8:37 Grammarly 10:53 Udio Usecases 13:38 New Music Styles with Suno 14:58 RekaAI 15:58 WizardLM-2 17:35 InstantMesh 18:23 MagicTime 19:01 Future Usecase 19:55 VAS A-1 21:07 AIA Community #ai This video is sponsored by Grammarly. Free AI Resources: 🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter 🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0 👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/ 🐦 Twitter: https://twitter.com/TheAIAdvantage 📸 Instagram: https://www.instagram.com/ai.advantage/ Premium Options: 🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community 🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Оглавление (14 сегментов)

  1. 0:00 What’s New? 172 сл.
  2. 0:50 Parler-TTS 358 сл.
  3. 2:23 Llama 3 1106 сл.
  4. 7:06 OpenAI Assistant API Upgrade 343 сл.
  5. 8:37 Grammarly 582 сл.
  6. 10:53 Udio Usecases 688 сл.
  7. 13:38 New Music Styles with Suno 217 сл.
  8. 14:58 RekaAI 250 сл.
  9. 15:58 WizardLM-2 363 сл.
  10. 17:35 InstantMesh 210 сл.
  11. 18:23 MagicTime 144 сл.
  12. 19:01 Future Usecase 228 сл.
  13. 19:55 VAS A-1 296 сл.
  14. 21:07 AIA Community 339 сл.
0:00

What’s New?

ladies and gentlemen this week in AI news you can use no we're not going to use that you have no power here what a productive week and I believe this week is truly going to have something for everybody because we have updates from open AI we have a brand new open source Texas speech model that is indistinguishable from real people speaking there's a new best-in-class open source llm and I'll show you a super easy and free technique on how to craft standup sets for you or your friends with a combination of two AI tools we already covered plus meta open source llama so we're also going to talk about that although it came out after recorded this video this is a massive release plus it's usable today we'll talk about everything in a second here there's not time to waste the space moves way too fast so let's Dive Right In and let's cover all of the brand new AI news that you could be using today
0:50

Parler-TTS

all right first things first let's talk about this text to speech model that is actually incredibly good now the amazing thing about this one is that it's not paywalled by a subscription or it's not some service that you can only use for an API this thing has actually been open sourced on GitHub okay meaning you could build on top of this and people will but also you can just use it through the hugging face space like so I'm going to pick one of these presets and I'm going to add something to the input text all right hit generate and here in the description box you could prompt it to generate it with different voices before I H play one interesting fact about this is that it has been trained up on 10,000 hours of narrated audio books okay the so what you can expect here is a pleasant voice like you would expect from Audi book let's have a listen remember this is only the first iteration of the model so maybe we will hear more about the monus you can use on the bage B Channel I was to foot him foot okay we're going to have to work on that AI Advantage YouTube channel part maybe give it one more generation remember this is only the first iteration of the model so maybe we will hear more about the M news you can use on the adantage other channel okay it apparently doesn't know the word YouTube what bro what are you talking about cuz my guess is that just wasn't present in the training materials but this was incredibly good right and you could just try it for yourself I found this to be consistent among many different voices and messages that you send to it also they're saying this is just a start they'll keep adding more to the training data and making this better and better so this is an absolutely incredible release that you're probably going to find in various apps very soon again this quality level just hasn't been publicly available up until now okay so
2:23

Llama 3

on firstday April 18th meta came out with llama free their new state-of-the-art open source mod model and this is the big news here this is an excellent model that beats out all other free models and it's fully open source which is absolutely amazing but it doesn't end there they're also integrating this into their entire Suite of products WhatsApp Facebook messenger and even Instagram and they released a chat GPT competitor a online chat interface where you can use Lama free for free without history and it's not available yet in Europe but let's start by talking about the model itself cuz that's the big news here so what did they drop two things namely a 8 billion model and a 70 billion parameter model so this is important because they're not trying to compete with GPT 4 here okay they're doing that too but that model is still in training they talk about this at the bottom of the announcement they're training a 400 billion parameter model which will come out later this year and this is going to be the GPT 4 competitors so today we got these two models which are nevertheless extremely capable so how do they Stack Up on benchmarks well they're really good especially this small model blows most other similarly sized models out of the water and the 70b model beats all these competitors in this medium-sized category like M for medium gbt 3. 5 or odd Sonet and meanwhile the chatbot Arena has updated the leaderboard and 70b model is currently in fifth place and as mentioned the 400b model that is going to be their open source GPT 4 competitor actually later on in this video we talk about an open source GPT 4 competitor that came out this week so hang around for that but metas lree is going to release later this year and purely judging of the benchmarks that's all we get with this one on the MML 5 shot benchmarks it performs almost equally as well as gp4 look at this in summary they just open source the best-in-class small and mediumsized model and soon they're going to open source a gp4 competitor big now why should you care what are they going to integrate this into and what is available today what about this news here can you use well they have a separate blog post outlining all the features that this will enable for their other application so from here and out if it's available in your region so you know I'm going to use VPN here in a second to access their new chat interface but basically it's a question of time until everybody gets features like you're going to be able to tag at meta AI AKA their Lama free model and ask questions in your conversations inside of WhatsApp inside of messenger heck they're even putting it into their feet this is the one that really surprised me I wouldn't have expected them to put this into their main product their main Revenue driver right look at that there's a button in the feed where you can consult the AI this is making these things mainstream right A lot of people try to chat gbt but that doesn't mean they're using it regularly in contact with it on everyday basis and if you didn't know this is kind of a big deal because meta has a whooping 3. 9 they have 4 billion users worldwide on their apps and meta AI is shipping to all of them well you know except Europe and not just that you're getting the LGE language model they also have a new image generation model which actually generates in Real Time spoiler alerts it's not as good at what we're used to with the newest releases I would say it's maybe at a d 2 level anyway so that's all exciting news but what about this can you actually use well if you head on over to meta. vn here you have a chat interface okay so this is a chat GPT competitor with llama free 70b running in the background now you can use these prompt presets and right away you can see this is way more consumer focused right there's images along with the prompts it doesn't even show the prompts it just shows you the outcomes quick side note this is eerily similar to The Prompt templates that I've been providing for free or with my course since over a year now you have an image and the outcome this is really the best way to teach people what you can do with these models and meta knows that so in this new chat interface you can use the chat box or you can use one of these and start interacting with it is Speedy it's at the Quality level of a medium sized model which is very decent and very usable again it's not a gp4 or a claw Opus but it's very good I also tested the number one thing which I test these days which is the writing style it's nothing special for Unique writing styles I think Opus and Gemini are still the best right now so again the way to think about this is really as a better GPT 3. 5 that is completely free met. AI one thing that it does very differently than all the other models is it uses bullet points and these like paragraphs a lot I think this might be a decision that makes it even more consumer friendly right because then it's not a wall of text but it's segmented which makes it easier to use I think and that's the product they released now and this is going to be available inside of all their other native applications to 4 billion people huge this is what mainstream AI adoption looks like right and then also you have their image generator which is also going to be integrated into all their apps quick side note if you have a Facebook profile in a country that doesn't have access to theads but you have a VPN you can still use this as I said this is D to like Generations I would say in other words it's good enough but there's certainly better ones out there last thing that I want to underline again is that this thing is fully open source so people can build on this what a weak for open source this is and that's really Lama free more coming later in the video okay
7:06

OpenAI Assistant API Upgrade

so next up open ey announcement this one is for the developers but I think it might be interesting to non-technical people too because what they did is essentially they upgraded their assistance API if you're not aware it's essentially a GPT that you could call through the API or built into your applications or websites and what they did is they 500x amount of files that you can attach to it so what they're really trying to do is they're trying to compete with a lot of the other solutions that allow you to add a lot of context this is one of the main things you want to do if you use AI for a business right you wanted to be aware of all the business context you have not just part of it now played with this a bit today and I got to say there's a big difference in speeds too it's not just that you can attach way more this thing actually became quite Speedy I am speed if you used assistance API before you'll know that it wasn't the fastest and it was prohibitively expensive which is still the case but at least it's better now there you go already running so this is really good and I did attach two files here but as mentioned it could have up to 10,000 of them incredible the main problem still being the very high pricing currently there's just so many other Solutions which do a similar thing at a fraction of the cost that's why this is not as widely used as open AI maybe would like for it to be but with upgrades like this step by step this is becoming more and more capable one quick side note here too is that they added this new feature that you could add projects and therefore organize your API costs via projects rather than by API keys if you work with a team you will appreciate this and that concludes our quick little overview now let's move on
8:37

Grammarly

all right so for this next one I want to show you some new AI features of one of my favorite tools you might have seen me use this before in other videos grammarly now I've actually used grammarly for years and in some prompt engineering tutorials I even recommended as it correct spelling mistakes while you type and recently they added some upgrades to bring it to the next level and they reached out to sponsor this video namely they're using large language models to add an assistance to your Brower browser whenever you're typing something so you can always just click this little G button and here it is as I haven't typed anything there's no suggestions yet but before we dive into these suggestions I want to show you this customization feature because it actually adjusts to your personal context it's sort of like custom instructions light so let's just say I'm on X or Twitter and I'm writing a post here now if I select this I could click this button saying rewrite with grammarly and up here you can set your voice so this is sort of a light version of custom instructions set of cat GPT and once you set this preset everything you'll be doing with this extension will consider that context as you might already know the best way to use AI for Content creation today is not for writing it's for editing and that's what this does really well so I select this text and I'll simply say improve it and it will give me a suggestion on how to improve this post now I could prompt it further here as you can see or use one of the presets here but this makes it really simple I can operate within the context of what I'm doing on the internet right now but there's also other scenario if you just staring at a blank page and not knowing where to start well here you can use a too I just simply prompted and there you go here's my email draft that I can now insert in the conversation and as you can see this gets me started in no time now I could customize and rewrite this further as I just showed you by simply selecting it and using this button to rewrite and for anybody wondering yes you can absolutely use this for other use cases like brainstorming it's an integrated llm right there you go this took less than a second oh and did I mention that all of these features are free up to a certain limit that is you can run 100 prompts per month included in grammar Le free plan plus all these spelling and sentence structure suggestions that I've been using for years are also included in the free plan if you upgrade you get 10 times as many prompts and strategic suggestions these are in context recommendations on what you could be doing with your text in order to improve the impact so go ahead and try gramal for free today I use it as a Google Chrome extension but they also have a standalone app and if you want to use it more often or use some of the advanced features you can upgrade to the Premium plan thanks again to grammarly for sponsoring this video and now let's get back to the next piece of AI news that you can put to work today
10:53

Udio Usecases

because next up we have probably my favorite for this week it's a use case for something that we covered last week already it's Studio this new Music Creation app that competes with sunno it's really good it went kind of viral across the internet because it is so good but people discovered capabilities of this app that go beyond just Music Creation namely they found out that you could create standup routines with it and they actually sound really good so it's not just for music you can also create other things with it so let me just show you this quick workflow you can prompt the llm of your choice with something like write me a standup routine about two AI assistants trying to hack each other and we're keeping it simple here we just want a quick standup routine now quick suggestion here is that you could totally use this to for example roast your friends and then turn that into song and send it to them right this is all free by the way yes I'm using chat gp4 but you could use any llm any one of the open source Alternatives we cover here on a weekly basis or something like Cloud High cou or gbt 3. 5 and there you go here you have a basic standup routine now this won't work perfectly so we will add something to The Prompt here and there you go for anybody intimidated by prompt engineering this is how it goes you just look at the outputs and you adjust your prompt according to what you like and what you don't like now I've played with this a little bit so I know what we need we only need the dialogue and we want to avoid the speaker names otherwise they're going to be spoken in our standup routine prompt is going to be in the description below okay so I'm just going to hit copy down here head on over to udio and here it is quite simple because what you want to do is you want to switch from autogenerated lyrics here to custom like so and then I'm going to Simply paste what I just copied from chat GPT here and at the top I'm going to say standup comedy routine and you just hit create and there you go it's going to create two routines you can listen to these again at the time of this recording this is still in beta so it's completely free that will probably change over time okay let's have a listen all right so the other day I caught two AI assistant trying to hack each other yeah I was like watching a cyber version was five versus five if anyone remembers that one says the other hey I bet I can get your root password the other replies oh real I am okay so I don't know about the writing on this is totally up to you and the llm that you use you could even write this by hand right but there you go what a fantastic use case and this is really what I envisioned for the show I'm not interested in covering all the different dram that happens in the space I want to show you different ways how to use these tools for yourself or what you could be building with them or what you're going to see implemented in other apps soon one quick side note before we move on people have been using this even more creatively and while browsing Instagram and Tik Tok over the last weeks I caught all of these German Tik Tok that take conversations on eBay with different sellers like these awkward and funny conversations and they essentially turn them into a song with you the newest trend on German Tik Tok for faceless channels just thought that was interesting and I wanted to share it with you with that being said let's move on to the next piece of AI news that you can use and this is going to be a quick
13:38

New Music Styles with Suno

one but sunno os's competitor released this tool that allows you to explore all these different genres of music this might be really useful if you're creating with yudo Oro and you're not sure what musical style you want to be creating in now mind you some of these are completely wild and don't even make sense I mean can anybody in the comments explain what hypnogogic SHO es or Goa trans AFR Cuban Jazz okay so let's try some breakbeat balcon Brass Band okay that's actually legit let's try one more okay so this is a really funky one Roots reggae avantgard Jazz swaying tring Melodies Rivers whispering that's surprisingly good so if you're looking at my screen right now and you're like I wonder what this one does well this is your Quee to open up the description open up this link and sit down and explore some new musical genres that AI created I mean to be fair some of these don't make any sense at all because they're combinations of two genres that have completely different bpms but AI just makes it happen anyway somehow it's very interesting and fun to explore and by the way great Jobo for this incredible interface here so there you go anybody who's into music this is
14:58

RekaAI

something new all right let's get a little more serious here because the open source Pace has seen some serious releases throughout this week I want to touch on two of them one of them is R Ai and the second one is Wizard LM now RI came out with a brand new multimodal llm that actually beats out Opus which is quite impressive and on human eval which is arguably the most important Benchmark but then again you know my opinion on benchmarks they're not to be trusted anymore as the model makers expect them you could be using this model through the API today and look at first when these open source models came out I tried to test them a little bit run them for a few prompts but I found that to be largely pointless over time only time will show if these models are actually any good and you need so many test cases and at the end of the day any opinion that I would give you right here a day or two after release and maybe an hour or two of me using it would be super subjective and wouldn't do it justice because again only time will show but nevertheless I want to point you in this direction and I want to forward the feedback that people have been having that the vision capabilities of this are actually surprisingly good it's good to get more competition in the space one thing that
15:58

WizardLM-2

we can objectively talk about though is the fact that wizard LM came out with the new best-in-class open source llm that actually beats gp4 that's what they claim that's what some benchmarks say that's what people have been saying that are using it this is legitimately the first model that even claims that doesn't mean it's a fact but nobody else has been able to even claim that not even on the benchmarks and look I have to say if you look into the paper a lot of the data is based on this human evaluation where some people simply prefer this model over chat GPT they even admit that down here but nevertheless this apparently beats out all the other open source llms but there's a little bit of a story with it because it was released this week and then it was deleted again why would they release something amazing and then deleted well it turns out that it's been a while since they released the model and they're unfamiliar with the new release process now and they forgot a required item on hugging face called toxicity testing so they deleted it and now they're adding it and they're going to publish this again soon but I scoured the internet for you and I found this hugging face Space by alendale this is the hero of the week while the model was publicly available he downloaded the whole thing and reuploaded it here for us to enjoy so if you want to use it before they finish updating it with toxicity testing here you go I personally can't wait for the chat but Arena to update with these new models we're going to see how these stack up against the rest of the competition but yeah generally speaking the story has been that the 70b model of the wizard lm2 is a GPT 4 class model that is fully open source so it took a while but we're finally getting to a point where open ai's hand must be forced soon where is GPT 5 we'll see and you bet we'll cover it on this channel okay to round things
17:35

InstantMesh

out I have a few more quick releases here so the first one is a hugging face space for image to 3D generator that is actually incredible we've seen a lot of these I haven't covered the new ones in a while cuz they were all equally as meh this one is impressive look at that from this image it creates a model that actually is decent look at that just from an outside perspective this looks better than anything I've seen yet from tools like this interesting I'm going to try it with an image that should be really hard for it this is a fine tune stable diffusion model that I created over the last week let's see if it can cut it out successfully yes that worked and now it's going to turn me into a 3D model okay this didn't work as well but I have to say it's not as bad as many others that I've tried the face and the hair are kind of messed up here but this is also one of the harder tasks that you could throw at it always good to see the progress in the space and let's move on to this one which I was so excited to
18:23

MagicTime

see at first because this is essentially a AI Tim Li generation model if you're not familiar with time lapses they're essentially videos of a longer period of time that are sped up now the demos where I showed this off looked absolutely incredible but I actually spent over an hour trying to prompt this I don't know if I'm doing something wrong I even recreated The Prompt that they had in the demo this is very similar to what they showed this off with yet the results are just not there I wanted a two season time lapse progressing from Summer to winter so I don't know the space is trending right now and people are using this left and right I wasn't able to get what I wanted from it maybe you will get better results it's all linked below and then
19:01

Future Usecase

to round things out I want to talk about a potential future use case this is what we do sometimes if I find something really interesting worth talking about and this is really an interesting idea that I ran into on Twitter and I thought you might enjoy this basically it's upwork where AIS hire humans that's a bit crazy right but as you know there's many tasks that AIS can't perform right now and no this is not a real site it's just a concept somebody's building this I do not think this really has legs but the idea is interesting right if things keep moving as fast as they are we could get to this point where yeah autonomous agents are going to be pretty good but they're not going to be able to do everything and that's where something like this comes in the creator of the agent might give it a budget and then the agent will go on a platform like this and the little gaps in its workflow it will simply fill in with human labor for a platform like this I don't know I just thought this was really interesting I understand that this is a bit of a dystopian thought nevertheless an interesting way to round out this ple of AI tools that we just had a look at and
19:55

VAS A-1

actually there's one more story that I want to share with you it's this Vasa one model that was published by Microsoft research which you know kind of surprising that Microsoft would be developing something in this direction but then at the end of the day I guess a lot of the large corporations just do this now and it's maybe less of a taboo than I would have thought because this is essentially deep fake technology where from a single image and a single audio clip you can generate humanlike avatars now look at some of these examples this thing is not available today will be available soon we have similar ones that we covered that are available today but this stuff is pretty wild and I really wanted you to see this so you know sometimes nothing happens and sometimes everything happens all at once and you just got to deal with it the first thing we need to look at is the letter H so the sound at the beginning I would just say this is way better than many of the ones that we have seen before by no means is this perfect but I can totally see this as a customer support agent for many companies very soon and while yeah it's not a human and a lot of people argue that hey it's so annoying that all these companies are going to be uses as the customer support it's at least better than those telephone Bots right unfortunately at the end I state that given the context that this is dangerous technology we have no plans to release an online demo API product or any additional implementation details or anything really until there's certain that technology will be used responsibly
21:07

AIA Community

if you found this interesting covering apps and what's coming up in the ipace then you'll enjoy what I'm about to share for the very first time on this YouTube channel it's the opening of the I Advantage Community I don't want to make too big of a deal around it I created a whole sales page with all the details but essentially what I learned over the last two years is that the best way to stay on top of this wild space and really the only way of implementing it into your own life is by actually using these tools and the best way to do that is by surrounding yourself with other people who are trying to do the same thing one of many things that we do in there is we run weekly challenges and this week's challenge happens to be a battle between sunno and judios so we challenge you to create two tracks give you a detailed step by step on how to do it and then people submit them here and then there's a discussion in a 1-hour event where we evaluate the winners take the learnings and share what we learned if this way of educating yourself sounds enticing there's a link in the description to it as you might be able to tell I'm beyond proud of what we were able to build here over the last year we have nine people working on the community okay five of those full-time it's incredible you should at least have a look on what we created here there's so much power in numbers and it's really the best way to stay on top of these tools and not just that figuring out how to use them to your advantage that's what a Advantage Community is all about but if all you want is another video with AI news that you can use and check out this playlist because I do this show every single week and I will see you next Friday

Ещё от The AI Advantage

Ctrl+V

Экстракт Знаний в Telegram

Транскрипты, идеи, методички — всё самое полезное из лучших YouTube-каналов.

Подписаться