They Really Open-Sourced AI Video (and More AI Use Cases)
16:39

They Really Open-Sourced AI Video (and More AI Use Cases)

The AI Advantage 25.10.2024 17 847 просмотров 622 лайков обн. 18.02.2026
Поделиться Telegram VK Бот
Транскрипт Скачать .md
Анализ с AI
Описание видео
This was an absolutely MASSIVE week for AI releases and updates to apps you actually use. Canva, ElevenLabs, Runway, Ideogram, ChatGPT and more all released significant updates, and we'll break them all down for you in today's video. Free Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/ Links: https://elevenlabs.io/app/speech-synthesis/speech-to-speech https://x.com/AnthropicAI/status/1848742740420341988 https://runwayml.com/research/introducing-act-one https://www.canva.com/droptober/ https://x.com/amasad/status/1848763999594418539 https://x.com/ericciarla/status/1848811140861858194 https://x.com/dani_avila7/status/1848509182229533155 https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-large https://about.ideogram.ai/canvas https://ideogram.ai/canvas/7cWezf_wSC-ZDi-w7VlR2g https://docs.google.com/spreadsheets/d/1BlDyNMjbY5aiLpqTSNvJiEhfDPoNwjfqB2W7RK3rnq4/edit?usp=sharing https://x.com/midjourney/status/1849213115009056919 https://www.genmo.ai/blog Chapters: 0:00 What’s New? 0:31 Advanced Voice Available in Europe 0:50 ElevenLabs Voice Design 2:24 Canva Droptober 4:21 Claude Computer Use Use Cases 6:45 Grok API 8:07 Runway Act-1 10:19 Midjourney Image Editing 10:54 Ideogram Canvas 11:41 Stable Diffusion 3.5 13:38 Genmo Mochi-1 14:55 Haiper 2.0 15:41 Google Photos Update #ai Free AI Resources: 🔑 Get My Free ChatGPT Templates: https://myaiadvantage.com/newsletter 🌟 Receive Tailored AI Prompts + Workflows: https://v82nacfupwr.typeform.com/to/cINgYlm0 👑 Explore Curated AI Tool Rankings: https://community.myaiadvantage.com/c/ai-app-ranking/ 🐦 Twitter: https://twitter.com/TheAIAdvantage 📸 Instagram: https://www.instagram.com/ai.advantage/ Premium Options: 🎓 Join the AI Advantage Courses + Community: https://myaiadvantage.com/community 🛒 Discover Work Focused Presets in the Shop: https://shop.myaiadvantage.com/

Оглавление (13 сегментов)

  1. 0:00 What’s New? 115 сл.
  2. 0:31 Advanced Voice Available in Europe 68 сл.
  3. 0:50 ElevenLabs Voice Design 334 сл.
  4. 2:24 Canva Droptober 466 сл.
  5. 4:21 Claude Computer Use Use Cases 580 сл.
  6. 6:45 Grok API 319 сл.
  7. 8:07 Runway Act-1 471 сл.
  8. 10:19 Midjourney Image Editing 143 сл.
  9. 10:54 Ideogram Canvas 181 сл.
  10. 11:41 Stable Diffusion 3.5 472 сл.
  11. 13:38 Genmo Mochi-1 331 сл.
  12. 14:55 Haiper 2.0 164 сл.
  13. 15:41 Google Photos Update 236 сл.
0:00

What’s New?

we've had another intense week of AI releases ranging all the way from aeropic releasing a llm that can remote control a computer we have some use cases of that in today's video and various creative applications that we didn't even know were possible before as you can see I'm still on the road here I'm in Guru Brazil where the wind is absolutely insane and I've been using starlink to upload these video clips to my video editors which sit in Europe and Asia respectively so here we go a truly Global episode of AI news that you can use featuring all the AI releases of this week that you can actually put to
0:31

Advanced Voice Available in Europe

work today okay so this update made a lot of users very happy because advanced voice mode finally shipped to All European users including Switzerland Iceland Norway and lonstein so now everybody has access to Advanced voice check out another video on the channel for 20 plus use cases that you can put to work today with it there's a lot you can do today including the universal
0:50

ElevenLabs Voice Design

translator next up we have 11 Labs coming out with an update and this is easily summarized as prompt to any voice you can imagine you can find this in their web app under voices and then if you go to add new voice the very first option is voice design now and voice design lets you design an entirely new voice from a text prompt let's give it a shot shall we maybe randomize a few times how about an angry old pirate loud and boisterous okay but let's add a little bit of a personal touch here how about this okay you think you can cross Captain match carel latte and LIF to share the insta post okay so this is a pirate with a valley girl accent why you so rude man she stop doing that can't help it this is my voice let's see if it can actually pull this off I imagine this would be quite challenging let's have a look I face storms that would turn your bougie hair white and sea monsters that would make your knees like totally Shake I face storms that would turn your bie I face storms that would turn your bougie hair white whoa okay interesting I don't know I feel like doing one more how about a massive evil ogre troll with a pirate accent let's generate your weapons are but toothpicks to me surrender now and I may Grant you a swift end your weapons are but toothpicks to me surrender now and I make weapons are but toothpicks to me okay granted these were super tricky let's give it one more that is quite simple let's do this movie trailer voice presets that they have in here in a world on the brink of chaos one hero will rise prepare yourself for a story of Epic Proportions coming in a world on the brink of now that's pretty good at this pace we'll soon have text to literally everything
2:24

Canva Droptober

we have some updates from canva with all of their AI features upgraded even further they're calling this drop toe and throughout October they're releasing a bunch of features and you can check out all of them in the link in the description below I'm going to highlight one that I find particularly interesting because minor improvements or a writing tool which is essentially a cat GPT wrapper is not something I consider worth featuring but they have this brand new whiteboard plus AI feature and I always thought that all of these whiteboard and mind mapping type features work really well with AI so what I'm going to do is open up canva here and just click on one of these whiteboard presets how about the SWAT analysis that looks good to me and as you can see there's a lot here okay there's different layouts and you can fill all of these out zoom in and out this is nothing revolutionary but if I went ahead and used all of this and not just used I could also collaborate in here right I can add other users and we could all work on this collaborative SWAT analysis here's the new thing you can go ahead and select everything and now you should be able to say magic right create summary and you will summarize everything on this whiteboard with the power of AI and I think that's just pretty amazing look at this obviously there's no info in here so this content will be useless but I just thought it was an interesting workflow to have a collaborative visual workspace like this where multiple people can contribute and then you can use AI on top of that as a final step or as an addition not as a core feature and the summary you could sense to whomever might care about the results of it but not about the intricate details of the process as you were developing this SWAT analysis mind map site Maps heck you could have entire business plans in here and then just summarize it with AI in the end and for many visual Learners this be a better way to lay out things than simply going ahead and throwing everything into a Word document and as you can see there's a lot of these whiteboards so you could do customer Journeys too and also there's a whole lot more AI features so if I just select all of this we can do all of these presets or even a custom prompt on top of the visual elements pretty interesting and if you want to check out some of the other features there's a lot of minor features here that relate to both Ai and design so this is actually
4:21

Claude Computer Use Use Cases

absolutely massive news you can news you might have heard that Claud released a brand new set of models the Sonet 3. 5 new and the Haiku 3. 5 5 is coming and also they have their brand new computer use API but as you might know I created a separate video on the channel going into all the details and showing you where you can access this even as a non-technical person so if you want to see all that you can check out the dedicated video but what I have for you here is the first use cases that have been popping up and as promised we'll do a dedicated video on this exploring all of them and not just showing you what we found while researching this but also what all of the internet has been doing with this brand new feature that is essentially a llm that remote controls your computer so here's a few while examples one of them would be this prompt go to YouTube find the video and Skip all the ads and then look at it doing it full screens the video it finds a skip button it presses it and then you can get Rick Rolled without ads stopping you okay but admittedly that's not very useful how about this one where it goes ahead and fills out different job applications for you all with a simple prompt that says first scrape enr. com with fir crawl next scrape to their career pages with fir craw and find a job navigate to the job page using Firefox and click the apply Now button until you see a form then find the why do you want to work at a propic text box and enter a great answer into the form box based on the scrape and look at it going to the correct page and applying to the job and using an llm to fill out this field and you can imagine that if you give it extra context in form of your CV it could take all the info and fill out all the fields for you send it and then you could use a prompt generator to generate variations of this prompt with different websites where you can apply to different jobs and then this thing would just go ahead and apply to different jobs for you all day long with your CV and custom answers and this is where the power of prompting comes from because if you prompt it really well it's even going to sound like you it's going to have all the context on know which pages to go to and now all of the prompt generators that I've been teaching you for a while now with the various products I mean since over a year in the freeb get with our newsletters you get 10 prompt generators well now you could go ahead and repurpose those to create different anthropic computer use prompts and then this Dam thing goes out and does the work for you where are all the people claiming that prompt engineering will be completely useless in 2024 where it's not 2030 yet you need to know how to communicate with the AIS to get things done today this is super interesting to me I'll be playing around more with it than reporting back in a dedicated video just focused on various use cases of how to put this to work
6:45

Grok API

next up we have X releasing their Gro API if you're not familiar this is Twitter SLX large language model that has access to all the Twitter data that is its main advantage but to be honest I don't know many people that actually use it regularly if you do please leave a comment below mostly the story of this AI has been that they're catching up to the other players in the game they do have the unique data but just the quality of the outputs and the tooling around it haven't been there yet but now they have an API meaning you can build this into various applications and pay per use and people are trying things out with it like Daniel San over here uses it to generate code inside of vs code now why would you use grock beta over Sonet 3. 5 that is state-of-the-art at code generation especially with the new updates this week I'm not exactly sure but you can do it but then this use case might be a bit more interesting XI actually put on a hackathon and so here built a Chrome extension that allows you to bring your own Twitter algorithm to websites and it filters it using grock and they're using this Onix allowing you to effectively modify the algo on your own Twitter feed with this extension it checks out the different posts and adjusts them based on the topics that you picked in your preset I mean this is interesting but it should also be possible with other API I guess the advantage that you have here is that this does have all the Twitter data so it probably makes most sense to use it to moderate Twitter posts not exactly sure but nevertheless xcii is catching up there's an API now you can use it and now let's move on to the next story
8:07

Runway Act-1

which is Runway act one and this is one that might not be available yet for you they claimed that they started rolling this out I don't know a single person who has this yet but this thing is super fascinating in a nutshell this is essentially motion capture without the crazy device you're probably familiar with some behind thes scenes footage of how Hollywood movies are made especially in the VFX Department wear these green suits or these suits with all of these different tracking points or tracking devices on a person so that when he moves around they can map characters on top of them perfectly now Runway is the first player in the AI video game to release a feature that is trying to mimic this without all of the technology and all of the extra equipment to track something here all you need is an actor performing a certain expression or moving ahead in a certain way and then so let me get this straight you came all the way down to the Department of Motor Vehicles and didn't bring your driver's license do I understand that correctly you're going to have to go in the uh separate line isn't that amazing I mean look at this there's a bunch of examples on this release page and again they're claiming that they're slowly releasing this but in all of these demos this looks absolutely incredible and this is something we haven't seen before now we had an interesting discussion with the team about what's next with this and obviously what's next is well the Avatar will be you and then somebody else can reenact you as you speak so I don't know this could be Ai igore and somebody completely else could be sitting here presenting and then I could just map my avatar on top of it use my levels voice to actually reproduce the voice I mean Heck if you check out 11 labs they do have the voice changer where I can pick Eagle AI advantage and then somebody could record audio and it just gets reproduced in my voice with this Tech doing the video it's about to get crazy I might even be able to take a week off for the first time in years because somebody else will be presenting news you can use and all of the itch will do the makeup of the voice and the Avatar now is this actually good H I don't know I guess it depends on the use case I personally think there's value to this human touch of the interaction that we're having right now but this certainly opens up some new opportunities that most people haven't thought of so far and I personally can't wait to try this myself once I get
10:19

Midjourney Image Editing

access okay when it comes to image generation there's a bunch of new releases this week starting with M Journey actually announcing something but this is only available to a exclusive set of users it's new image editing features and they're only accessible to people who are subscribed for on the yearly membership subscribed for the past 12 months or have at least 10,000 images my account actually does not fall in this category because I started using other tools next to my journey and I got to admit I canceled my sub around two months ago as I mostly go to flux these days if I need something but essentially they're adding some of these editing features that we've seen in Photoshop a while ago and that's pretty much the story here bringing me to the next release of this week which
10:54

Ideogram Canvas

is ideogram canvas magic fill and extent this is very similar to M Journey's release they're adding these in andout painting features which allow you to modify only parts of the image or areas outside of the image but let me tell you all of these features that you see here in both ideogram and M Journey are things that we've had in Photoshop for a while and essentially they're features that you could do manually if you knew how to photoshop properly before this is just ease of use being enabled by artificial intelligence and what we're seeing this week is some of these feature and trickling down from the pro level apps like Photoshop into something like a deogam or M Journey making it accessible to most consumers so if you could benefit from something like extending an image into something wider or replacing a specific object in an image well this week we got multiple alternatives on how you can easily do that in the apps that you might already be using next up we have stability I
11:41

Stable Diffusion 3.5

releasing stable diffusion 3. 5 large now rather than me telling you about this let me just show you because me and the team actually went ahead and created this new Excel sheet that compares all the major image generators on a few prompts that we deemed to be quite useful book covers portrait photography logo design and some specialty techniques that we like as you can see you have the comparison of all the different models here ranging from mid Journey 6. 1 across flux 1. 1 Pro but also IDE 2. 0 and what I did here is I took some of these prompts and I also ran them through stable diffusion 3. 5 so we can compare the quality levels of this versus some of the other top tier image generators and by top tier I mean we have this monthly ranking that we freely publish we updated it once a month so you can stay up to date on what tools are the best in our opinion link below but now let's have a look at what stable defusion 3. 5 produces for some of these images first up we have this portrait photography prompt so right away I just noticed that these eyes are a bit off they just don't look very real especially when you compare to something like mid Journey or flux I mean this is just not on the same level fair enough that's one image let's not judge too quickly how about this logo okay really this is what it comes up with versus these results in M Journey flux kind of a magic Studio didn't get the text right but it's a little more detailed and actually really like these ones from ideogram versus again this okay that's not good let's give it one more chance how about this cinematic still prompt this is a technique that we originally saw from Tim from theoretically media and Matias then runs the events in our community absolutely loves this he uses it all the time and it produces stunning results across all generators arguably Leonardo does the worst but I suppose it's okay flux imaginary are super impressive here the other ones are okay and this is what I got from SD 3. 5 this is terrible this person doesn't even look like a person come on this is D two level humans so I don't know am I missing something here this release is just very underwhelming one thing that I should warn you about is that it is quite un sensored so if you go in here sometimes you will just get graphic images without a warning so you can generate all sorts of unsensored stuff here but other than that not sure why one would use this over flux all right
13:38

Genmo Mochi-1

then next up in AI video generators we have two new releases one of them fully open source and another one is version 2. 0 of hyper we went ahead and tested these two for you so here are the results from the fully open source Mochi One release this is the open source video generator by genmo We compare them to the meta movie gen prompts as metam movie gen seems to be the best thing we have seen so far maybe exora well not bad on this ghost prompt this is a tricky one next up we have this monkey prompt again we'll put up a comparison on screen so you can see the difference between metam mov gen and this but this is surprisingly good physics look realistic it handles the fog well the consistency on this monkey is super good eyes look realistic I mean it's a bit of a ridiculous scene but I don't think there's anything obviously bad about this and next up we have two more shots of a sloth chilling in a floaty I don't know there's something about this one that is extra fun and it sort of works there's not a lot of movement it's quite subtle but the Shadows the water the love with the glasses it all looks good now we did generate this one more time and this generation looks absolutely terrible so I also wanted to include this in here I mean this looks like something I would have made inside of Photoshop when I was 15 uh that's when I was learning Photoshop by the way this is just not good but fair enough just reran the prompt and all of a sudden it was great so there you go Mochi really impressive and this thing is available under an Apache tool license meaning you can use this for commercial purposes in your own project that's pretty amazing at this quality level and then we also
14:55

Haiper 2.0

have the hyper 2. 0 release and Hyper 1. 0 was actually the model that we were surprised by how good it was and they have a 2. 0 model so let's have a look this one we ran through some of the image to video prompts We compare it with you might be familiar with these and the results are surprisingly good even better than hyper 1. 0 is it quite as good as minia Max probably not especially when it comes to something like these animated characters I mean this is just not it minia Max did it perfectly but these balloon animations might be some of the best we have actually seen in this category and the car is okay and this abstract man underwater is also okay 3D object is actually really cool and Abstract so my verdict here would be for animated 3D objects or vectors this is really fantastic something like animated characters or humans let's Soul okay
15:41

Google Photos Update

next up I want to quickly show you an update from Google photos that actually shipped over the past weeks but now it's becoming publicly available and it's the ability to search across your entire photo library so team member Daniel actually went in and tried this out on his own Android phone and if he types in something like show me photos with soccer it does that and it looks for entire library and you don't need to look for a specific date location none of that it just recognizes soccer and bundles all of the pictures into one album for you could also do this on specific people and even video clips now this is pretty amazing and not all Google photos users have this yet Daniel pre-applied to this and now got access but expect this to come to your Android phone soon and these are just the quality of life improvements that AI can bring to these consumer apps and I just wanted to show you as they happen and that's really everything for this week if you enjoyed this subscribe to the channel I do this every single Friday I haven't missed upload since almost a year it's going to be the new you can use anniversary soon here's another video you might enjoy and for me it's time to pack all of this up and move to the next hotel

Ещё от The AI Advantage

Ctrl+V

Экстракт Знаний в Telegram

Транскрипты, идеи, методички — всё самое полезное из лучших YouTube-каналов.

Подписаться