Meta's New AI Tool DESTROYS GOOGLE! (Now Released)
14:31

TheAIGRID 15.06.2023 20,199 views 268 likes

Video description
Meta's New AI Tool DESTROYS GOOGLE! (Now Released) Welcome to our channel where we bring you the latest breakthroughs in AI. From deep learning to robotics, we cover it all. Our videos offer valuable insights and perspectives that will expand your knowledge and understanding of this rapidly evolving field. Be sure to subscribe and stay updated on our latest videos. Was there anything we missed? (For Business Enquiries) contact@theaigrid.com #LLM #Largelanguagemodel #chatgpt #AI #ArtificialIntelligence #MachineLearning #DeepLearning #NeuralNetworks #Robotics #DataScience #IntelligentSystems #Automation #TechInnovation

Contents (3 segments)

Segment 1 (00:00 - 05:00)

Okay, so one thing that you might not have known is that Meta actually has an AI division in which they release new products all the time, and today was no different. They decided to release a new paper in which they discover a new way to generate music. So they released this paper today and called it "Simple and Controllable Music Generation": "We tackle the task of conditional music generation. We introduce MusicGen, a simple language model that operates over several streams of compressed discrete music representation. Unlike prior work, MusicGen is comprised of a single-stage Transformer language model together with efficient token interleaving patterns, which eliminates the need for cascading several models." So following this approach, they're able to generate high-quality audio, and through their studies they've shown that this improves on previous benchmarks.

You might be thinking this is just another research paper in which they talk about products but never allow people to demo it or use it, but Meta actually did open up a Hugging Face Space where you can actually use this product if you want to. Now, it's important to note that Hugging Face isn't exactly a final product website. There are many different projects that you can use for applications that are going to be released in the future, but it is a place where you can test them and see how effective they are, and as you can see from here, MusicGen is no different.

Now, another thing that I really do like about MusicGen is that they offer the ability to condition on a melody, which essentially means that when you try to generate your soundtrack, you can actually use a different kind of soundtrack in order to influence how it sounds. It's essentially quite similar to Runway, where you sometimes do need a driving image or a driving video in order to create that texture video. So I did actually test this with a specific type of music, and I'll show you exactly what it sounds like, and then I'm going to compare it to another company that produced another music generator, and that company is Google.

So if you hadn't known already, Google also produced a music generator. Now, I gotta be honest with you guys, the results are very interesting, because they are both actually very high quality. Something you may not have known is that we did previously make a video on Google's music generator. However, at that time it wasn't actually open for usage to the average user, but now Google has actually opened up the waitlist a little bit and let some people through. So if you did watch that video and you did apply when I said, then it is likely that your application is now going to be accepted. So this is going to be another tool that you can also use in your AI music generation.

So I'm going to firstly show you three different soundtracks. Number one, I'm going to show you the kind of soundtrack that is completely professional; that is the soundtrack that I've actually paid for, for use on a different project. Then I'll show you Facebook's, or Meta's, AI soundtrack, which was generated using a description of that soundtrack. Then I'll show you Google's soundtrack, which was also generated using the same exact description, and then we're going to be able to compare these and see just how far Meta has come.

So this is PremiumBeat. It's a website in which you can get music tracks for whatever project you're working on. There are many different music tracks for whatever creative projects you might want, and they do sound very high quality. Now, a recent project that I did work on was one that involved funky downtown music, and you're going to hear exactly what these soundtracks sounded like. The reason I'm going to play these is that when I generate the music with the AI generator, you're going to be able to compare to see how realistic it does sound. So I'm gonna play three of these tracks so you get a sense of what the track should kind of sound like, and then I'm gonna play Facebook's one.

So now that you've heard those three soundtracks, let's go ahead to Meta's soundtrack. You can see right here that I haven't actually put a condition on the melody, which means I haven't actually given it a sample track to use, so that might actually affect the output, and in the later part of the video we actually will go ahead and do that. You can see in the description I've put "downtown funk jazz comedy", and this is what the AI-generated soundtrack sounds like.

Now, if I'm being honest, I do think it sounds okay, but I think the only reason that this doesn't sound that high quality is because it doesn't seem like it has too much of a rhythm. I do know that there are other soundtracks that do have conditioned melodies and will produce better results, but I also want to compare this to Google's one, because it will go to show exactly where the music is now. Previously, on Google's one, they did actually have many different ones with much longer descriptions that sounded completely realistic, if I'm being completely honest with you, but what you're about to hear is the same exact description, just by Google's AI music generator. What's also interesting about MusicLM, which is by Google, is that they offer two tracks. Much like how Midjourney offers many different image variation options once you've used the prompt, MusicLM offers you two different tracks in order to pick which one's best. Now, that was soundtrack one, remember,

Segment 2 (05:00 - 10:00)

and that was soundtrack two. Now, I've got to be honest, if I was listening to that soundtrack I would say that soundtrack two is good, and it definitely does improve on Meta's MusicGen. On that one solo example, it's clear to say that it does sound like MusicLM is better. However, take a note from the actual research paper of what they say about the comparison between Facebook/Meta's MusicGen and Google's MusicLM.

So, according to the evaluation results presented in the paper, MusicGen has been shown to be superior to MusicLM on a standard text-to-music benchmark. Whilst MusicLM is a competitive baseline, MusicGen outperformed it in both objective and subjective metrics. However, it's worth noting that the authors of this research paper used the public API for human studies for MusicLM, while they retrained the Mousai model on the same dataset. Therefore, the comparison between MusicGen and MusicLM might not be entirely fair, as the models were evaluated under different conditions. Nonetheless, the results may suggest that MusicGen is a promising approach for text-to-music generation, and it might actually outperform other competitive baselines such as MusicLM.

Now, I also do want to show you what that soundtrack would sound like if it was conditioned with a melody. So this is the melody that I put inside. You can hear that was one of the soundtracks from another project, so I decided to rename it. What was interesting about the soundtrack is that the first three seconds are good and then the rest of it is just a bit strange. I'm wondering if it copies the first couple of seconds and then the rest of it sounds strange. Just take a listen.

Yeah, I've got to be honest, the first couple of seconds were promising, but then the last bit wasn't as promising. So I'm wondering if this AI generator was firstly imitating the first couple of seconds, and then later, when there was a drop-off, it sort of just got confused. That is potentially what happened.

Now, this stuff is definitely interesting, and I do know that there are some inconsistencies among generations, but what I want to do is actually compare Google's paper to what Meta has released here. One of the things that I wanted to note about MusicLM was that they had certain things on their actual page. If you come to Google Research's GitHub, you can actually find the page where they have numerous different examples, which are honestly genuinely interesting. One of the key ones was called "relaxing jazz", and it definitely sounded really interesting. Now you're gonna see the differences in the quality, and you're about to hear relaxing jazz from Google's actual official paper.

Which is really cool, because not only is this AI-generated, it's around five minutes long. Then I took this description and put it into Google's MusicLM generation software as well, because I knew that this may generate something different, and I wanted to see if there was any consistency and if Google actually did cherry-pick or hand-pick their music results. Then I also once again put it into Meta's music generator. So what you're going to see right here is that I'm going to go ahead and click generate, and I want you to hear the differences. I think we all know what relaxing jazz sounds like, and it definitely was pretty accurate. So I'm gonna play Meta's one, then I'm going to go ahead and play Google's one.

Okay, so with that being said, I gotta be honest, the relaxing jazz background music does definitely sound great from both of them, and I do think that whatever sort of music track you pick is going to affect how exactly it sounds. I do think that this has some application, maybe for background music. Maybe you don't want to buy music and you just want to have this in the background of a hotel lobby or one of your video games, or maybe you're starting a small project. With stuff like this, you generally wouldn't be able to tell, because it's in the background and it's a supporting track. However, if we do come to the point where people are actually trying to create entire music tracks where it's going to be the main theme, that might be a different story.

Now, another thing that I also did want to compare, because it was truly impressive in Google's demo, was the rich captions. Rich captions are essentially these long captions with a lot of description, and you can see

Segment 3 (10:00 - 14:00)

that the first one here was: "The main soundtrack of an arcade game. It is fast-paced and upbeat, with a catchy electric guitar riff. The music is repetitive and easy to remember, but with unexpected sounds, like cymbal crashes or drum rolls." So this one sounded super realistic. I do think that being able to generate music from this kind of description is really good, so I'm gonna go ahead and test this in Meta's.

Okay, so if we look at the arcade soundtrack, I think that MusicGen for some reason didn't get this right, and honestly it sounds pretty weird. I did actually generate two different soundtracks: on one I used the same text that Google used, and on the other one I actually used a normal soundtrack, and I even conditioned it. So the results here are interesting. Maybe this is just a personal experience, but you're going to hear what they both sound like. This is what the conditioning melody sounds like, which is a normal arcade soundtrack, and then I called it "upbeat arcade soundtrack for a video game", and then this is what it sounds like.

Definitely not that. I'm not sure why it got it wrong in this instance, but I did try again with this one, and you're going to hear exactly what it sounds like. And yeah, I don't want to play the entirety of that, because it does, for the most part, sound AI-generated. But I think that maybe if you do try and regenerate these, they will have a much better output. For now, my guess is that in certain categories it excels, whereas in other categories it just doesn't know what it is. For example, in relaxing jazz music it can do very well, whereas in upbeat arcade music it just doesn't do well at all. And just for reference, if you think that the text was wrong or something, this is Google's one, when we use Google's MusicLM, and then this is what track two sounds like.

I mean, arguably they're both not that great. I'm guessing there isn't enough of that data in the sample set. I'm guessing what they trained this model on only had a certain number of base soundtracks. I do know that it was trained on a dataset of over 20,000 hours of music, which includes an internal dataset of around 10,000 high-quality music tracks as well as Shutterstock and Pond5 music data, but I'm thinking that maybe some of these may have either been categorized wrong or perhaps not correctly sampled.

So either way, I do think that this is interesting. And for those of you who think that right now this isn't plausible to use as a complete and working product: I think for background music, some of the soundtracks generated by these AI tools could 100% be used without people knowing, but I do think that we are a long way away from complete soundtracks. It's also important to know that AI development can move pretty rapidly, so whilst it might seem like that now, in June, by the end of the year we could have something that is 100% fully functional and works much more efficiently than these tools.

Now, one thing that I did forget to add to this video was that MusicGen is actually open source, which means that people are going to be able to use this, fine-tune it, and essentially work with this program and make it much better if they want to, and that's definitely something that we have seen before. So, like we said, I do think these advancements are going to continue to go pretty quickly, especially when we have open-source projects, and there are many talented individuals out there who can make these models a lot more consistent.

Additionally, for those of you who think that this is just something that Meta are working on, there are other companies that also have AI music generation. For example, if you do some browsing you'll come across this website called Soundraw, which says: "Stop searching for the sounds you need. Create it. Royalty-free music, AI generated for you", and you can simply generate some music tracks. We also have another website right here called Boomy: "Unleash your creativity. Make music with Boomy AI. Create original songs in seconds, even if you've never made music before. Submit your songs to the streaming platforms and get paid when people listen." Then we also have this website here called Soundful, "the best license for creators" and an AI music generator, where you can literally just generate background music like we talked about earlier in the video.

So I do think that, once again, this is going to be moving very rapidly, and with open-source projects you do see a lot of solo developers and a lot of smaller teams working on them, and they usually come out with many different things, such as Auto-GPT, which was a fine-tuned version of a large language model. So it definitely will be interesting to see what comes of this, and it's definitely exciting for the future.
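
The abstract read out at the start of the video mentions "efficient token interleaving patterns" over several streams of discrete tokens, but the video never says which pattern is meant. As a rough illustration only, the sketch below assumes a simple "delay" layout, in which stream k is shifted right by k steps so that a single model step can emit one token per stream. The names `delay_interleave`, `undo_delay` and the `PAD` value are hypothetical and chosen for this example; this is not Meta's code.

```python
from typing import List

PAD = -1  # hypothetical filler token for positions that hold no real data


def delay_interleave(streams: List[List[int]]) -> List[List[int]]:
    """Shift stream k right by k steps and return one column per decoding step."""
    num_streams = len(streams)
    length = len(streams[0])
    assert all(len(s) == length for s in streams), "streams must have equal length"
    total_steps = length + num_streams - 1
    steps = []
    for t in range(total_steps):
        column = []
        for k, stream in enumerate(streams):
            src = t - k  # index into the original, unshifted stream
            column.append(stream[src] if 0 <= src < length else PAD)
        steps.append(column)
    return steps


def undo_delay(steps: List[List[int]]) -> List[List[int]]:
    """Invert delay_interleave and recover the original streams."""
    num_streams = len(steps[0])
    length = len(steps) - num_streams + 1
    return [[steps[t + k][k] for t in range(length)] for k in range(num_streams)]


if __name__ == "__main__":
    # Three toy token streams (assumed values, not real audio codes).
    codes = [[11, 12, 13], [21, 22, 23], [31, 32, 33]]
    for column in delay_interleave(codes):
        print(column)
```

With three streams of three tokens each, the shifted layout takes five steps instead of the nine that fully flattening the streams would need, which is the kind of saving an interleaving pattern is meant to give.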
