New AI Listened To 20,000 Hours Of Music. What Did It Learn?

7:33

New AI Listened To 20,000 Hours Of Music. What Did It Learn?

Two Minute Papers 25.09.2023 107 545 просмотров 5 100 лайков

Machine-readable: Markdown · JSON API · Site index

Смотреть на YouTube

Поделиться Telegram VK Бот

Транскрипт Скачать .md

Анализ с AI

Описание видео

❤️ Check out Weights & Biases and sign up for a free demo here: https://wandb.me/2mp ❤️ Their mentioned post is available here: https://wandb.me/audiocraft_2mp 📝 The paper "MusicGen: Simple and Controllable Music Generation" is available here: https://ai.honu.io/papers/musicgen/ https://arxiv.org/abs/2306.05284 Try it out! https://huggingface.co/spaces/facebook/MusicGen My latest paper on simulations that look almost like reality is available for free here: https://rdcu.be/cWPfD Or this is the orig. Nature Physics link with clickable citations: https://www.nature.com/articles/s41567-022-01788-5 🙏 We would like to thank our generous Patreon supporters who make Two Minute Papers possible: Aleksandr Mashrabov, Alex Balfanz, Alex Haro, Andrew Melnychuk, Benji Rabhan, Bret Brizzee, Bryan Learn, B Shang, Christian Ahlin, Gaston Ingaramo, Geronimo Moralez, Gordon Child, Jace O'Brien, Jack Lukic, John Le, Kenneth Davis, Klaus Busse, Kyle Davis, Lukas Biewald, Martin, Matthew Valle, Michael Albrecht, Michael Tedder, Nikhil Velpanur, Owen Campbell-Moore, Owen Skarpness, Rajarshi Nigam, Ramsey Elbasheer, Richard Sundvall, Steef, Taras Bobrovytsky, Ted Johnson, Thomas Krcmar, Timothy Sum Hon Mun, Torsten Reil, Tybie Fitzhugh, Ueli Gallizzi. If you wish to appear here or pick up other perks, click here: https://www.patreon.com/TwoMinutePapers Thumbnail background design: Felícia Zsolnai-Fehér - http://felicia.hu Károly Zsolnai-Fehér's links: Twitter: https://twitter.com/twominutepapers Web: https://cg.tuwien.ac.at/~zsolnai/

Оглавление (2 сегментов)

Segment 1 (00:00 - 05:00)

Yes, this new AI has listened to more than 5,000 songs, and what did it learn? Well, it learned to perform text to music similarly to Google’s MusicLM. But this technique is truly something else. This is MusicGen, an AI where we enter the kind of music we wish to hear, and it just gives it to us. Like this. Wow. Yes, all this was generated by an AI. This was a pop dance track with catchy melodies, tropical percussion, and upbeat rhythms with the new MusicGen. And now let’s compare it to Google’s MusicLM. This one is real good too, and it was published in this year’s January. However, the first one you’ve heard, MusicGen has something up the sleeve. It is an open source technique that is free for everyone. Wow. Now, smooth jazz, with a saxophone solo, piano chords, and snare drums. First with MusicGen. That was real good, with a bit of a saxophone solo. Perhaps a snippet. Now, the previous work, MusicLM. I have to say that I really liked the smooth jazz part better here, that was real good, however, no saxophone. It is one thing to do well, but it is not enough - it also has to follow the prompt as well. Now, wait a minute. These samples are 30-seconds long, but real songs are typically much longer than that. So is that it? No real songs for us? Well, don’t despair because this paper has two more absolutely incredible features. One, long music generation. Oh yes, these are two-minute long samples, but we are experienced Fellow Scholars here, so we are looking for long-term coherence. That means that we will look at snippets of different parts of the song, and listen if they really sound like they belong to the same song. Note that the previous MusicLM was not as good in this task. Here is the snippet we looked at in a previous episode. Now, let’s see if the new MusicGen is any better on this. I can’t wait - let’s do it! Wow. First one is flying colors. Not even a little bit of a drift. Second one. Maybe you noticed that I liked this one so much that I was a little reluctant to skip forward because I wanted to just keep listening. Coherence was not as tight as with the previous one, art and music is subjective, but if we have a song that has a little more variation, this is a good one too. Loving it. And third. This absolutely knocked it out of the park. Absolutely the same song for two minutes. This is super difficult to pull off, and in my opinion, the previous MusicLM was not as good as this new technique. So much improvement in just a few months! Incredible. But we haven’t looked at my absolute favorite feature yet, I am pretty sure it will be yours too. They call this feature conditioning, this essentially can take an already existing piece

Segment 2 (05:00 - 07:00)

of music and remix it with a text prompt. Listen to the input. And the output. Wow, this is insanity. However, not even this work is perfect, here is something that you might call somewhat of a failure case. Yes, the input was maybe recognizable, but it was jumbled up and not nearly as good as the other samples. But wait, here comes the best part. All this is available to all of you, free of charge. The source code of this paper is available, and as of the making of this video, you can even run it on the web, I’ve put a link to it in the video description. So, in the future, we will all have our own AIs that create a nearly infinite amount of music personalized to you in real time. And just imagine what we will be able to do just a couple more papers down the line. What a time to be alive!

Другие видео автора — Two Minute Papers

Ctrl+V

Экстракт Знаний в Telegram

Экстракты и дистилляты из лучших YouTube-каналов — сразу после публикации.

Подписаться

Лучшие методички за неделю — каждый понедельник