Neural Audio Compression | What is Residual Vector Quantization?
Machine-readable: Markdown · JSON API · Site index
Описание видео
AI based methods for learnable codecs are revolutionizing how we store and transmit audio and video data, and lie at the heart of cutting-edge AI models like Google's SoundStream and Meta's EnCodec. Learn how RVQ and neural compression work in this video explainer.
References:
- Blog post on latency at AssemblyAI https://www.assemblyai.com/blog/lower-latency-new-pricing/
- Tutorial on Text-to-Video apps in Python https://youtu.be/Tlxe3l_m3PA
- Google’s Soundstream https://research.google/blog/soundstream-an-end-to-end-neural-audio-codec/
- Meta’s Encodec paper https://arxiv.org/abs/2210.13438
- Paper on perceptual metrics https://arxiv.org/pdf/1801.03924
Video sections:
00:00 Why learnable codecs?
01:50 Neural compression
03:17 RVQ
04:27 Final thoughts
▬▬▬▬▬▬▬▬▬▬▬▬ CONNECT ▬▬▬▬▬▬▬▬▬▬▬▬
🖥️ Website: https://www.assemblyai.com
🐦 Twitter: https://twitter.com/AssemblyAI
🦾 Discord: https://discord.gg/Cd8MyVJAXd
▶️ Subscribe: https://www.youtube.com/c/AssemblyAI?sub_confirmation=1
🔥 We're hiring! Check our open roles: https://www.assemblyai.com/careers
🔑 Get your AssemblyAI API key here: https://www.assemblyai.com/?utm_source=youtube&utm_medium=referral&utm_campaign=yt_marco_5
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
#MachineLearning #DeepLearning