Evolution of Autoencoders - Explained!
Machine-readable: Markdown · JSON API · Site index
Описание видео
In this video, we take a look at a core component of DALL-E text-to-image generation: discrete autoencoders. What is it? Why do we have it? How does it look? We specifically looks at vanilla Autoencoders, Variational Auto-encoders and VQ-VAEs.
ABOUT ME
⭕ Subscribe: https://www.youtube.com/c/CodeEmporium?sub_confirmation=1
📚 Medium Blog: https://medium.com/@dataemporium
💻 Github: https://github.com/ajhalthor
👔 LinkedIn: https://www.linkedin.com/in/ajay-halthor-477974bb/
RESOURCES
[1 📚] Slides: https://link.excalidraw.com/p/readonly/GElfVd51jvXhYuQ6yjns
[2 📚] Paper that suggests Autoencoders address “back propagation without a teacher”: https://proceedings.mlr.press/v27/baldi12a/baldi12a.pdf
[3 📚] Early paper on auto encoders that compared performance vs PCA (2006): https://www.cs.toronto.edu/~hinton/absps/science.pdf
[4 📚] 2013 VAE paper: https://arxiv.org/abs/1312.6114?utm_source=chatgpt.com
[5 📚] 2017 VQ-VAE paper: https://papers.nips.cc/paper_files/paper/2017/file/7a98af17e63a0ac09ce2e96d03992fbc-Paper.pdf
[6 📚] 2017 paper on discrete variational auto encoders: https://arxiv.org/pdf/1609.02200
[7 📚] A digestible, yet formal introduction to discrete VAE: https://arxiv.org/pdf/2505.10344
[8 📚] Paper that shows how to recover from posterior collapse: https://openreview.net/pdf/729562a11b8fe6b0af7244d73dea98ec6c5f8376.pdf?utm_source=chatgpt.com
PLAYLISTS FROM MY CHANNEL
⭕ Reinforcement Learning: https://youtube.com/playlist?list=PLTl9hO2Oobd9kS--NgVz0EPNyEmygV1Ha&si=AuThDZJwG19cgTA8
Natural Language Processing: https://youtube.com/playlist?list=PLTl9hO2Oobd_bzXUpzKMKA3liq2kj6LfE&si=LsVy8RDPu8jeO-cc
⭕ Transformers from Scratch: https://youtube.com/playlist?list=PLTl9hO2Oobd_bzXUpzKMKA3liq2kj6LfE
⭕ ChatGPT Playlist: https://youtube.com/playlist?list=PLTl9hO2Oobd9coYT6XsTraTBo4pL1j4HJ
⭕ Convolutional Neural Networks: https://youtube.com/playlist?list=PLTl9hO2Oobd9U0XHz62Lw6EgIMkQpfz74
⭕ The Math You Should Know : https://youtube.com/playlist?list=PLTl9hO2Oobd-_5sGLnbgE8Poer1Xjzz4h
⭕ Probability Theory for Machine Learning: https://youtube.com/playlist?list=PLTl9hO2Oobd9bPcq0fj91Jgk_-h1H_W3V
⭕ Coding Machine Learning: https://youtube.com/playlist?list=PLTl9hO2Oobd82vcsOnvCNzxrZOlrz3RiD
MATH COURSES (7 day free trial)
📕 Mathematics for Machine Learning: https://imp.i384100.net/MathML
📕 Calculus: https://imp.i384100.net/Calculus
📕 Statistics for Data Science: https://imp.i384100.net/AdvancedStatistics
📕 Bayesian Statistics: https://imp.i384100.net/BayesianStatistics
📕 Linear Algebra: https://imp.i384100.net/LinearAlgebra
📕 Probability: https://imp.i384100.net/Probability
OTHER RELATED COURSES (7 day free trial)
📕 ⭐ Deep Learning Specialization: https://imp.i384100.net/Deep-Learning
📕 Python for Everybody: https://imp.i384100.net/python
📕 MLOps Course: https://imp.i384100.net/MLOps
📕 Natural Language Processing (NLP): https://imp.i384100.net/NLP
📕 Machine Learning in Production: https://imp.i384100.net/MLProduction
📕 Data Science Specialization: https://imp.i384100.net/DataScience
📕 Tensorflow: https://imp.i384100.net/Tensorflow
CHAPTERS
00:00 What is DALL-E?
01:08 Autoencoders
02:59 What are Variational Autoencoders?
03:22 How VAE is better suited for generation than AE.
05:18 VAE structure and forward pass
07:09 Reparameterization trick
12:09 VAE loss function
13:49 VAE inference
14:44 What is VQ-VAE, forward pass, loss
18:06 Straight through estimator
20:08 Posterior Collapse
23:52 Discrete representations
25:00 Compatibility with sequence models (and DALL-E)
25:56 Quiz Time
26:52 Summary