MIT 6.7960 Deep Learning, Fall 2024
Video 8 of 10
Lec 08. Architectures: Transformers
1:14:35
This video introduces transformers, focusing on three key ideas: tokens, attention, and positional codes. It also explores how transformers relate to MLPs, GNNs, and CNNs as variations on common principles.