Video details loaded
HomeMIT 6.7960 Deep Learning, Fall 2024Lec 08. Architectures: Transformers

Lec 08. Architectures: Transformers

1:14:35

Up Next

Lec 09. Hacker's Guide to Deep Learning

Continue

This video introduces transformers, focusing on three key ideas: tokens, attention, and positional codes. It also explores how transformers relate to MLPs, GNNs, and CNNs as variations on common principles.