Lec 07. Scaling Rules for Optimization

Name: Lec 07. Scaling Rules for Optimization
Uploaded: 2026-02-11T14:39:31Z
Duration: 1 h 20 min 56 s
Description: This video explores neural computation from a spectral perspective, discusses feature learning and hyperparameter transfer, and presents scaling rules for transferring hyperparameters across network width and depth.
