View on GitHubGitHub

Neural Networks: Zero to Hero

Building makemore Part 3: Activations & Gradients, BatchNorm

Course videos

01The spelled-out intro to neural networks and backpropagation: building micrograd

02The spelled-out intro to language modeling: building makemore

03Building makemore Part 2: MLP

04Building makemore Part 3: Activations & Gradients, BatchNorm

05Building makemore Part 4: Becoming a Backprop Ninja

06Building makemore Part 5: Building a WaveNet

07Let's build GPT: from scratch, in code, spelled out.

08State of GPT | BRK216HFS

09Let's build the GPT Tokenizer

10Let's reproduce GPT-2 (124M)

Loading player

Transcript

6088 segments