Skip to content

Blog

Deep-learning

Tutorials··11 min read

CS336 Notes: Lecture 5 - GPUs

GPU fundamentals for LLM training: memory hierarchy, arithmetic intensity, kernel optimization, FlashAttention, and bandwidth limits.

Read
Tutorials··10 min read

CS336 Notes: Lecture 2 - PyTorch and Resource Accounting

Resource accounting for LLM training: compute estimates, memory budgets, dtypes, tensors, and mixed precision.

Read