Tutorials·January 7, 2026·11 min readCS336 Notes: Lecture 5 - GPUsGPU fundamentals for LLM training: memory hierarchy, arithmetic intensity, kernel optimization, FlashAttention, and bandwidth limits.machine-learninggpustanford-cs336hardwareRead