Mar 20, 2025 CUDA from Scratch - Matrix Multiplication, Memory Models, and the Road to RL Acceleration