3 big parts: Forward pass Calculate model generation Backward pass Calculate gradient Update model, minimizing loss I like this medium article, lotsa good idea