Intermediate

Machine Learning from Scratch: The Systems View (Advanced)

A 6-week intensive program designed to transform ML practitioners into ML Systems Engineers by building a complete mini-PyTorch framework from first principles. Learners will implement automatic differentiation, neural network layers, optimizers, and GPU acceleration basics while understanding the computational and numerical considerations that drive production ML systems. By coupling each implementation with performance optimization and framework comparisons, students develop the deep systems knowledge required for mid-level engineering roles where architectural decisions and production trade-offs are daily responsibilities.

Firas BayramMachine Learning

2 students enrolled

Modules — Structured path

Interactive AI lessons

Flexible

Self-paced

Intermediate

Recommended experience

What You'll Learn

Implement a complete automatic differentiation engine supporting both scalar and tensor operations with dynamic computational graph construction

Build neural network primitives (linear layers, activations, batch normalization) from scratch with proper initialization strategies to prevent vanishing/exploding gradients

Code and analyze gradient-based optimization algorithms (SGD, Momentum, Adam) understanding their convergence properties and hyperparameter sensitivities

Write and profile basic CUDA kernels to understand GPU memory hierarchies and parallelism trade-offs in ML systems

Assemble a trainable mini-PyTorch framework and train a convolutional neural network achieving >70% accuracy on CIFAR-10

Benchmark custom implementations against PyTorch to understand the engineering trade-offs in production ML systems

Skills You'll Gain

124 skills

Computational graph constructionChain rule applicationBackpropagation intuitionPython operator overloadingDynamic computational graph constructionAutograd preparation for reverse-mode differentiationTopological sorting on DAGsReverse-mode gradient propagationNumerical validation of derivatives

Program Structure

The Chain Rule as a Computational Graph

Implementing the Value Class with Forward Ops

Reverse-Mode Differentiation: Topological Sort and Backward Pass

Extending to Non-Linearities: Tanh and ReLU

Debugging Gradient Flow with Visualization

Building a Scalar Neural Network

Comparing to PyTorch's Autograd: Design Choices

Meet Your Expert

Firas Bayram

Machine Learning

Artificial Intelligence researcher and university instructor specializing in adaptive learning systems, large language models, and generative AI. Passionate about bridging cutting-edge AI research with real-world applications and helping leaders adopt modern practical, data-driven AI solutions.

Machine Learning from Scratch: The Systems View (Advanced)

What You'll Learn

Skills You'll Gain

Program Structure

Computational Graphs and Scalar Autograd

Tensor Operations and Vectorized Autograd

Neural Network Layers and Numerical Stability

Optimization Algorithms from First Principles

GPU Acceleration Fundamentals

Building and Benchmarking Mini-PyTorch

Meet Your Expert

Firas Bayram

Ready to start learning?