Tag

Transformers

Tag

Browse Transformers

1.4 Mathematics of Large Language Models: Training, Inference, Attention, Scaling, and Alignment

We build the mathematics behind modern large language models from scratch: text as probability, tokenization, embeddings, transformers, attention, loss functions, backpropagation, …

May 1, 2026 · 5 min read

Open →

Course

1.5 Mathematics of Machine Learning and Deep Learning for Time Series

We build the mathematics of time-series learning from scratch: sequences, forecasting, stationarity, autocorrelation, ARIMA, state-space models, Kalman filtering, recurrent …

May 1, 2026 · 16 min read

Open →