Notebooks — Overview
(1) Activation Functions |
(2) Artificial Neural Networks (Basic Architecture) |
(3) Attention & Multi-Head Attention |
(4) Backpropagation |
(5) Backpropagation (Generalization) |
(6) Backpropagation Through Time (BPTT) |
(7) Bias & Variance (Machine Learning) |
(8) Bias-Variance Decomposition |
(9) Building a GPT-Style LLM from Scratch |
(10) Building a Word Tokenizer from Scratch |
(11) Byte-Pair Encoding Tokenization |
(12) Curse of Dimensionality |
(13) Data Batching for Training LLMs |
(14) Data Normalization — Motivation & Overview |
(15) Data Preparation for Training LLMs — An Overview |
(16) Decision Trees |
(17) Decision Trees — CART (Classification and Regression Trees) |
(18) Decision Trees — Implementation from Scratch |
(19) Dropout |
(20) Gradient Descent with Momentum |
(21) Gradient Descent — The (Very) Basics |
(22) Handwritten Digit Recognition with Artificial Neural Networks (ANNs) |
(23) Implementing an ANN from Scratch (NumPy only) |
(24) Language Models |
(25) Layer Normalization |
(26) Linear Regression |
(27) Linear Regression — Assumptions & Caveats |
(28) LoRA Fine-Tuning — A Basic Example |
(29) Logistic Regression — Basics |
(30) Logistic Regression: The Math |
(31) Logit Distillation |
(32) MLP-based Language Models |
(33) Machine Translation with Transformers |
(34) Masking in Sequence Models |
(35) Mixture of Experts (MoE) |
(36) Model Fine-Tuning for LLMs — An Overview |
(37) Multinomial Naive Bayes (Basics) |
(38) NumPy — Basic Tutorial |
(39) Part-of-Speech (POS) Tagging (Basics) |
(40) Porter Stemmer |
(41) Positional Encodings — Overview |
(42) Principal Component Analysis (PCA) |
(43) RNN-based Language Models |
(44) Random Forests |
(45) Recurrent Neural Networks — An Introduction |
(46) Residual Connections |
(47) Resource-Efficient LLMs — An Overview |
(48) Retrieval-Augmented Generation (RAG) — A (Very) Basic Example |
(49) Retrieval-Augmented Generation (RAG) — Basics |
(50) Rotary Position Embeddings (RoPE) |
(51) Sinusoidal Positional Encodings (Original Transformer) |
(52) Stemming & Lemmatization |
(53) Subword Tokenization (WordPiece) |
(54) Text Classification with Recurrent Neural Networks (RNNs) |
(55) Text Normalization |
(56) Text Tokenization |
(57) The AdaGrad Optimizer |
(58) The Adam Optimizer |
(59) The Linear Layer |
(60) The Math Behind Linear Regression |
(61) The RMSProp Optimizer |
(62) The Softmax Function |
(63) Token Indexing with Vocabularies |
(64) Training Word2Vec from Scratch |
(65) Transformers — Basic Architecture |
(66) Using Pretrained LLMs Locally — A Starter Guide |
(67) Vector Space Model |
(68) Word & Text Embeddings — An Overview |
(69) Working with Batches for Sequence Tasks |
(70) Working with the OpenAI API — An Introduction |
(this list of notebooks is auto-generated)