Notebooks — Overview
(1) Activation Functions
(2) Artificial Neural Networks (Basic Architecture)
(3) Attention & Multi-Head Attention
(4) Backpropagation
(5) Backpropagation (Generalization)
(6) Backpropagation Through Time (BPTT)
(7) Bias & Variance (Machine Learning)
(8) Bias-Variance Decomposition
(9) Building a GPT-Style LLM from Scratch
(10) Building a Word Tokenizer from Scratch
(11) Byte-Pair Encoding Tokenization
(12) Curse of Dimensionality
(13) Data Batching for Training LLMs
(14) Data Normalization — Motivation & Overview
(15) Data Preparation for Training LLMs — An Overview
(16) Decision Trees
(17) Decision Trees — CART (Classification and Regression Trees)
(18) Dropout
(19) Gradient Descent with Momentum
(20) Gradient Descent — The (Very) Basics
(21) Handwritten Digit Recognition with Artificial Neural Networks (ANNs)
(22) Implementing an ANN from Scratch (NumPy only)
(23) Language Models
(24) Linear Regression
(25) Linear Regression — Assumptions & Caveats
(26) LoRA Fine-Tuning — A Basic Example
(27) Logistic Regression — Basics
(28) Logistic Regression — The Math
(29) Logit Distillation
(30) Machine Translation with Transformers
(31) Masking in Sequence Models
(32) Mixture of Experts (MoE)
(33) Model Fine-Tuning for LLMs — An Overview
(34) Multinomial Naive Bayes (Basics)
(35) NumPy — Basic Tutorial
(36) Part-of-Speech (POS) Tagging (Basics)
(37) Porter Stemmer
(38) Positional Encodings — Overview
(39) RNN-based Language Models
(40) Recurrent Neural Networks — An Introduction
(41) Resource-Efficient LLMs — An Overview
(42) Retrieval-Augmented Generation (RAG) — A (Very) Basic Example
(43) Retrieval-Augmented Generation (RAG) — Basics
(44) Rotary Position Embeddings (RoPE)
(45) Sinusoidal Positional Encodings (Original Transformer)
(46) Stemming & Lemmatization
(47) Subword Tokenization (WordPiece)
(48) Text Classification with Recurrent Neural Networks (RNNs)
(49) Text Normalization
(50) Text Tokenization
(51) The AdaGrad Optimizer
(52) The Adam Optimizer
(53) The Linear Layer
(54) The Math Behind Linear Regression
(55) The RMSProp Optimizer
(56) The Softmax Function
(57) Token Indexing with Vocabularies
(58) Training Word2Vec from Scratch
(59) Transformers — Basic Architecture
(60) Using Pretrained LLMs Locally — A Starter Guide
(61) Vector Space Model
(62) Word & Text Embeddings — An Overview
(63) Working with Batches for Sequence Tasks
(64) Working with the OpenAI API — An Introduction
(This list of notebooks is auto-generated.)