Peregrinations - Models | Peregrinations

AI capability scales predictably with compute, parameters, and tokens. Understand the power laws and optimal boundaries that dictate frontier training.

Read chapter

Ch. 04

Birth of a Model (The Pre-training Run)

A training run is a months-long industrial process that turns data, compute, and engineering discipline into a new capability curve.

Read chapter

Ch. 05

Alignment & Direct Instruction (SFT & Post-Training)

The base model learns broad capability. Post-training decides whether that capability behaves like a useful colleague, a tool user, or a liability.

Read chapter

Ch. 06

Policy Optimization (RLHF & DPO)

Pretraining learns from examples; reinforcement learning learns from outcomes. It is the loop behind RLHF, reasoning models, and agents.

Read chapter

Ch. 07

Reasoning & Test-Time Search (Inference Compute)

Scaling compute at inference time. Moving from instant feed-forward responses (System 1) to search, verification, and self-correction (System 2).

Read chapter

Ch. 08

Serving Physics (Quantization & Precision)

Precision is the lever. A weight is a number, and numbers have a size. Count the parameters, multiply by the bytes each takes at FP16, FP8, or FP4, and read the memory bill.

Read chapter

Ch. 09

The Context Wall (FlashAttention, GQA, & SSMs)

Long context feels like memory. Under the hood it is a live working set that has to be read, routed, cached, and paid for.

Read chapter

Ch. 10

Which Model? (Workflow Trade-offs)

A model choice is a workflow choice: capability, latency, cost, context, privacy, tool use, modality, and failure tolerance all matter.

Read chapter