Skip to content

Instantly share code, notes, and snippets.

View bug39's full-sized avatar
๐ŸŒŽ

bug bug39

๐ŸŒŽ
  • Berkeley, CA
  • 16:21 (UTC -07:00)
  • X @bug39
View GitHub Profile
@bug39
bug39 / microgpt.py
Created February 11, 2026 23:10 — forked from karpathy/microgpt.py
microgpt
"""
The most atomic way to train and inference a GPT LLM in pure, dependency-free Python.
Differences from GPT-2 are minor: rmsnorm instead of layer norm, no biases, square ReLU instead of GeLU nonlinearity, no weight tying.
The contents of this file is everything algorithmically needed to train a GPT. Everything else is just efficiency.
Art project by @karpathy.
"""
import os # for os.path.exists
import math # for math.log, math.exp
import random # for random.seed, random.choices