Skip to content

Instantly share code, notes, and snippets.

View Physicworld's full-sized avatar

SundayTheQuant Physicworld

  • University of Guanajuato
  • Guanajuato, México
  • 21:13 (UTC -06:00)
View GitHub Profile
@Physicworld
Physicworld / microgpt.py
Created February 11, 2026 23:25 — forked from karpathy/microgpt.py
microgpt
"""
The most atomic way to train and inference a GPT LLM in pure, dependency-free Python.
Differences from GPT-2 are minor: layer norm -> rmsnorm, no biases, GeLU -> square ReLU, no weight tying.
The contents of this file is everything algorithmically needed to train a GPT. Everything else is just efficiency.
Art project by @karpathy.
"""
import os # for os.path.exists
import time # for time.perf_counter
import math # for math.log, math.exp