Paper ID: 2410.19370

Notes on the Mathematical Structure of GPT LLM Architectures

Spencer Becker-Kahn

An exposition of the mathematics underpinning the neural network architecture of a GPT-3-style LLM.

Submitted: Oct 25, 2024