Heavily optimized single C file that can train the same model as Karpathy's microgpt to lower loss in under a second on a single Mac core.
Heavily optimized single C file that can train the same model as Karpathy's microgpt to lower loss in under a second on a single Mac core.