OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

by erlend_shon 4/26/2025, 2:30 PMwith 1 comments

by erlend_shon 4/26/2025, 2:34 PM

Paper: https://arxiv.org/pdf/2504.07096 (original blog title was too long so I used the paper’s title)

HF: https://huggingface.co/allenai/OLMo-2-0325-32B-Instruct

GH: https://github.com/allenai/infinigram-api