MegaScale: Scaling Large Language Model Training to More Than 10k GPUs [pdf]

by yankcrime on 11/4/2024, 7:46 PM with 0 comments
