Mosaic trained a 1B parameter model on 440 GPUs for 200B tokens

by ovaistariq on 4/21/2023, 2:39 PM with 0 comments
