Happy to see the term "inference-time compute" being used widely nowadays - it's a much more precise and appropriate term than the unwieldy "test-time compute" that OpenAI called it back when they thought they had "invented" scaling inference-time compute.
The linked "Bitter Lesson" essay by Rich Sutton is so good!
What's the point of such inference-time compute if the verifier is itself an 8B model? Am I missing something?
ELI5?
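For anyone puzzled by the verifier question above, here's a minimal sketch of the best-of-N idea being discussed (generate_candidate and verifier_score are hypothetical stand-ins for the generator LLM and the 8B reward model, not the blog's actual code). The intuition: the verifier only has to rank complete solutions, which is often easier than producing them, so even a small verifier can usefully pick among many generator samples.

    import random

    def generate_candidate(prompt: str) -> str:
        """Stand-in for sampling one solution from the generator model."""
        return f"{prompt} -> candidate #{random.randint(0, 9999)}"

    def verifier_score(prompt: str, candidate: str) -> float:
        """Stand-in for a reward model scoring a (prompt, candidate) pair."""
        return random.random()

    def best_of_n(prompt: str, n: int = 16) -> str:
        """Sample n candidates, return the one the verifier scores highest."""
        candidates = [generate_candidate(prompt) for _ in range(n)]
        return max(candidates, key=lambda c: verifier_score(prompt, c))

    print(best_of_n("Solve: 12 * 13 = ?", n=16))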
Full blog is here: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling...
Happy to answer any questions about these methods.