DeepSeek v3 beats Claude sonnet 3.5 and way cheaper

by helloericsfon 12/26/2024, 11:47 AMwith 9 comments

by patrickhogan1on 12/26/2024, 2:17 PM

It does not beat Claude Sonnet 3.5 on SWE Bench (42 to Claude's 50). It chooses 4 benchmarks of the 100s of available benchmarks and then decides it "beats" Claude Sonnet 3.5.

by Jet_Xuon 12/30/2024, 7:40 AM

Please refer to my recent AI Code review performance test include DeepSeek V3: https://news.ycombinator.com/item?id=42547196

by sam_goodyon 12/26/2024, 5:17 PM

What are the minimum and recommended amounts of RAM, hard disk space, CPU or GPU to run this locally.

As someone who just follows this stuff from afar, it is hard for me to conceptualize if this is a SaaS only model, or if it means we are getting to the point where you can have a A1 model on a local machine.