It's insane that 9b model beats gpt-oss-120b in GPQA, is highly competitive on instruction following, and beats out gpt-oss-20b in maths.
It's insane that 9b model beats gpt-oss-120b in GPQA, is highly competitive on instruction following, and beats out gpt-oss-20b in maths.