LLMs' "simulated reasoning" abilities are a "brittle mirage," researchers find

by merksittichon 8/11/2025, 9:45 PMwith 2 comments

by merksittichon 8/11/2025, 9:47 PM

Preprint discussed in the article:

https://arxiv.org/pdf/2508.01191

by verdvermon 8/11/2025, 11:48 PM

fyi, this is about Chain-of-thought, not <think>, is that still being used?

haven't read the paper closely enough to comment on the methods