>"Reasoning", however, is a feature that has been bolted on with a hacksaw and d...

direwolf20 · 2026-01-28T18:30:22 1769625022

it's meant in the literal sense but with metaphorical hacksaws and duct tape.

Early on, some advanced LLM users noticed they could get better results by forcing insertion of a word like "Wait," or "Hang on," or "Actually," and then running the model for a few more paragraphs. This would increase the chance of a model noticing a mistake it made.

Reasoning is basically this.

charcircuit · 2026-01-28T18:57:11 1769626631

It's not just force inserting a word. Reasoning is integrated into the training process of the model.

samrus · 2026-01-29T19:40:53 1769715653

Not the core foundation model. The foundation model still only predicts the next token in a static way. The reasoning is tacked onto the instructGPT style finetuning step and its done through prompt engineering. Which is the shittiest way a model like this could have been done, and it shows