> Throughout this series, “we” refers to maderix (human) and Claude Opus 4.6 (by Anthropic) working as a pair. The reverse engineering, benchmarking, and training code were developed collaboratively.
Sure, "collaboratively." Why would I ever trust a vibe-coded analysis? How do I, a non-expert in this niche, know that Opus isn't pulling a fast one on both of us? LLMs write convincing bullshit that fools even experts. Have you manually verified each fact in this piece? I doubt it. Thanks for the disclaimer, it saved me from having to read it.
Actually… no. Now that you mention it, and thanks for the interesting thought, the failure modes seem pretty similar to me.
Shoddy research / hallucination, tendency to lose the thread, lack of historical / background context… the failure modes are at least qualitatively similar.
Show me an LLM failure and I'll show you a high-profile journalist busted for the same thing. And those are humans whose whole job is getting these things right!
Humans as a class are error-prone, but some humans in their respective fields are very, very good. It's often not terribly hard to figure out from resume and credentials who these folks are, and as a shortcut we can look for markers (terminology, specifics, confidence) when the stakes are low, like deciding what to read rather than cancer care for your mom.
AI can trip all the right signals to fool these shortcuts while sometimes being entirely full of shit, and it has no resume or credentials to verify should we want to check.
If you have such credentials and vouch for the work, I can consider your trustworthiness rather than the AI's. If you admit you yourself are reliant on it, then this no longer holds.
Humans also write endless amounts of convincing bullshit, and have since time immemorial. False papers and faked results were a growing scourge in academia before LLMs were a thing, and that's just the intentional fraud; the reproducibility crisis in science, especially medical and psychological science, affects even the best-designed and most well-intentioned studies.
Humans also make mistakes and assumptions while reverse engineering, so the results will always need more engineers to go through them and test things.
Benchmarks are all in part 2. Training progress is in part 3 (upcoming).
Also, I think AI-human collaboration is important for goal management.
Sure, LLMs bullshit all the time, but it's the role of the human to set good goals and the gating criteria for what counts as good.