Accuracy is very rarely a useful metric. It's more an engineering metric than something a user would ever care about.
What users want is to have their own credences properly calibrated by engaging with some system. From a physics textbook, they want a systematic presentation of ideas which allows them to build intuitions etc.
It's important to formulate the actual goal of the system, rather than just the engineer's goal (consider, e.g., "width of pipes" vs. "clean running water").
In the case of statistical AI systems, the goal is often best formulated in terms of the confidences of the system, not its output, since output accuracy is a nonlinear, discontinuous function of those confidences.
So from a statistical AI Q&A system we don't want The Answer; we want the system to have expert-like confidences over the possible answers.
Of course, as soon as you start formulating these metrics, all the state-of-the-art 99%+ accuracy hype evaporates, since most of these systems have terrible confidence distributions.
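To make that concrete, here's a minimal sketch (plain Python, with invented numbers, not from any real system) of two hypothetical Q&A systems that have identical accuracy but very different confidence distributions. Accuracy only looks at the argmax, so it can't tell them apart; a proper scoring rule such as a Brier-style score or log loss can.

    import math

    # Each item: (probability the system assigned to the correct answer, was its top answer correct?)
    # All numbers are invented purely for illustration.
    calibrated_system = [(0.95, True), (0.60, True), (0.55, True), (0.40, False)]
    overconfident_system = [(0.99, True), (0.99, True), (0.99, True), (0.01, False)]

    def accuracy(preds):
        # Accuracy only checks whether the top answer was right: it is flat in the
        # confidences until one crosses a decision boundary, then it jumps.
        return sum(correct for _, correct in preds) / len(preds)

    def brier(preds):
        # Simplified Brier-style score on the probability given to the correct answer.
        return sum((1.0 - p) ** 2 for p, _ in preds) / len(preds)

    def log_loss(preds):
        # Log loss heavily punishes confident wrong answers.
        return -sum(math.log(max(p, 1e-12)) for p, _ in preds) / len(preds)

    for name, preds in [("calibrated", calibrated_system), ("overconfident", overconfident_system)]:
        print(name, "acc:", accuracy(preds), "brier:", round(brier(preds), 3), "logloss:", round(log_loss(preds), 3))

Both toy systems score 75% accuracy, but the overconfident one is much worse on the Brier score and log loss, which is exactly the gap the accuracy hype hides.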
Consider, e.g., ChatGPT, whose answers are often plausibly accurate (they count as an answer) but just repeat some Silicon Valley hype in a way an expert wouldn't. ChatGPT rarely has the careful scepticism of an expert, rarely presents ideas in an even-handed way, rarely mentions the opposing view.
This makes generating reference materials on areas with expert disagreement quite dangerous: ChatGPT presents the non-expert credence distribution. (And indeed it always does, since it just models (Q, A) frequencies, which are not truth-apt.)
This is mixing two meanings of confidence, which could lead to confusion. The OP is using confidence to describe how high the per-token probability scores are, while you are talking about the confidence expressed in the tone of voice of the language generated by the model. Really those are orthogonal issues. (E.g., a model could predict with high probability that an output should be "I don't know".)
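A toy illustration of that orthogonality (hypothetical answer distribution, numbers invented):

    # The model's statistical confidence is high, yet the content it is
    # confident in is itself an epistemic hedge.
    next_answer_distribution = {
        "I don't know": 0.90,       # high probability on a hedged answer
        "Yes, definitely": 0.06,    # low probability on a confident-sounding claim
        "No": 0.04,
    }
    top_answer = max(next_answer_distribution, key=next_answer_distribution.get)
    print(top_answer)  # -> "I don't know": high model confidence, low expressed confidence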
I'm saying that, as a matter of fact, ChatGPT should have different confidences in propositions. My issue isn't the tone of voice; my issue is that the content of what it's saying is wrong with respect to what we care about, i.e., expert credences (/confidences) in the claims it's generating.
It can "express confidently" scepticism; it does not. That's the issue.
In my language above I was mostly using "credence" to talk about the strength of the mental state of belief, and "confidence" to talk about the model of that used in statistical AI.