Mastodon discussion Discussions 3d ago 1 views

Does RLHF wreck a language model's calibration? Not quite. The calibrated signal relocates rather than vanishes: a confi...

by Benjamin Han

Does RLHF wreck a language model's calibration? Not quite. The calibrated signal relocates rather than vanishes: a confidence the model states in words tracks accuracy better than its own token probabilities, often halving ECE. For instruction-tuned models, calibration becomes an elicitation problem more than a recalibration one. Just ask for the number.https://benjaminhan.net/posts/20260610-just-ask-calibration/?utm_source=mastodon&utm_medium=social#LLMs #Calibration #Metacognition #EMNLP #AI

Read Original

LLM

Metadata

Reblogs Count: 1
Account: BenjaminHan@sigmoid.social

Mastodon discussion 14m ago

🎮 Send Crimson Desert your "fun and incredible videos", and you could win an all-expenses-paid trip to South KoreaCrimso...

🎮 Send Crimson Desert your "fun and incredible videos", and you could win an all-expenses-paid trip to South KoreaCrimson Desert, the open-world action RPG from Pearl Abyss, has a ...

Mastodon discussion 19m ago

Anthropic Releases ‘Safe’ Version of Its Mythos A.I. Technology. Via @nytimes #AI #ArtificialIntelligence 💻 🧠nytimes.com...

Anthropic Releases ‘Safe’ Version of Its Mythos A.I. Technology. Via @nytimes #AI #ArtificialIntelligence 💻 🧠nytimes.com/2026/06/09/tec...

Mastodon discussion 19m ago

WWDC26: The Talk Show livestream returning to Theater for Apple Vision ProMissing out on this week’s WWDC fun in Califor...

WWDC26: The Talk Show livestream returning to Theater for Apple Vision ProMissing out on this week’s WWDC fun in California? Apple Vision Pro users can virtually join tonight’s The...

Does RLHF wreck a language model's calibration? Not quite. The calibrated signal relocates rather than vanishes: a confi...

Metadata

Related

🎮 Send Crimson Desert your "fun and incredible videos", and you could win an all-expenses-paid trip to South KoreaCrimso...

Anthropic Releases ‘Safe’ Version of Its Mythos A.I. Technology. Via @nytimes #AI #ArtificialIntelligence 💻 🧠nytimes.com...

WWDC26: The Talk Show livestream returning to Theater for Apple Vision ProMissing out on this week’s WWDC fun in Califor...