Does RLHF wreck a language model's calibration? Not quite. The calibrated signal relocates rather than vanishes: a confidence the model states in words tracks accuracy better than its own token probabilities, often halving ECE. For instruction-tuned models, calibration becomes an elicitation problem more than a recalibration one. Just ask for the number.https://benjaminhan.net/posts/20260610-just-ask-calibration/?utm_source=mastodon&utm_medium=social#LLMs #Calibration #Metacognition #EMNLP #AI
Related
🎮 Send Crimson Desert your "fun and incredible videos", and you could win an all-expenses-paid trip to South KoreaCrimso...
🎮 Send Crimson Desert your "fun and incredible videos", and you could win an all-expenses-paid trip to South KoreaCrimson Desert, the open-world action RPG from Pearl Abyss, has a ...
Anthropic Releases ‘Safe’ Version of Its Mythos A.I. Technology. Via @nytimes #AI #ArtificialIntelligence 💻 🧠nytimes.com...
Anthropic Releases ‘Safe’ Version of Its Mythos A.I. Technology. Via @nytimes #AI #ArtificialIntelligence 💻 🧠nytimes.com/2026/06/09/tec...
WWDC26: The Talk Show livestream returning to Theater for Apple Vision ProMissing out on this week’s WWDC fun in Califor...
WWDC26: The Talk Show livestream returning to Theater for Apple Vision ProMissing out on this week’s WWDC fun in California? Apple Vision Pro users can virtually join tonight’s The...