Reflexion splits self-correction in two: an Evaluator that detects success/failure, and a Self-Reflection model that dia...

Reflexion splits self-correction in two: an Evaluator that detects success/failure, and a Self-Reflection model that diagnoses what went wrong. The Evaluator's external signal — heuristic, exact-match, or test execution — gates whether diagnosis fires. When that signal misfires, as on MBPP Python's high false-negative rate, Self-Reflection rewrites correct code wrong, exactly the failure mode Cannot-Self-Correct documented.https://benjaminhan.net/posts/20260516-reflexion/?utm_source=mastodon&utm_medium=social#LLMs #AI #Reasoning #Agents #Metacognition

Read Original

Related