Home / Concepts / Explanation Correctness Thresholds

Explanation Correctness Thresholds

Auto-generated stub. Edit this file to add more details.

Empirically determined thresholds in explanation correctness (e.g., 70%) below which further degradation does not significantly impact human performance or learning of the model’s decision patterns.

Why It Matters

The finding that functional correctness degradation beyond a certain (70%) threshold does not cause further human understanding loss is a critical insight for metric design.

Evidence

performance dropped at 70% and 55% correctness relative to fully correct explanations, while further degradation below 70% produced no additional loss

openalex-2603.25251-does-explanation-correctness-matter-linking-computational-xa

Metadata & Links

source_papers: [[openalex-2603.25251-does-explanation-correctness-matter-linking-computational-xa]]
created_at: 2026-03-29T06:09:00Z

Explanation Correctness Thresholds

Why It Matters

Evidence

Related Papers

Metadata & Links