AI RESEARCH
Diagnosis Is Not Prescription: Linguistic Co-Adaptation Explains Patching Hazards in LLM Pipelines
arXiv CS.CL
•
ArXi:2605.21958v1 Announce Type: new When a multi-module LLM agent fails, the module most responsible for the failure is not necessarily the best place to intervene. We nstrate this Diagnostic Paradox empirically: causal analysis consistently identifies the routing module -- which selects which tool to call next -- as the primary bottleneck across three independent agent families. Yet injecting prompt-level correction examples into this module consistently degrades performance, sometimes severely. Patching an upstream query-rewriting module instead reliably improves outcomes.