AI RESEARCH
SCRIBE: Diagnostic Evaluation and Rich Transcription Models for Indic ASR
arXiv CS.CL
•
ArXi:2605.20712v1 Announce Type: new Automatic speech recognition replaces typing only when correction costs less than manual entry, a threshold determined by error types, not counts: fixing a misrecognized domain term costs far than inserting a comma. Word error rate (WER) fails on two fronts: it collapses distinct error categories into a single scalar, and it structurally penalizes agglutinative languages where valid sandhi merges inflate scores. We