A Visually Impaired Assistance Benchmark for VLM-as-a-Judge Evaluation

ArXi:2605.31351v1 Announce Type: new AI-based Visually Impaired Assistance (VIA) remains challenging, largely due to the high cost of human evaluation. The VLM-as-a-Judge paradigm may offer a promising alternative, although it has mostly been studied in general domains. We therefore ask whether such judges can be trusted for VIA tasks. To investigate this question, we