AI RESEARCH

Generating and Refining Dynamic Evaluation Rubrics for LLM-as-a-Judge

arXiv cs.CL

arXiv:2605.30568v1 Announce Type: new LLM-as-a-Judge is a scalable alternative to human evaluation, yet existing rubric-based methods rely on human-annotated data such as reference answers or expert-crafted rubrics. We propose to automatically generate fine-grained evaluation rubrics without any human annotation. Our