AI RESEARCH
K-FinHallu: A Hallucination Detection Benchmark for Multi-Turn RAG in Korean Finance
arXiv cs.LG
arXiv:2605.29523v1 (announce type: new)

Large Language Models (LLMs) have advanced financial automation through Retrieval-Augmented Generation (RAG), yet hallucinations remain a critical barrier to deployment in high-stakes environments. Existing benchmarks focus on single-turn, English-centric tasks, leaving the multi-turn dynamics and linguistic-regulatory nuances of the Korean financial domain unaddressed. We