AlbanianLLMSafety: A Safety Evaluation Dataset for Large Language Models in Albanian

ArXi:2605.26954v1 Announce Type: new Safety evaluation of Large Language Models (LLMs) has largely focused on high-resource languages, leaving low-resource languages critically underserved. We present AlbanianLLMSafety, the first publicly available safety evaluation dataset for LLMs in Albanian, a linguistically distinct low-resource language with approximately 7.5M speakers across Albania, Kosovo, North Macedonia, and the diaspora.