AI RESEARCH

Semantic-Aware Interpretable Multimodal Music Auto-Tagging

arXiv CS.LG

ArXi:2505.17233v3 Announce Type: replace Music auto-tagging is essential for organizing and discovering music in extensive digital libraries. While foundation models achieve exceptional performance in this domain, their outputs often lack interpretability, limiting trust and usability for researchers and end-users alike. In this work, we present an interpretable framework for music auto-tagging that leverages groups of musically meaningful multimodal features, derived from signal processing, deep learning, ontology engineering, and natural language processing.