AI RESEARCH
Telenor Nordics Customer Service self-help corpus
arXiv CS.CL
•
ArXi:2605.26891v1 Announce Type: new This paper presents a multilingual customer service self-help corpus comprising 1,122 manually validated documents in Finnish, Danish, Norwegian, and Swedish, totaling over one million tokens. The documents have been sourced from the public self-help pages of four Nordic telecommunications operators and subsequently filtered for person-identifiable information and relevance through a combined LLM and human annotation pipeline.