AI RESEARCH

Bridging the Gap: Transfer Learning from English PLMs to Malaysian English

arXiv CS.CL

ArXi:2407.01374v2 Announce Type: replace Malaysian English is a low resource creole language, where it carries the elements of Malay, Chinese, and Tamil languages, in addition to Standard English. Named Entity Recognition (NER) models underperform when capturing entities from Malaysian English text due to its distinctive morphosyntactic adaptations, semantic features and code-switching (mixing English and Malay). Considering these gaps, we