AI RESEARCH
Best Text to Text Translation Model? [D]
r/MachineLearning
•
I'm working on a project that translates any language into English. So far, I've tried NMT models like NLLB, MADLAD, and SeamlessM4T v2. The main issue is that they struggle with proper nouns such as: - names - places - dates - organizations I also tried LLMs like Gemma 4, Qwen 3 4B, and Aya Tiny Global, but the issue still persists. The LLMs sometimes partially translate or modify entity names as well. I even tried NER masking / placeholder replacement before translation, but multilingual NER itself becomes a bottleneck.