Hy-MT2: A Family of Fast, Efficient and Powerful Multilingual Translation Models in the Wild

📅 2026-05-21
📈 Citations: 0
Influential: 0
📄 PDF

career value

228K/year
🤖 AI Summary
This work addresses the challenge of balancing efficiency, performance, and deployment cost in multilingual translation models for real-world complex scenarios by introducing a family of models supporting 33 languages across three scales: 1.8B, 7B, and 30B-A3B (Mixture-of-Experts). Leveraging a Mixture-of-Experts architecture, AngelSlim 1.25-bit extreme quantization, and multilingual instruction fine-tuning, the study achieves a high-performance lightweight model (1.8B at only 440 MB). Experimental results demonstrate that the 1.8B model outperforms commercial APIs from Microsoft and Doubao while achieving 1.5× faster inference; the 7B and 30B models surpass DeepSeek-V4-Pro and Kimi K2.6 in fast-thinking mode, establishing state-of-the-art performance across general, commercial, domain-specific, and instruction-following tasks compared to leading open-source and commercial systems.
📝 Abstract
Hy-MT2 is a family of fast-thinking multilingual translation models designed for complex real-world scenarios. It includes three model sizes: 1.8B, 7B, and 30B-A3B (MoE), all of which support translation among 33 languages and effectively follow translation instructions in multiple languages. For on-device deployment, with AngelSlim 1.25-bit extreme quantization, the 1.8B model requires only 440 MB of storage and improves inference speed by 1.5x. Multi-dimensional evaluations show that Hy-MT2 delivers outstanding performance across general, real-world business, domain-specific, and instruction-following translation tasks. The 7B and 30B models outperform open-source models such as DeepSeek-V4-Pro and Kimi K2.6 in fast-thinking mode, while the lightweight 1.8B model also surpasses mainstream commercial APIs from providers such as Microsoft and Doubao overall.
Problem

Research questions and friction points this paper is trying to address.

multilingual translation
real-world scenarios
instruction-following
on-device deployment
translation efficiency
Innovation

Methods, ideas, or system contributions that make the work stand out.

multilingual translation
extreme quantization
mixture-of-experts (MoE)
instruction-following
on-device deployment
🔎 Similar Papers
No similar papers found.
M
Mao Zheng
Z
Zheng Li
T
Tao Chen
B
Bo Lv
M
Mingrui Sun
M
Mingyang Song
J
Jinlong Song
H
Hong Huang
D
Decheng Wu
H
Hai Wang
Y
Yifan Song
Y
Yanfeng Chen
G
Guanwei Zhang
G
Guanghua Yu
Y
Yi Su
H
Hong Liu
J
Jinxiang Ou
Keyao Wang
Keyao Wang
Baidu Inc.
deep learningface-anti spoofingcomputer vision
W
Weile Chen
H
Haozhao Kuang
K
Kai Wang
N
Nuo Chen
Z
Zihao Zheng
Chenhao Wang
Chenhao Wang
Tencent
Natural Language ProcessingLarge Language Models
B
Bin Xing