GigaChat Family: Efficient Russian Language Modeling Through Mixture of Experts Architecture

📅 2025-06-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the lag in developing large language models (LLMs) for Russian—largely due to prohibitive computational costs—this work introduces GigaChat, the first open-source family of Mixture-of-Experts (MoE) LLMs specifically designed for Russian. GigaChat leverages Russian-centric pretraining, supervised instruction fine-tuning, and comprehensive multi-stage evaluation on Russian benchmarks (e.g., RuEval, RACE-Ru, XGLUE), achieving high performance while substantially reducing training and inference overhead. The family comprises multiple base model sizes and corresponding instruction-tuned variants, consistently outperforming multilingual baselines such as mT5 and BLOOM on Russian understanding and generation tasks. To foster adoption, three models are publicly released on Hugging Face, accompanied by production-ready interfaces—including a REST API, Telegram bot, and web application—enabling scalable industrial deployment.

Technology Category

Application Category

📝 Abstract
Generative large language models (LLMs) have become crucial for modern NLP research and applications across various languages. However, the development of foundational models specifically tailored to the Russian language has been limited, primarily due to the significant computational resources required. This paper introduces the GigaChat family of Russian LLMs, available in various sizes, including base models and instruction-tuned versions. We provide a detailed report on the model architecture, pre-training process, and experiments to guide design choices. In addition, we evaluate their performance on Russian and English benchmarks and compare GigaChat with multilingual analogs. The paper presents a system demonstration of the top-performing models accessible via an API, a Telegram bot, and a Web interface. Furthermore, we have released three open GigaChat models in open-source (https://huggingface.co/ai-sage), aiming to expand NLP research opportunities and support the development of industrial solutions for the Russian language.
Problem

Research questions and friction points this paper is trying to address.

Develop efficient Russian language models with limited resources
Compare GigaChat performance with multilingual models
Expand NLP research opportunities for Russian language
Innovation

Methods, ideas, or system contributions that make the work stand out.

Mixture of Experts for Russian language efficiency
Open-source GigaChat models on Hugging Face
API, Telegram bot, and Web interface integration
🔎 Similar Papers
No similar papers found.
G
GigaChat team Mamedov Valentin
SaluteDevices / Moscow
E
Evgenii Kosarev
SaluteDevices / Moscow
G
Gregory Leleytner
SaluteDevices / Moscow
I
Ilya Shchuckin
SaluteDevices / Moscow
Valeriy Berezovskiy
Valeriy Berezovskiy
HSE University
deep learningmachine learningnatural language processingcomputer vision
D
Daniil Smirnov
SaluteDevices / Moscow
D
Dmitry Kozlov
SaluteDevices / Moscow
S
Sergei Averkiev
SaluteDevices / Moscow
L
Lukyanenko Ivan
SaluteDevices / Moscow
A
Aleksandr Proshunin
SaluteDevices / Moscow
A
Ainur Israfilova
SaluteDevices / Moscow
I
Ivan Baskov
SaluteDevices / Moscow
Artem Chervyakov
Artem Chervyakov
SberAI
Искусственный интеллект
E
Emil Shakirov
SaluteDevices / Moscow
M
Mikhail Kolesov
SaluteDevices / Moscow
D
Daria Khomich
SaluteDevices / Moscow
D
Darya Latortseva
SaluteDevices / Moscow
S
Sergei Porkhun
SaluteDevices / Moscow
Y
Yury Fedorov
SaluteDevices / Moscow
O
Oleg Kutuzov
SaluteDevices / Moscow
P
Polina Kudriavtseva
SaluteDevices / Moscow
S
Sofiia Soldatova
SaluteDevices / Moscow
K
Kolodin Egor
SaluteDevices / Moscow
S
Stanislav Pyatkin
SaluteDevices / Moscow
D
Dzmitry Menshykh
SaluteDevices / Moscow
G
Grafov Sergei
SaluteDevices / Moscow
E
Eldar Damirov
SaluteDevices / Moscow
K
Karlov Vladimir
SaluteDevices / Moscow
R
Ruslan Gaitukiev
SaluteDevices / Moscow
A
Arkadiy Shatenov
SaluteDevices / Moscow
A
Alena Fenogenova
SaluteDevices / Moscow
N
Nikita Savushkin
SaluteDevices / Moscow
F
Fedor Minkin
SaluteDevices / Moscow