Scholar
Jan Ludziejewski
Google Scholar ID: YihTUGQAAAAJ
Mistral AI
scaling laws
mixture of experts
large language models
Citations & Impact
All-time
Citations: 226
H-index: 6
i10-index: 5
Publications: 15
Co-authors: 4 (list available)
Publications
6 items
Ministral 3 (2026), cited by 4
$μ$-Parametrization for Mixture of Experts (2025), cited by 0
Decoupled Relative Learning Rate Schedules (2025), cited by 0
Projected Compression: Trainable Projection for Efficient Transformer Compression (2025), cited by 0
Magistral (2025), cited by 0
Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient (2025), cited by 0
Co-authors
4 total
Marek Cygan, University of Warsaw
Maciej Pióro, PhD Student, Polish Academy of Sciences / IDEAS NCBR
Sebastian Jaszczur, Anthropic (past: IDEAS, University of Warsaw)
Jakub Krajewski, PhD Student, University of Warsaw, IDEAS NCBR