Scholar
Soufiane Hayou
Google Scholar ID: JBb5zekAAAAJ
Assistant Professor, Johns Hopkins
AI
Deep Learning
Hyperparameters
Scaling
Stochastic processes
Citations & Impact
All-time
Citations
1,218
H-index
15
i10-index
19
Publications
20
Co-authors
15
list available
Contact
No contact links provided.
Publications
7 items
$\mu$pscaling small models: Principled warm starts and hyperparameter transfer
2026
Cited
0
Learning Rate Scaling across LoRA Ranks and Transfer to Full Finetuning
2026
Cited
0
A Proof of Learning Rate Transfer under $\mu$P
2025
Cited
0
PLoP: Precise LoRA Placement for Efficient Finetuning of Large Models
2025
Cited
0
Optimal Embedding Learning Rate in LLMs: The Effect of Vocabulary Size
2025
Cited
0
On the Stability of the Jacobian Matrix in Deep Neural Networks
2025
Cited
0
Visualising Feature Learning in Deep Neural Networks by Diagonalizing the Forward Feature Map
arXiv.org · 2024
Cited
1
Co-authors
15 total
Arnaud Doucet
Google DeepMind
Bin Yu
Professor of Statistics and EECS, UC Berkeley
Jean-Francois Ton
ByteDance Seed
Yee Whye Teh
Professor of Statistical Machine Learning, Oxford; Research Scientist, DeepMind
Chris Mingard
DPhil student, University of Oxford