Scholar
Yihe Dong
Google Scholar ID: AjX6hisAAAAJ
Princeton University
Geometric deep learning
large language models.
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,453
H-index
12
i10-index
14
Publications
20
Co-authors
8
list available
Contact
No contact links provided.
Publications
3 items
Cache Me If You Can: How Many KVs Do You Need for Effective Long-Context LMs?
2025
Cited
0
Attention Retrieves, MLP Memorizes: Disentangling Trainable Components in the Transformer
2025
Cited
0
Metadata Conditioning Accelerates Language Model Pre-training
2025
Cited
0
Resume (English only)
Co-authors
8 total
Co-author 1
Piotr Indyk
Professor of Electrical Engineering and Computer Science, MIT
Co-author 3
Yoshua Bengio
Professor of computer science, University of Montreal, Mila, IVADO, CIFAR
Samuel B. Hopkins
Massachusetts Institute of Technology
Jerry Li
University of Washington
Hao Chen
Facebook
Co-author 8
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up