Scholar
Luca Soldaini
Google Scholar ID: 3KPvwcgAAAAJ
Allen Institute for AI
Large Language Models
Open Source AI
Information Retrieval
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
4,323
H-index
29
i10-index
53
Publications
20
Co-authors
8
list available
Contact
Email
luca@soldaini.net
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
28 items
Olmo Hybrid: From Theory to Practice and Back
2026
Cited
0
Olmix: A Framework for Data Mixing Throughout LM Development
2026
Cited
0
Overview of the TREC 2025 RAGTIME Track
2026
Cited
0
How2Everything: Mining the Web for How-To Procedures to Evaluate and Improve LLMs
2026
Cited
0
NeuCLIRTech: Chinese Monolingual and Cross-Language Information Retrieval Evaluation in a Challenging Domain
2026
Cited
0
Bolmo: Byteifying the Next Generation of Language Models
2025
Cited
0
Olmo 3
2025
Cited
0
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
2025
Cited
0
Load more
Resume (English only)
Academic Achievements
Olmo project received two Best Paper Awards at ACL 2024
Led the release of fully open Olmo 2 models (7B, 13B, 32B)
Launched Tülu 3 post-training pipeline and Molmo multimodal models
Developed olmOCR, a high-performance toolkit for PDF text extraction
Created predictive techniques and benchmarks to characterize LLM behavior during pretraining
Research Experience
Co-leads the data team for Ai2’s Olmo project with Kyle Lo
Develops adaptation recipes for LLMs, including the Tülu 3 post-training pipeline (supporting models up to 405B parameters)
Contributed to the Molmo family of open multimodal AI models
Co-developed tools for analyzing and improving LLM pipelines: AboutMe, WIMBD, WebOrganizer, and olmOCR
Investigated LLM-retrieval system interfaces; co-proposed FollowIR with Orion Weller, later extended to multilingual settings
Collaborated on OpenSciLLM, an end-to-end demo for literature-grounded scientific synthesis using LLMs
Miscellany
Enjoys brewing espresso and going on runs
Dreams about utopian mass transit systems
Curates a growing collection of laptop stickers
Spends time with his handsome cat
Believes raccoons are the best
Co-authors
8 total
Kyle Lo
Allen Institute for AI
Arman Cohan
Yale University; Allen Institute for AI
Nazli Goharian
Georgetown University
Andrew Yates
Johns Hopkins University, Human Language Technology Center of Excellence
Hannaneh Hajishirzi
University of Washington; Allen AI
Noah A. Smith
University of Washington; Allen Institute for Artificial Intelligence
Alessandro Moschitti
Principal Scientist at Amazon Alexa
Eugene Yang
Research Scientist, Johns Hopkins University, Human Language Technology Center of Excellence
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up