- GeneAgent: AI agent for gene set analysis, Nature Methods, 2025
- Matching Patients to Clinical Trials with LLMs, Nat Comm, 2024
- Tracking genetic variants in biomedical literature using LitVar 2.0, Nat Genet, 2023
- LitCovid: Keep up with the latest coronavirus research, Nature 2020
- How user intelligence is improving PubMed, Nature Biotechnology 2018
- Tools developed:
- TrialGPT: LLM for patient-to-trial matching
- MedCPT: foundation models for embedding bio-texts
- DeepSeeNet: automated AMD diagnosis & prognosis
- PubTator: Automated concept annotation for full-text articles
- TeamTat: a collaborative text/corpus annotation tool
- LitSuggest: a system for literature recommendation and curation
- LitSense: Making sense of biomedical literature at sentence level
- LitVar: a semantic literature search engine for genomic variants
- Organized events:
- Text Mining COSI at ISMB 2025
- Workshop on Generative AI and LLMs at PSB 2025
- Editor of 2024 JAMIA special issue on LLMs in Biomedicine
- Released ChestX-ray14 dataset and received 2017 NIH Clinical Center Director's Award
Research Experience
- Senior Investigator, National Institutes of Health (NIH)/National Library of Medicine (NLM)
- Deputy Director for Literature Search, National Center for Biotechnology Information (NCBI)
- Adjunct Professor of Computer Science, University of Illinois Urbana-Champaign (UIUC)
Education
PhD, specific school and advisor information not provided.
Background
Research interests include natural language processing, biomedical informatics, etc. Currently serving as a Senior Investigator at the National Institutes of Health (NIH)/National Library of Medicine (NLM), and as an Adjunct Professor of Computer Science at the University of Illinois Urbana-Champaign (UIUC).
Miscellany
Interested in recruiting postdocs and students. Please email for more information.