Published 'Assigning credit to scientific datasets using article citation networks' in Journal of Informetrics (SJR Q1, 2020)
Published 'Modeling citation worthiness by using attention-based bidirectional long short-term memory networks and interpretable models' in Scientometrics (SJR Q1, 2020)
Developed DATARANK algorithm to track and assign credit to datasets in citation networks
Proposed bi-LSTM-CRF network for dataset extraction from scientific publications
Built GotFunding, a grant recommendation system based on publication history
Designed large-scale author name disambiguation method based on approximate network structures, implemented with PySpark
Discovered rapid decay of linked resources in biomedical articles: most disappear within eight years (2019)
Published multiple papers at top conferences, including iConference, ASIS&T, and the International Conference on Computational Social Science
Contributed book chapter 'Finding datasets in publications: The Syracuse University approach' (SAGE Publications, 2022)
Co-authored arXiv preprint 'Predicting the longevity of resources shared in scientific publications' (arXiv:2203.12800, 2022)
Co-authored manuscript 'Determinants of diminishing returns on NIH-funded projects' (under review, 2024)