Published numerous papers, including 'NCBI Disease Corpus: A Resource for Disease Name Recognition and Concept Normalization' (with Robert Leaman and Zhiyong Lu), 'Author name disambiguation for PubMed' (with Wanli Liu et al.), and more.
Research Experience
Currently a Staff Scientist at NCBI, NLM, NIH. Involved in multiple research projects such as DNorm: Disease name normalization study, NCBI Disease Corpus, Medical Concepts Relations Study, etc.
Education
PhD in Computer Science, University of Maryland at College Park, USA, 2007.
Background
Research interests include construction of linguistic resources to support biological text mining, building tools that facilitate information retrieval and knowledge extraction from biomedical literature, and machine learning, data mining, feature generation.