Augmenting biomedical named entity recognition with general-domain resources

📅 2024-06-15
🏛️ Journal of Biomedical Informatics
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Biomedical named entity recognition (NER) suffers from scarce, costly, and noisy annotations, leading to poor model generalization. To address this, we propose a few-shot NER method leveraging domain-agnostic pretraining corpora—specifically Wikipedia and BooksCorpus—to enhance weakly supervised learning. Our approach integrates multi-stage transfer learning, curriculum-driven domain-adaptive knowledge distillation, entity-type-aware prompt tuning, and consistency regularization. This enables effective knowledge transfer and robust pseudo-labeling without requiring large-scale high-quality annotations. Evaluated on standard benchmarks—including BC5CDR and JNLPBA—our method achieves absolute F1 improvements of 3.2–5.8% over strong baselines. Notably, using only 10% of the labeled data, it surpasses fully supervised models trained on the complete annotated corpus. The framework significantly reduces dependency on expert-curated labels while delivering scalability and robustness, establishing a new paradigm for low-resource biomedical NER.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Biomedical Named Entity Recognition
Data Annotation
Model Accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

GERBERA method
pre-training on general language data
fine-tuning on biomedical data
🔎 Similar Papers
No similar papers found.
Y
Yu Yin
Department of Computer Science, University of Liverpool, Liverpool L69 3DR, United Kingdom
Hyunjae Kim
Hyunjae Kim
Yale University
Natural Language ProcessingBiomedical InformaticsHealthcare
X
Xiao Xiao
Department of Computer Science, University of Liverpool, Liverpool L69 3DR, United Kingdom
Chih Hsuan Wei
Chih Hsuan Wei
Staff Scientist, NCBI (National Center for Biotechnology Information) / NLM / NIH
Natural language processingMachine learningDeep learningBioNLP
J
Jaewoo Kang
Department of Computer Science, Korea University, 145 Anam-ro, Seongbuk-gu, Seoul, 02841, Republic of Korea
Zhiyong Lu
Zhiyong Lu
Senior Investigator, NLM; Adjunct Professor of CS, UIUC
BioNLPBiomedical InformaticsMedical AIArtificial Intelligence
H
Hua Xu
Department of Biomedical Informatics and Data Science, School of Medicine, Yale University, New Haven, Connecticut, 06510, United States of America
Meng Fang
Meng Fang
University of Liverpool
Natural Language ProcessingReinforcement LearningAgentsArtificial intelligence
Qingyu Chen
Qingyu Chen
Biomedical Informatics & Data Science, Yale University; NCBI-NLM, National Institutes of Health
Text miningMachine learningData curationBioNLPMedical Imaging Analysis