🤖 AI Summary
Biomedical named entity recognition (BM-NER) faces challenges including terminological complexity, pervasive nested entity structures, and inconsistent annotations across datasets. To address these, we propose SRU-NER: a framework featuring a Slot-based Recurrent Unit (SRU) that explicitly models nested entities; multi-task learning to jointly train on heterogeneous biomedical and general-domain NER datasets; and a novel dynamic loss adjustment mechanism that automatically suppresses supervision signals for unannotated entity types—effectively mitigating negative transfer induced by annotation discrepancies. Crucially, SRU-NER enables unified modeling of both biomedical and general-domain NER. Experimental results demonstrate state-of-the-art or competitive performance across multiple benchmark datasets. Cross-domain evaluation and human assessment further validate its strong generalization capability and practical effectiveness.
📝 Abstract
Biomedical Named Entity Recognition presents significant challenges due to the complexity of biomedical terminology and inconsistencies in annotation across datasets. This paper introduces SRU-NER (Slot-based Recurrent Unit NER), a novel approach designed to handle nested named entities while integrating multiple datasets through an effective multi-task learning strategy. SRU-NER mitigates annotation gaps by dynamically adjusting loss computation to avoid penalizing predictions of entity types absent in a given dataset. Through extensive experiments, including a cross-corpus evaluation and human assessment of the model's predictions, SRU-NER achieves competitive performance in biomedical and general-domain NER tasks, while improving cross-domain generalization.