π€ AI Summary
To address subjectivity and label noise in stance detection arising from annotator background discrepancies, this paper proposes a perspective-aware multi-source soft-labeling framework. Methodologically, we introduce the first dual-source collaborative annotation scheme integrating human experts and large language models, explicitly modeling annotation diversity and stance subjectivity, and design an uncertainty-aware multi-perspective ensemble classifier. Our contributions are threefold: (1) We release the first stance detection dataset featuring both human and AI annotations alongside fine-grained perspective metadata; (2) Our soft-label learning approach achieves significant F1-score improvements across multiple benchmarks; (3) The model produces better-calibrated confidence scores, jointly optimizing classification accuracy and decision accountability. Experiments demonstrate that explicit diversity modeling enhances both robustness and ethical compliance.
π Abstract
Prior studies show that adopting the annotation diversity shaped by different backgrounds and life experiences and incorporating them into the model learning, i.e. multi-perspective approach, contribute to the development of more responsible models. Thus, in this paper we propose a new framework for designing and further evaluating perspective-aware models on stance detection task,in which multiple annotators assign stances based on a controversial topic. We also share a new dataset established through obtaining both human and LLM annotations. Results show that the multi-perspective approach yields better classification performance (higher F1-scores), outperforming the traditional approaches that use a single ground-truth, while displaying lower model confidence scores, probably due to the high level of subjectivity of the stance detection task.