🤖 AI Summary
Modeling complex cross-modal dependencies among multimodal protein features—such as sequences, 3D structures, interaction networks, and physicochemical properties—remains challenging, limiting functional annotation accuracy. To address this, we propose a dual-branch dynamic selection and reconstruction-based pretraining framework. Methodologically: (1) reconstruction-based pretraining captures low-level semantic representations; (2) a bidirectional interaction module (BInM) enables deep cross-modal feature fusion; and (3) a dynamic selection module (DSM) adaptively tailors feature representations to diverse functional categories. Integrated with a hierarchical multi-label classification strategy, our framework achieves state-of-the-art performance on the human protein dataset across all three Gene Ontology (GO) domains—Biological Process (BPO), Molecular Function (MFO), and Cellular Component (CCO)—outperforming existing methods. It significantly enhances fine-grained functional annotation accuracy and model generalizability.
📝 Abstract
Multimodal protein features play a crucial role in protein function prediction. However, these features encompass a wide range of information, ranging from structural data and sequence features to protein attributes and interaction networks, making it challenging to decipher their complex interconnections. In this work, we propose a multimodal protein function prediction method (DSRPGO) by utilizing dynamic selection and reconstructive pre-training mechanisms. To acquire complex protein information, we introduce reconstructive pre-training to mine more fine-grained information with low semantic levels. Moreover, we put forward the Bidirectional Interaction Module (BInM) to facilitate interactive learning among multimodal features. Additionally, to address the difficulty of hierarchical multi-label classification in this task, a Dynamic Selection Module (DSM) is designed to select the feature representation that is most conducive to current protein function prediction. Our proposed DSRPGO model improves significantly in BPO, MFO, and CCO on human datasets, thereby outperforming other benchmark models.