A multi-stage augmented multimodal interaction network for fish feeding intensity quantification

📅 2025-06-17
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address low accuracy in assessing fish feeding intensity, inefficient multimodal fusion, and unreliable decision-making in recirculating aquaculture systems, this paper proposes the Multi-stage Augmented Inter-modal Network (MAINet). MAINet introduces two novel components: (1) an Auxiliary-modality Reinforced Primary-modality Mechanism (ARPM) that enhances dominant modalities via auxiliary ones, and (2) an Evidence Reasoning (ER)-based decision fusion framework. It integrates image, audio, and water-wave signals using Channel Attention Feature Normalization (CAFN) and Dual-modality Attention Feature Normalization (DAFN). Experiments demonstrate that MAINet achieves consistent performance across accuracy, precision, recall, and F1-score—ranging from 96.76% to 96.79%—significantly outperforming unimodal, bimodal, and conventional multimodal fusion approaches. Ablation studies validate that ARPM and the ER framework critically improve model robustness and feature utilization efficiency.

Technology Category

Application Category

📝 Abstract
In recirculating aquaculture systems, accurate and effective assessment of fish feeding intensity is crucial for reducing feed costs and calculating optimal feeding times. However, current studies have limitations in modality selection, feature extraction and fusion, and co-inference for decision making, which restrict further improvement in the accuracy, applicability and reliability of multimodal fusion models. To address this problem, this study proposes a Multi-stage Augmented Multimodal Interaction Network (MAINet) for quantifying fish feeding intensity. Firstly, a general feature extraction framework is proposed to efficiently extract feature information from input image, audio and water wave datas. Second, an Auxiliary-modality Reinforcement Primary-modality Mechanism (ARPM) is designed for inter-modal interaction and generate enhanced features, which consists of a Channel Attention Fusion Network (CAFN) and a Dual-mode Attention Fusion Network (DAFN). Finally, an Evidence Reasoning (ER) rule is introduced to fuse the output results of each modality and make decisions, thereby completing the quantification of fish feeding intensity. The experimental results show that the constructed MAINet reaches 96.76%, 96.78%, 96.79% and 96.79% in accuracy, precision, recall and F1-Score respectively, and its performance is significantly higher than the comparison models. Compared with models that adopt single-modality, dual-modality fusion and different decision-making fusion methods, it also has obvious advantages. Meanwhile, the ablation experiments further verified the key role of the proposed improvement strategy in improving the robustness and feature utilization efficiency of model, which can effectively improve the accuracy of the quantitative results of fish feeding intensity.
Problem

Research questions and friction points this paper is trying to address.

Accurate fish feeding intensity assessment in aquaculture systems
Improving multimodal fusion model accuracy and reliability
Enhancing feature extraction and decision-making for feeding quantification
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-stage Augmented Multimodal Interaction Network
Auxiliary-modality Reinforcement Primary-modality Mechanism
Evidence Reasoning rule for decision fusion
🔎 Similar Papers
No similar papers found.
S
Shulong Zhang
National Innovation Center for Digital Fishery, Beijing 100083, P.R. China; College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, P.R. China
M
Mingyuan Yao
National Innovation Center for Digital Fishery, Beijing 100083, P.R. China; College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, P.R. China
Jiayin Zhao
Jiayin Zhao
Tsinghua University
Computational Imaging
X
Xiao Liu
National Innovation Center for Digital Fishery, Beijing 100083, P.R. China; College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, P.R. China
H
Haihua Wang
National Innovation Center for Digital Fishery, Beijing 100083, P.R. China; Key Laboratory of Smart Farming Technologies for Aquatic Animal and Livestock, Ministry of Agriculture and Rural Affairs, Beijing 100083, P.R. China; Beijing Engineering and Technology Research Center for Internet of Things in Agriculture, Beijing 100083, P.R. China; College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, P.R. China