Quality-Aware Framework for Video-Derived Respiratory Signals

📅 2025-12-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Video-based respiratory rate (RR) estimation suffers from unreliable performance due to unstable signal quality. To address this, we propose the first segment-level quality-aware predictive fusion framework. It simultaneously extracts ten heterogeneous signals—including facial remote photoplethysmography (rPPG), torso motion, and deep learning features—and jointly applies four spectral estimation algorithms (Welch, MUSIC, FFT, and peak detection) to construct a supervised quality prediction model that dynamically generates segment-level quality indices. These indices drive machine learning–based adaptive signal selection and weighted fusion. The method significantly enhances signal robustness and cross-dataset generalizability. Evaluated on three benchmark datasets—OMuSense-23, COHFACE, and MAHNOB-HCI—it consistently achieves lower RR estimation errors than all individual signal sources and existing fusion approaches.

Technology Category

Application Category

📝 Abstract
Video-based respiratory rate (RR) estimation is often unreliable due to inconsistent signal quality across extraction methods. We present a predictive, quality-aware framework that integrates heterogeneous signal sources with dynamic assessment of reliability. Ten signals are extracted from facial remote photoplethysmography (rPPG), upper-body motion, and deep learning pipelines, and analyzed using four spectral estimators: Welch's method, Multiple Signal Classification (MUSIC), Fast Fourier Transform (FFT), and peak detection. Segment-level quality indices are then used to train machine learning models that predict accuracy or select the most reliable signal. This enables adaptive signal fusion and quality-based segment filtering. Experiments on three public datasets (OMuSense-23, COHFACE, MAHNOB-HCI) show that the proposed framework achieves lower RR estimation errors than individual methods in most cases, with performance gains depending on dataset characteristics. These findings highlight the potential of quality-driven predictive modeling to deliver scalable and generalizable video-based respiratory monitoring solutions.
Problem

Research questions and friction points this paper is trying to address.

Unreliable video-based respiratory rate estimation due to inconsistent signal quality
Integrates heterogeneous signal sources with dynamic reliability assessment
Enables adaptive signal fusion and quality-based filtering for improved accuracy
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates heterogeneous signals with dynamic quality assessment
Uses machine learning to predict accuracy and select reliable signals
Enables adaptive signal fusion and quality-based segment filtering
🔎 Similar Papers
No similar papers found.
N
Nhi Nguyen
Center for Machine Vision and Signal Analysis (CMVS), University of Oulu, Finland
Constantino Álvarez Casado
Constantino Álvarez Casado
Postdoctoral Researcher, University of Oulu
Computer VisionMachine LearningDeep LearningHuman SensingDigital Signal Processing
Le Nguyen
Le Nguyen
Research Scientist, Soar Technologies. PhD Student, Rochester Institute of Technology
Artificial IntelligenceNatural Language ProcessingData ScienceCyber Social Threats
M
Manuel Lage Cañellas
Center for Machine Vision and Signal Analysis (CMVS), University of Oulu, Finland
M
Miguel Bordallo López
Center for Machine Vision and Signal Analysis (CMVS), University of Oulu, Finland