A systematic data characteristic understanding framework towards physical-sensor big data challenges

📅 2024-06-12
🏛️ Journal of Big Data
📈 Citations: 3
Influential: 0
🤖 AI Summary
Physical-sensor big data is inherently heterogeneous, sparse, and dynamic, which creates analytical bottlenecks in both timeliness and comprehensiveness. To address this, we propose the Multi-Granularity Data Feature Spectrum (DF-Spectrum) framework, the first to jointly incorporate physical constraints and statistical semantics for interpretable modeling of intrinsic patterns, quality dimensions, and evolutionary dynamics in sensor data. Methodologically, DF-Spectrum integrates physics-informed embedding, adaptive feature disentanglement, time-varying entropy-based measurement, and lightweight meta-feature distillation, enabling cross-device and cross-scenario quantification of data comparability as well as diagnostic root-cause attribution. Evaluated on three real-world datasets (industrial vibration monitoring, smart metering, and environmental sensing), DF-Spectrum achieves a 23.7% improvement in feature identification accuracy and localizes anomaly root causes 5.8× faster than state-of-the-art baselines.
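
To make the phrase "time-varying entropy-based measurement" more concrete, the following is a minimal sketch of one common way to track how the information content of a sensor stream changes over time: Shannon entropy computed over sliding windows. This is an illustration only, not the paper's method. The function names (`shannon_entropy`, `sliding_entropy`), the equal-width binning, and the default window, step, and value-range parameters are all assumptions made for this example.

```python
import math
from collections import Counter


def shannon_entropy(values, n_bins=8, lo=0.0, hi=1.0):
    """Shannon entropy (in bits) of `values` after equal-width binning on [lo, hi].

    Hypothetical helper: the binning scheme and range are assumptions,
    not taken from the summarized paper.
    """
    if not values:
        return 0.0
    width = (hi - lo) / n_bins
    # Map each reading to a bin index, clamping out-of-range readings.
    counts = Counter(
        min(n_bins - 1, max(0, int((v - lo) / width))) for v in values
    )
    total = len(values)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def sliding_entropy(series, window=16, step=8, **kwargs):
    """Entropy of each sliding window, giving a coarse time-varying profile."""
    return [
        shannon_entropy(series[i:i + window], **kwargs)
        for i in range(0, len(series) - window + 1, step)
    ]


if __name__ == "__main__":
    # A flat signal followed by one that alternates between two levels.
    stream = [0.5] * 16 + [0.1, 0.9] * 8
    print(sliding_entropy(stream, window=16, step=16))
```

Under these assumptions, a window of constant readings yields zero entropy, while a window split evenly between two bins yields one bit, so a rise in the per-window value flags a change in signal behavior that could be examined further.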

Problem

Research questions and friction points this paper is trying to address.

Big Data
Physical Sensors
Data Analysis Challenges

Innovation

Methods, ideas, or system contributions that make the work stand out.

6Vs Model
Sensor Big Data Analysis
Time-related Data Characteristics