On Predicting Sociodemographics from Mobility Signals

📅 2025-11-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Predicting demographic attributes from mobile trajectory data faces challenges including weak attribute–mobility correlations and poor generalization across domains. To address these, we propose directed mobility graphs to encode high-order, structured movement patterns of individuals. We further design a multi-task learning framework that jointly predicts age, gender, income, and household composition, thereby enhancing cross-scenario and cross-temporal distributional robustness. Additionally, we introduce a confidence–accuracy-balanced visualization diagnostic tool to improve model interpretability and enable principled uncertainty quantification. Extensive experiments demonstrate that our approach significantly outperforms state-of-the-art baselines under data scarcity and distribution shift, achieving superior prediction accuracy and robustness. This work establishes a new paradigm for trustworthy utilization of passive mobility data in urban transportation planning.

Technology Category

Application Category

📝 Abstract
Inferring sociodemographic attributes from mobility data could help transportation planners better leverage passively collected datasets, but this task remains difficult due to weak and inconsistent relationships between mobility patterns and sociodemographic traits, as well as limited generalization across contexts. We address these challenges from three angles. First, to improve predictive accuracy while retaining interpretability, we introduce a behaviorally grounded set of higher-order mobility descriptors based on directed mobility graphs. These features capture structured patterns in trip sequences, travel modes, and social co-travel, and significantly improve prediction of age, gender, income, and household structure over baselines features. Second, we introduce metrics and visual diagnostic tools that encourage evenness between model confidence and accuracy, enabling planners to quantify uncertainty. Third, to improve generalization and sample efficiency, we develop a multitask learning framework that jointly predicts multiple sociodemographic attributes from a shared representation. This approach outperforms single-task models, particularly when training data are limited or when applying models across different time periods (i.e., when the test set distribution differs from the training set).
Problem

Research questions and friction points this paper is trying to address.

Predicting sociodemographics from mobility data with weak relationships
Improving model interpretability while maintaining predictive accuracy
Enhancing generalization across different contexts and time periods
Innovation

Methods, ideas, or system contributions that make the work stand out.

Higher-order mobility descriptors from directed graphs
Visual diagnostic tools for model confidence and accuracy
Multitask learning framework for shared representation across attributes
🔎 Similar Papers
No similar papers found.
E
Ekin Uugurel
Department of Civil and Environmental Engineering, University of Washington, Seattle, 98105, WA, USA
C
Cynthia Chen
Department of Civil and Environmental Engineering, University of Washington, Seattle, 98105, WA, USA
B
Brian H. Y. Lee
Puget Sound Regional Council (PSRC), Seattle, 98101, WA, USA
Filipe Rodrigues
Filipe Rodrigues
Technical University of Denmark (DTU)
Machine LearningReinforcement LearningIntelligent Transportation SystemsUrban Mobility