Training Machine Learning Models on Human Spatio-temporal Mobility Data: An Experimental Study [Experiment Paper]

📅 2025-08-18

📈 Citations: 0

✨ Influential: 0

career value

213K/year

🤖 AI Summary

This work addresses the underexplored problem of long-term (multi-day to multi-week) full-trajectory prediction in human mobility modeling. Methodologically, we propose a macro-level lifestyle-pattern-aware trajectory forecasting framework that integrates weekday semantic encoding with user semantic clustering and hierarchical sampling to mitigate data skewness; employs a hybrid LSTM-Transformer architecture to jointly model user-specific historical behavior; and adopts mini-batch stochastic gradient optimization for robust training. Experimental results demonstrate that incorporating lifestyle-pattern semantics significantly improves prediction accuracy; semantic clustering and hierarchical sampling enhance model generalization; and mini-batch optimization exhibits superior robustness under data-scarce conditions. Our approach establishes a novel, interpretable, and scalable paradigm for long-horizon individual trajectory forecasting.

Technology Category

Application Category

📝 Abstract

Individual-level human mobility prediction has emerged as a significant topic of research with applications in infectious disease monitoring, child, and elderly care. Existing studies predominantly focus on the microscopic aspects of human trajectories: such as predicting short-term trajectories or the next location visited, while offering limited attention to macro-level mobility patterns and the corresponding life routines. In this paper, we focus on an underexplored problem in human mobility prediction: determining the best practices to train a machine learning model using historical data to forecast an individuals complete trajectory over the next days and weeks. In this experiment paper, we undertake a comprehensive experimental analysis of diverse models, parameter configurations, and training strategies, accompanied by an in-depth examination of the statistical distribution inherent in human mobility patterns. Our empirical evaluations encompass both Long Short-Term Memory and Transformer-based architectures, and further investigate how incorporating individual life patterns can enhance the effectiveness of the prediction. We show that explicitly including semantic information such as day-of-the-week and user-specific historical information can help the model better understand individual patterns of life and improve predictions. Moreover, since the absence of explicit user information is often missing due to user privacy, we show that the sampling of users may exacerbate data skewness and result in a substantial loss in predictive accuracy. To mitigate data imbalance and preserve diversity, we apply user semantic clustering with stratified sampling to ensure that the sampled dataset remains representative. Our results further show that small-batch stochastic gradient optimization improves model performance, especially when human mobility training data is limited.

Problem

Research questions and friction points this paper is trying to address.

Training ML models to predict complete individual mobility trajectories

Exploring macro-level mobility patterns and life routines in predictions

Addressing data skewness and privacy in mobility prediction models

Innovation

Methods, ideas, or system contributions that make the work stand out.

LSTM and Transformer models for trajectory prediction

User semantic clustering with stratified sampling

Small-batch stochastic gradient optimization technique

🔎 Similar Papers

Human Mobility Modeling with Limited Information via Large Language Models