Aircraft Trajectory Dataset Augmentation in Latent Space

📅 2025-06-09

📈 Citations: 0

✨ Influential: 0

career value

194K/year

🤖 AI Summary

To address the scarcity and severe class imbalance of aircraft trajectory data in air traffic management, this paper proposes ATRADA—a novel trajectory data augmentation framework. First, a Transformer encoder learns latent representations of trajectories. Second, in the dimensionality-reduced latent space, Principal Component Analysis (PCA) and Gaussian Mixture Models (GMM) are jointly applied to model the underlying trajectory distribution. Finally, a differentiable MLP decoder synthesizes semantically consistent and geometrically plausible trajectories. ATRADA is the first method to integrate latent-space modeling, PCA-GMM-based joint distribution fitting, and differentiable decoding for aviation trajectory augmentation. Experiments on conflict detection and landing time prediction demonstrate that ATRADA significantly outperforms baseline methods—including SMOTE, GANs, and VAEs—yielding substantial improvements in downstream model robustness and generalization performance.

Technology Category

Application Category

📝 Abstract

Aircraft trajectory modeling plays a crucial role in Air Traffic Management (ATM) and is important for various downstream tasks, including conflict detection and landing time prediction. Dataset augmentation through the addition of synthetically generated trajectory data is necessary to develop a more robust aircraft trajectory model and ensure that the trajectory dataset is sufficient and balanced. In this work, we propose a novel framework called ATRADA for aircraft trajectory dataset augmentation. In the proposed framework, a Transformer encoder learns the underlying patterns in the original trajectory dataset and converts each data point into a context vector in the learned latent space. The converted dataset in the latent space is projected into reduced dimensions using principal component analysis (PCA), and a Gaussian mixture model (GMM) is applied to fit the probability distribution of the data points in the reduced-dimensional space. Finally, new samples are drawn from the fitted GMM, the dimension of the samples is reverted to the original dimension, and they are decoded with a Multi-Layer Perceptron (MLP). Several experiments demonstrate that the framework effectively generates new, high-quality synthetic aircraft trajectory data, which were compared to the results of several baselines.

Problem

Research questions and friction points this paper is trying to address.

Enhancing aircraft trajectory dataset robustness via synthetic data

Learning latent space patterns for trajectory augmentation

Generating high-quality synthetic trajectories using GMM and MLP

Innovation

Methods, ideas, or system contributions that make the work stand out.

Transformer encoder learns trajectory patterns

PCA and GMM model latent space distribution

MLP decodes synthetic high-quality trajectory data

🔎 Similar Papers

Data augmentation with automated machine learning: approaches and performance comparison with classical data augmentation methods