SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision

📅 2026-03-25
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of optimizing 3D Gaussian splatting under severe camera misalignment, where the local support of Gaussian primitives leads to vanishing gradients and optimization failure. To overcome this, the authors propose shifting the optimization objective to the frequency domain by leveraging globally supported complex sinusoidal features—termed spectral moments—to construct a global attraction basin. A physics-informed frequency annealing strategy is introduced to mitigate gradient vanishing while avoiding high-frequency periodic local minima. By integrating spectral moment supervision with differentiable rendering, the method enables robust tracking of complex deformations. Experiments demonstrate that, even with severely misaligned initializations, the approach consistently outperforms conventional appearance-based tracking methods across diverse deformation parameterizations.

Technology Category

Application Category

📝 Abstract
3D Gaussian Splatting (3DGS) enables real-time, photorealistic novel view synthesis, making it a highly attractive representation for model-based video tracking. However, leveraging the differentiability of the 3DGS renderer "in the wild" remains notoriously fragile. A fundamental bottleneck lies in the compact, local support of the Gaussian primitives. Standard photometric objectives implicitly rely on spatial overlap; if severe camera misalignment places the rendered object outside the target's local footprint, gradients strictly vanish, leaving the optimizer stranded. We introduce SpectralSplats, a robust tracking framework that resolves this "vanishing gradient" problem by shifting the optimization objective from the spatial to the frequency domain. By supervising the rendered image via a set of global complex sinusoidal features (Spectral Moments), we construct a global basin of attraction, ensuring that a valid, directional gradient toward the target exists across the entire image domain, even when pixel overlap is completely nonexistent. To harness this global basin without introducing periodic local minima associated with high frequencies, we derive a principled Frequency Annealing schedule from first principles, gracefully transitioning the optimizer from global convexity to precise spatial alignment. We demonstrate that SpectralSplats acts as a seamless, drop-in replacement for spatial losses across diverse deformation parameterizations (from MLPs to sparse control points), successfully recovering complex deformations even from severely misaligned initializations where standard appearance-based tracking catastrophically fails.
Problem

Research questions and friction points this paper is trying to address.

vanishing gradient
3D Gaussian Splatting
video tracking
camera misalignment
photometric loss
Innovation

Methods, ideas, or system contributions that make the work stand out.

Spectral Moments
3D Gaussian Splatting
Frequency Annealing
Differentiable Tracking
Global Optimization
🔎 Similar Papers
No similar papers found.
A
Avigail Cohen Rimon
Technion - Israel Institute of Technology
A
Amir Mann
Technion - Israel Institute of Technology
M
Mirela Ben Chen
Technion - Israel Institute of Technology
Or Litany
Or Litany
Technion, NVIDIA
Computer VisionArtificial IntelligenceDeep LearningAI