🤖 AI Summary
To address visual inconsistencies, such as mismatched noise, motion blur, and depth of field, between virtual content and a live video stream in augmented reality (AR), this paper proposes a real-time distortion modeling method that needs no explicit camera calibration step. The key idea is to run modern image restoration methods (denoising, motion deblurring, and depth-of-field removal) on the incoming video; the restored frames take the role of the clean reference that a calibration target would normally provide, yielding self-calibration. From the restored and original frames, the method instantly estimates noise, motion blur, and depth-of-field parameters, which can then auto-tune any black-box real-time simulation of these effects (e.g., the built-in post-processing effects of a game engine) without requiring the simulation to be differentiable. This removes both limitations of prior work, namely the explicit calibration step and the dependence on slow, specially tuned differentiable methods, and delivers fast, high-fidelity visual consistency between composited virtual content and the real video.
📝 Abstract
Real camera footage is subject to noise, motion blur (MB) and depth of field (DoF). In some applications these might be considered distortions to be removed, but in others it is important to model them, because simply removing them would be ineffective or would interfere with an aesthetic choice. In augmented reality applications where virtual content is composited into a live video feed, we can model noise, MB and DoF to make the virtual content visually consistent with the video. Existing methods for this typically suffer from two main limitations. First, they require a camera calibration step to relate a known calibration target to the specific camera's response. Second, existing work requires methods that can be (differentiably) tuned to the calibration, such as slow and specialized neural networks. We propose a method that estimates parameters for noise, MB and DoF instantly, which allows using off-the-shelf real-time simulation methods, e.g., from a game engine, when compositing augmented content. Our main idea is to unlock both features by showing how modern computer vision methods that remove noise, MB and DoF from the video stream can essentially provide self-calibration. This allows auto-tuning any black-box real-time noise+MB+DoF method to deliver fast, high-fidelity augmentation consistency.
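To make the self-calibration idea concrete, here is a minimal toy sketch for the noise component only: a denoised frame stands in for the clean reference that a calibration target would normally supply, and the residual between the original and denoised frames yields an instant noise-level estimate that could tune a black-box noise simulator. The function names, the crude box-filter denoiser, and the single-parameter (Gaussian sigma) noise model are illustrative assumptions, not the paper's actual pipeline.

```python
import numpy as np


def box_denoise(frame, k=5):
    # Crude stand-in for a modern learned denoiser: a k x k box filter.
    pad = k // 2
    padded = np.pad(frame, pad, mode="edge")
    out = np.zeros(frame.shape, dtype=np.float64)
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + frame.shape[0], dx:dx + frame.shape[1]]
    return out / (k * k)


def estimate_noise_sigma(frame, denoise=box_denoise):
    """Self-calibration for noise: the denoised frame plays the role of a
    calibration target, so the residual exposes the camera's noise level."""
    residual = frame.astype(np.float64) - denoise(frame)
    return float(residual.std())


# Demo on synthetic data: a flat gray frame with known Gaussian noise.
rng = np.random.default_rng(0)
clean = np.full((128, 128), 0.5)
noisy = clean + rng.normal(0.0, 0.05, clean.shape)
sigma_hat = estimate_noise_sigma(noisy)  # should recover roughly 0.05
```

The recovered `sigma_hat` would then be fed to whatever real-time grain/noise effect the compositing engine exposes; since only the estimated parameter crosses the boundary, the simulator itself never needs to be differentiable.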