🤖 AI Summary
Existing camera-controllable video generation methods are constrained by static-scene datasets (e.g., RealEstate10K) and relative-scale camera poses, which limits their ability to model realistic object motion and precise camera trajectories and leaves them without metric-scale geometric consistency. To address this, we introduce the first open-source, high-resolution dynamic-scene video dataset, comprising over 1,200 4K (3840×2160) video sequences, each annotated with centimeter-accurate, pixel-aligned, metric-scale camera trajectories. Our hybrid annotation pipeline integrates multi-view geometric calibration, SLAM-based optimization, and manual refinement, enabling pixel-aligned metric-scale camera motion annotation in dynamic scenes for the first time. In evaluations with state-of-the-art video generation models, the dataset significantly improves geometric fidelity, the physical plausibility of object motion, and the accuracy of synthesized camera trajectories.
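To make "pixel-aligned, metric-scale camera trajectories" concrete, here is a minimal loading sketch. The file layout and key names (`intrinsics`, `frames`, `w2c`) are hypothetical illustrations for exposition, not the dataset's actual schema:

```python
import json
import numpy as np

def load_camera_trajectory(annotation_path):
    """Load per-frame metric-scale camera poses from a JSON annotation.

    NOTE: the keys "intrinsics", "frames", and "w2c" are assumptions
    for illustration, not the dataset's published schema.
    """
    with open(annotation_path) as f:
        ann = json.load(f)

    # 3x3 pinhole intrinsics in pixels, shared across the sequence.
    K = np.asarray(ann["intrinsics"], dtype=np.float64).reshape(3, 3)

    # Per-frame 4x4 world-to-camera extrinsics. Because the trajectory
    # is metric-scale, translations are in meters, so camera speed and
    # scene dimensions can be read off the poses directly.
    w2c = np.stack([np.asarray(fr["w2c"], dtype=np.float64).reshape(4, 4)
                    for fr in ann["frames"]])
    return K, w2c
```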
📝 Abstract
Recent advances in camera-controllable video generation have been constrained by reliance on static-scene datasets with relative-scale camera annotations, such as RealEstate10K. While these datasets enable basic viewpoint control, they fail to capture dynamic scene interactions and lack the metric-scale geometric consistency that is critical for synthesizing realistic object motion and precise camera trajectories in complex environments. To bridge this gap, we introduce the first fully open-source, high-resolution dynamic-scene dataset with metric-scale camera annotations, available at https://github.com/ZGCTroy/RealCam-Vid.
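The practical gap between relative-scale and metric-scale annotations is a single unknown global factor: structure-from-motion pipelines of the kind behind RealEstate10K recover camera trajectories only up to scale. The sketch below, a hypothetical illustration rather than code from this repository, shows the closed-form least-squares estimate of that factor when a metric reference trajectory is available, assuming the two trajectories are already rotationally aligned and frame-synchronized:

```python
import numpy as np

def align_scale(t_rel, t_metric):
    """Estimate the global scale s minimizing ||s * t_rel - t_metric||^2.

    t_rel:    (N, 3) camera positions from an up-to-scale reconstruction.
    t_metric: (N, 3) corresponding metric-scale positions in meters.
    Both trajectories are centered first so translation offsets cancel;
    rotational alignment is assumed to have been handled already.
    """
    a = t_rel - t_rel.mean(axis=0)
    b = t_metric - t_metric.mean(axis=0)
    # Closed-form least-squares solution: s = <a, b> / <a, a>.
    return float((a * b).sum() / (a * a).sum())
```

With metric-scale annotations, this calibration step becomes unnecessary: generated camera motion can be conditioned on and evaluated against trajectories whose units are physically meaningful.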