🤖 AI Summary
This work addresses the challenge of reconstructing complete, high-fidelity 3D geometry of everyday objects from low-quality, incompletely covered, and poorly calibrated consumer-grade multi-view videos, without requiring professional 3D scanners. The method leverages a single user-provided 2D point correspondence as weak supervision, integrating Structure-from-Motion, multi-view stereo matching, and neural rendering to achieve robust cross-video multi-view alignment and dense reconstruction. Crucially, it introduces multi-video collaborative modeling to mitigate occlusion-induced holes inherent in single-video reconstructions, and supports spatial registration via AR markers or checkerboard patterns. Experiments demonstrate stable reconstruction of watertight, high-fidelity meshes from unstructured smartphone videos. The approach exhibits strong generalization across diverse objects and scenes, and the code is publicly released, underscoring its practical utility for real-world 3D content creation.
📝 Abstract
How can we extract complete geometric models of objects that we encounter in our daily life, without having access to commercial 3D scanners? In this paper we present an automated system for generating geometric models of objects from two or more videos. Our system requires the specification of one known point in at least one frame of each video, which can be automatically determined using a fiducial marker such as a checkerboard or Augmented Reality (AR) marker. The remaining frames are automatically positioned in world space by using Structure-from-Motion techniques. By using multiple videos and merging results, a complete object mesh can be generated, without having to rely on hole filling. Code for our system is available from https://github.com/FlorisE/NeuralMeshing.