VolTex: Food Volume Estimation using Text-Guided Segmentation and Neural Surface Reconstruction

📅 2025-06-03
🤖 AI Summary
Existing 3D food volume estimation methods have no text-driven object selection, so users cannot specify which food item to measure. This paper proposes the first text-driven food volume estimation framework. Given a food name, the system performs text-image cross-modal segmentation to localize the target instance, then applies NeRF-based neural surface reconstruction to generate a high-fidelity 3D mesh. Volume is computed from that mesh via discretization and voxelization. The core contribution is the coupling of text-guided segmentation with neural surface reconstruction for food volume estimation, which enables user-specified, instance-level 3D modeling. Evaluated on the MetaFood3D dataset, the method improves target isolation and surface reconstruction accuracy, reducing volume estimation error by 32.7% compared to prior approaches. The end-to-end pipeline targets real-world dietary analysis, where the food item to quantify is chosen by name.

📝 Abstract
Accurate food volume estimation is crucial for dietary monitoring, medical nutrition management, and food intake analysis. Existing 3D food volume estimation methods compute food volume accurately but offer no way to select specific food portions. We present VolTex, a framework that improves food object selection in food volume estimation. By allowing users to specify a target food item via text input, our method segments and precisely selects specific food objects in real-world scenes. The segmented object is then reconstructed using a neural surface reconstruction method to generate high-fidelity 3D meshes for volume computation. Extensive evaluations on the MetaFood3D dataset demonstrate the effectiveness of our approach in isolating and reconstructing food items for accurate volume estimation. The source code is accessible at https://github.com/GCVCG/VolTex.
Problem

Research questions and friction points this paper is trying to address.

Improves food object selection in volume estimation
Enables precise food segmentation via text input
Reconstructs 3D meshes for accurate volume computation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Text-guided segmentation for food selection
Neural Surface Reconstruction for 3D meshes
High-fidelity volume computation from segmentation
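
The final step, turning a reconstructed mesh into a volume, can be illustrated with a small sketch. It uses the signed-tetrahedron (divergence theorem) formula for a closed triangle mesh, a standard technique that is swapped in here for illustration and is not necessarily what VolTex implements. The function name mesh_volume and the input layout (a list of vertices plus a list of index triples) are assumptions made for this example, and the mesh is assumed to be watertight with consistently oriented faces.

```python
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]
Tri = Tuple[int, int, int]


def mesh_volume(vertices: Sequence[Vec3], faces: Sequence[Tri]) -> float:
    """Volume enclosed by a closed, consistently oriented triangle mesh.

    Hypothetical helper for illustration only; not taken from the VolTex code.
    """
    six_times_volume = 0.0
    for i, j, k in faces:
        ax, ay, az = vertices[i]
        bx, by, bz = vertices[j]
        cx, cy, cz = vertices[k]
        # Scalar triple product a . (b x c): six times the signed volume
        # of the tetrahedron spanned by the origin and the triangle.
        six_times_volume += (
            ax * (by * cz - bz * cy)
            + ay * (bz * cx - bx * cz)
            + az * (bx * cy - by * cx)
        )
    # The sign depends only on face orientation, so take the magnitude.
    return abs(six_times_volume) / 6.0


# Usage: a right tetrahedron with unit legs, which fills 1/6 of the unit cube.
tetra_vertices = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
tetra_faces = [(0, 2, 1), (0, 1, 3), (0, 3, 2), (1, 2, 3)]
print(mesh_volume(tetra_vertices, tetra_faces))
```

The result is in the cubed units of the mesh coordinates. If a reconstruction is only defined up to a length scale s, the physical volume is the computed value multiplied by s cubed, so an error in the scale is amplified in the volume estimate.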
Authors

Ahmad AlMughrabi
University of Barcelona
NeRF, 3D Reconstruction, Volume Estimation, Segmentation
Umair Haroon
PhD Student at University of Barcelona
Artificial Intelligence, Computer Vision, Action Recognition, Activity Recognition, Human Interaction Recognition
Ricardo Marques
Grup de Tecnologies Interactives (GTI), Universitat Pompeu Fabra (UPF), Spain
P. Radeva
Universitat de Barcelona, Spain; IMUB & Institut de Neurociencies, Barcelona