Towards Agnostic and Holistic Universal Image Segmentation with Bit Diffusion

📅 2026-01-06
🏛️ arXiv.org
🤖 AI Summary
This work proposes a universal image segmentation method that does not rely on mask-based frameworks; instead, a diffusion model predicts the full segmentation holistically. The key adaptations for this discrete setting are a location-aware palette with 2D Gray code ordering, a final tanh activation, and sigmoid loss weighting, which outperforms the alternatives regardless of prediction type; the authors settle on x-prediction. The model does not yet surpass leading mask-based architectures, but it narrows the performance gap and adds capabilities those models lack, such as principled ambiguity modeling. All models were trained from scratch, and the authors suggest that combining these improvements with large-scale pretraining or promptable conditioning could yield competitive models.
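
The bit-level encoding behind this kind of model can be illustrated with a minimal sketch. It assumes the standard binary-reflected Gray code over integer segment IDs and analog bits in {-1, +1}; it does not reproduce the paper's 2D ordering or its palette, and all function names are hypothetical.

```python
from typing import List


def gray_encode(n: int) -> int:
    """Binary-reflected Gray code of a non-negative integer."""
    return n ^ (n >> 1)


def gray_decode(g: int) -> int:
    """Invert gray_encode by XOR-ing all right shifts of g."""
    n = 0
    while g:
        n ^= g
        g >>= 1
    return n


def id_to_analog_bits(label_id: int, num_bits: int) -> List[float]:
    """Map a segment ID to analog bits in {-1.0, +1.0}, most significant bit first."""
    g = gray_encode(label_id)
    return [1.0 if (g >> k) & 1 else -1.0 for k in reversed(range(num_bits))]


def analog_bits_to_id(bits: List[float]) -> int:
    """Threshold real-valued bits at zero and decode back to a segment ID."""
    g = 0
    for b in bits:
        g = (g << 1) | (1 if b > 0.0 else 0)
    return gray_decode(g)
```

With this encoding, consecutive IDs differ in exactly one bit, and thresholding a real-valued prediction at zero recovers a discrete ID.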

📝 Abstract
This paper introduces a diffusion-based framework for universal image segmentation, making agnostic segmentation possible without depending on mask-based frameworks and instead predicting the full segmentation in a holistic manner. We present several key adaptations to diffusion models, which are important in this discrete setting. Notably, we show that a location-aware palette with our 2D Gray code ordering improves performance. Adding a final tanh activation function is crucial for discrete data. On optimizing diffusion parameters, the sigmoid loss weighting consistently outperforms alternatives, regardless of the prediction type used, and we settle on x-prediction. While our current model does not yet surpass leading mask-based architectures, it narrows the performance gap and introduces unique capabilities, such as principled ambiguity modeling, that these models lack. All models were trained from scratch, and we believe that combining our proposed improvements with large-scale pretraining or promptable conditioning could lead to competitive models.
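
To make the tanh, x-prediction, and sigmoid-weighting ingredients concrete, here is a minimal sketch. It assumes the weight takes the commonly used form sigmoid(bias - log-SNR) and multiplies a mean squared error on the x-prediction; the abstract does not give the exact formula, so the form, the `bias` parameter, and all function names are assumptions.

```python
import math
from typing import Sequence


def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def sigmoid_weight(log_snr: float, bias: float = 0.0) -> float:
    """Assumed weighting: larger weight at low log-SNR (noisier timesteps)."""
    return sigmoid(bias - log_snr)


def weighted_x_loss(
    x_true: Sequence[float],
    raw_output: Sequence[float],
    log_snr: float,
    bias: float = 0.0,
) -> float:
    """Weighted MSE between the clean target and a tanh-squashed x-prediction."""
    # A final tanh keeps the prediction inside (-1, 1), the range of analog bits.
    x_pred = [math.tanh(v) for v in raw_output]
    mse = sum((p - t) ** 2 for p, t in zip(x_pred, x_true)) / len(x_true)
    return sigmoid_weight(log_snr, bias) * mse
```

In this sketch `x_true` would hold the analog bits of the ground-truth segmentation and `raw_output` the network's pre-activation output at one noise level.
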
Problem

Research questions and friction points this paper is trying to address.

universal image segmentation
agnostic segmentation
holistic segmentation
diffusion models
discrete data modeling
Innovation

Methods, ideas, or system contributions that make the work stand out.

diffusion models
universal image segmentation
agnostic segmentation
discrete data modeling
ambiguity modeling
J. Christensen
Technical University of Denmark
M. Hannemose
Technical University of Denmark
Anders Bjorholm Dahl
Professor, Image Analysis, Technical University of Denmark
V. Dahl
Technical University of Denmark