LeMoRe: Learn More Details for Lightweight Semantic Segmentation

📅 2025-05-29

📈 Citations: 0

✨ Influential: 0

career value

173K/year

🤖 AI Summary

Lightweight semantic segmentation faces the fundamental challenge of balancing representational capacity with computational efficiency, as existing approaches are constrained by rigid architectures and implicit modeling—often relying on computationally expensive vision transformers. To address this, we propose an explicit-implicit collaborative modeling paradigm: (1) explicit Cartesian-directional views to incorporate geometric priors for long-range contextual modeling, and (2) a nested attention mechanism enabling efficient multi-scale contextual dependency capture with minimal parameters. Integrated with lightweight feature interaction and multi-scale aggregation, our model achieves state-of-the-art accuracy on ADE20K and Cityscapes while reducing FLOPs by 38% and model parameters by 52% compared to prior arts. This yields an optimal trade-off between real-time inference capability and segmentation quality.

Technology Category

Application Category

📝 Abstract

Lightweight semantic segmentation is essential for many downstream vision tasks. Unfortunately, existing methods often struggle to balance efficiency and performance due to the complexity of feature modeling. Many of these existing approaches are constrained by rigid architectures and implicit representation learning, often characterized by parameter-heavy designs and a reliance on computationally intensive Vision Transformer-based frameworks. In this work, we introduce an efficient paradigm by synergizing explicit and implicit modeling to balance computational efficiency with representational fidelity. Our method combines well-defined Cartesian directions with explicitly modeled views and implicitly inferred intermediate representations, efficiently capturing global dependencies through a nested attention mechanism. Extensive experiments on challenging datasets, including ADE20K, CityScapes, Pascal Context, and COCO-Stuff, demonstrate that LeMoRe strikes an effective balance between performance and efficiency.

Problem

Research questions and friction points this paper is trying to address.

Balancing efficiency and performance in lightweight semantic segmentation

Overcoming rigid architectures and implicit representation learning limitations

Synergizing explicit and implicit modeling for computational efficiency

Innovation

Methods, ideas, or system contributions that make the work stand out.

Synergizes explicit and implicit feature modeling

Uses nested attention for global dependencies

Combines Cartesian directions with multi-view modeling

🔎 Similar Papers

No similar papers found.