Path-Decoupled Hyperbolic Flow Matching for Few-Shot Adaptation

📅 2026-02-24
🤖 AI Summary
This work addresses the challenge of few-shot cross-modal adaptation, where Euclidean flow matching suffers from entangled feature trajectories due to its flat geometry, hindering effective alignment between visual and semantic distributions. To overcome this limitation, the paper introduces hyperbolic geometry for the first time in this context and proposes a centripetal hyperbolic alignment mechanism. By constructing a text-anchored hierarchical structure on the Lorentz manifold and designing class-specific geodesic corridors to constrain trajectory evolution, the method enables ordered and disentangled cross-modal transport. An adaptive diameter stopping strategy is further introduced to prevent over-transportation. Evaluated on 11 benchmarks, the approach achieves new state-of-the-art results, significantly outperforming existing Euclidean flow matching methods and demonstrating the superiority of hyperbolic space for few-shot cross-modal alignment.

📝 Abstract
Recent advances in cross-modal few-shot adaptation treat visual-semantic alignment as a continuous feature transport problem via Flow Matching (FM). However, we argue that Euclidean-based FM overlooks fundamental limitations of flat geometry, where polynomial volume growth fails to accommodate diverse feature distributions, leading to severe path entanglement. To this end, we propose path-decoupled Hyperbolic Flow Matching (HFM), which leverages the Lorentz manifold's exponential expansion for trajectory decoupling. HFM structures the transport via two key designs: 1) Centripetal hyperbolic alignment: it constructs a centripetal hierarchy by anchoring textual roots, which pushes visual leaves to the boundary to initialize orderly flows. 2) Path-decoupled objective: it acts as a "semantic guardrail," rigidly confining trajectories within isolated class-specific geodesic corridors via step-wise supervision. Furthermore, we devise an adaptive diameter-based stopping strategy that prevents over-transportation into the crowded origin, based on the intrinsic semantic scale. Extensive ablations on 11 benchmarks show that HFM establishes a new state-of-the-art, consistently outperforming its Euclidean counterparts. Our code and models will be released.
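For readers unfamiliar with the Lorentz (hyperboloid) model that the abstract builds on, the following is a minimal NumPy sketch of its standard operations at curvature -1: lifting a Euclidean feature onto the manifold, measuring geodesic distance, and walking along the geodesic between two points. It is not the authors' implementation. The function names (`lorentz_inner`, `expmap0`, `distance`, `geodesic`) are illustrative assumptions, and nothing here reproduces HFM's alignment, corridor objective, or stopping rule.

```python
import numpy as np

def lorentz_inner(x, y):
    """Minkowski inner product <x, y>_L = -x0*y0 + sum_i xi*yi."""
    return -x[..., 0] * y[..., 0] + np.sum(x[..., 1:] * y[..., 1:], axis=-1)

def expmap0(v):
    """Lift a Euclidean vector v (a tangent vector at the origin) onto the
    hyperboloid {x : <x, x>_L = -1, x0 > 0}. Curvature -1 is an assumption."""
    norm = np.maximum(np.linalg.norm(v, axis=-1, keepdims=True), 1e-12)
    time = np.cosh(norm)               # time-like coordinate x0
    space = np.sinh(norm) * v / norm   # space-like coordinates
    return np.concatenate([time, space], axis=-1)

def distance(x, y):
    """Geodesic distance d(x, y) = arccosh(-<x, y>_L)."""
    return np.arccosh(np.clip(-lorentz_inner(x, y), 1.0, None))

def geodesic(x, y, t):
    """Point at fraction t in [0, 1] along the geodesic from x to y."""
    d = distance(x, y)
    if d < 1e-12:
        return x
    return (np.sinh((1.0 - t) * d) * x + np.sinh(t * d) * y) / np.sinh(d)

# Usage: two toy 2-D features lifted to the hyperboloid (values are made up).
origin = np.array([1.0, 0.0, 0.0])
a = expmap0(np.array([0.3, 0.4]))
b = expmap0(np.array([-0.5, 0.2]))
midpoint = geodesic(a, b, 0.5)
print("distance(a, b):", distance(a, b))
print("midpoint on manifold:", np.isclose(lorentz_inner(midpoint, midpoint), -1.0))
```

In this model, a point's distance from the origin equals the Euclidean norm of the vector it was lifted from, so features with larger norms sit farther out, where the available volume grows exponentially with radius rather than polynomially as in Euclidean space.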
Problem

Research questions and friction points this paper is trying to address.

few-shot adaptation
flow matching
path entanglement
Euclidean geometry
feature transport
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hyperbolic Geometry
Flow Matching
Few-Shot Adaptation
Path Decoupling
Cross-Modal Alignment
Authors
Lin Li
The Hong Kong University of Science and Technology (HKUST)
Ziqi Jiang
The Hong Kong University of Science and Technology (HKUST)
Gefan Ye
Zhejiang University
Zhenqi He
The Hong Kong University of Science and Technology (HKUST) | The University of Hong Kong (HKU)
Open-World Learning, Computer Vision, Multi-Modal Learning
Jiahui Li
Zhejiang University
MARL, Explainable AI, RLHF
Jun Xiao
Zhejiang University
Kwang-Ting Cheng
The Hong Kong University of Science and Technology (HKUST)
Long Chen
The Hong Kong University of Science and Technology (HKUST)