Importance-Weighted Non-IID Sampling for Flow Matching Models

📅 2025-11-21

📈 Citations: 0

✨ Influential: 0

career value

185K/year

🤖 AI Summary

Under limited sampling budgets, expectation estimation in flow matching models suffers from high variance due to rare, high-impact events under independent sampling. This work proposes an unbiased importance-weighted non-i.i.d. sampling framework—the first to integrate importance weighting into the flow matching generative process. Our method learns a residual velocity field guided by the score function, jointly reconstructing the target marginal distribution and estimating sample importance weights via diversity regularization. A score-based regularization term further enforces moderate separation of samples in high-density regions, mitigating off-manifold drift. Experiments demonstrate that the approach preserves estimator unbiasedness while significantly improving sample diversity and quality. Consequently, it yields more accurate and robust expectation estimates, enhancing both interpretability and reliability of flow matching model outputs.

Technology Category

Application Category

📝 Abstract

Flow-matching models effectively represent complex distributions, yet estimating expectations of functions of their outputs remains challenging under limited sampling budgets. Independent sampling often yields high-variance estimates, especially when rare but with high-impact outcomes dominate the expectation. We propose an importance-weighted non-IID sampling framework that jointly draws multiple samples to cover diverse, salient regions of a flow's distribution while maintaining unbiased estimation via estimated importance weights. To balance diversity and quality, we introduce a score-based regularization for the diversity mechanism, which uses the score function, i.e., the gradient of the log probability, to ensure samples are pushed apart within high-density regions of the data manifold, mitigating off-manifold drift. We further develop the first approach for importance weighting of non-IID flow samples by learning a residual velocity field that reproduces the marginal distribution of the non-IID samples. Empirically, our method produces diverse, high-quality samples and accurate estimates of both importance weights and expectations, advancing the reliable characterization of flow-matching model outputs. Our code will be publicly available on GitHub.

Problem

Research questions and friction points this paper is trying to address.

Estimating expectations of flow-matching model outputs under limited sampling budgets

Reducing high variance in estimates caused by rare high-impact outcomes

Balancing diversity and quality in non-IID sampling while maintaining unbiased estimation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Importance-weighted non-IID sampling for unbiased estimation

Score-based regularization ensures diversity within high-density regions

Residual velocity field learning for importance weight estimation

🔎 Similar Papers

Elucidating the Design Choice of Probability Paths in Flow Matching for Forecasting