Hierarchical Flow Diffusion for Efficient Frame Interpolation

2025-04-01
AI Summary
Existing diffusion-based video frame interpolation methods suffer from low accuracy and slow inference due to excessively large denoising ranges in latent space. To address this, we propose Hierarchical Optical Flow Diffusion Modeling (HLFM), the first framework to explicitly formulate bilateral optical flow as a hierarchical diffusion process, thereby decoupling motion estimation from content synthesis. We further design a flow-guided image synthesizer that enables end-to-end generation of high-fidelity intermediate frames. By hierarchically constraining the denoising search space via optical flow priors, HLFM achieves superior modeling precision without sacrificing computational efficiency. On multiple standard benchmarks, HLFM establishes new state-of-the-art performance in frame interpolation. Notably, it accelerates inference by over 10× compared to existing diffusion-based approaches while preserving strong temporal consistency and visual fidelity.

Abstract
Most recent diffusion-based methods still show a large gap compared to non-diffusion methods for video frame interpolation, in both accuracy and efficiency. Most of them formulate the problem as a denoising procedure directly in latent space, which is less effective because the latent space is large. We propose to model bilateral optical flow explicitly with hierarchical diffusion models, which have a much smaller search space in the denoising procedure. Based on the flow diffusion model, we then use a flow-guided image synthesizer to produce the final result. We train the flow diffusion model and the image synthesizer end to end. Our method achieves state-of-the-art accuracy and is more than 10 times faster than other diffusion-based methods. The project page is at: https://hfd-interpolation.github.io.
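To make the role of bilateral flow concrete, the following is a minimal NumPy sketch of how two flow fields, one from the intermediate frame to each input frame, can be used to warp and blend the inputs. It is an illustration under stated assumptions, not the paper's synthesizer: the abstract describes a learned flow-guided image synthesizer, whereas this sketch swaps in plain bilinear backward warping and a linear blend. The function names `backward_warp` and `blend_midframe` are hypothetical.

```python
import numpy as np


def backward_warp(image, flow):
    """Sample `image` at (x + dx, y + dy) using bilinear interpolation.

    image: (H, W) or (H, W, C) float array.
    flow:  (H, W, 2) array holding (dx, dy) per pixel.
    Sample positions outside the image are clamped to the border.
    """
    h, w = image.shape[:2]
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    x = np.clip(xs + flow[..., 0], 0, w - 1)
    y = np.clip(ys + flow[..., 1], 0, h - 1)
    x0 = np.floor(x).astype(int)
    y0 = np.floor(y).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    wx = x - x0
    wy = y - y0
    if image.ndim == 3:
        # Add a channel axis so the weights broadcast over colour channels.
        wx = wx[..., None]
        wy = wy[..., None]
    top = (1 - wx) * image[y0, x0] + wx * image[y0, x1]
    bottom = (1 - wx) * image[y1, x0] + wx * image[y1, x1]
    return (1 - wy) * top + wy * bottom


def blend_midframe(frame0, frame1, flow_t0, flow_t1, t=0.5):
    """Warp both inputs to time t with bilateral flow and blend linearly.

    Assumption: a fixed linear blend stands in for the learned synthesizer.
    """
    warped0 = backward_warp(frame0, flow_t0)
    warped1 = backward_warp(frame1, flow_t1)
    return (1 - t) * warped0 + t * warped1
```

With a correct pair of flows, both warped images agree on the intermediate frame, so even this naive blend reproduces simple translations. A learned synthesizer is needed for occlusions and for regions where the two warps disagree.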
Problem

Research questions and friction points this paper is trying to address.

Improves accuracy and efficiency in video frame interpolation
Reduces search space by modeling bilateral optical flow hierarchically
Combines flow diffusion and image synthesis for better results
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical diffusion models for optical flow
Flow-guided image synthesizer for interpolation
End-to-end training of diffusion and synthesizer
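The summary does not spell out how the hierarchy is organized. One common reading is a coarse-to-fine pyramid, in which flow is estimated at low resolution and then upsampled and refined level by level. The sketch below shows only that control flow, under that assumption. `refine_fn` is a hypothetical callback; in a diffusion setting it would stand for a few denoising steps of a trained model at the given level.

```python
import numpy as np


def upsample_flow(flow):
    """Double the spatial resolution of an (H, W, 2) flow field.

    Nearest-neighbour repetition keeps the sketch short. The vectors are
    multiplied by 2 because displacements are measured in pixels, and a
    pixel at the finer level is half the size.
    """
    up = np.repeat(np.repeat(flow, 2, axis=0), 2, axis=1)
    return 2.0 * up


def coarse_to_fine(coarse_flow, num_levels, refine_fn):
    """Refine flow across `num_levels` pyramid levels, coarsest first.

    refine_fn(flow, level) -> flow is a hypothetical per-level refiner.
    """
    flow = refine_fn(coarse_flow, 0)
    for level in range(1, num_levels):
        flow = refine_fn(upsample_flow(flow), level)
    return flow
```

Under this reading, each level only has to correct the residual left by the coarser level, which is one way a hierarchy can shrink the search space that the denoiser must cover.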
Authors

Yang Hai (Insta360): Computer Vision, Deep Learning, 3D Vision
Guo Wang (Insta360 Research)
Tan Su (Insta360 Research)
Wenjie Jiang (Insta360 Research)
Yinlin Hu (MagicLeap): Computer Vision, Augmented Reality, Machine Learning