🤖 AI Summary
Existing diffusion-based video frame interpolation methods suffer from low accuracy and slow inference due to excessively large denoising ranges in latent space. To address this, we propose Hierarchical Optical Flow Diffusion Modeling (HLFM), the first framework to explicitly formulate bilateral optical flow as a hierarchical diffusion process, thereby decoupling motion estimation from content synthesis. We further design a flow-guided image synthesizer that enables end-to-end generation of high-fidelity intermediate frames. By hierarchically constraining the denoising search space via optical flow priors, HLFM achieves superior modeling precision without sacrificing computational efficiency. On multiple standard benchmarks, HLFM establishes new state-of-the-art performance in frame interpolation. Notably, it accelerates inference by over 10× compared to existing diffusion-based approaches while preserving strong temporal consistency and visual fidelity.
📝 Abstract
Most recent diffusion-based methods for video frame interpolation still lag far behind non-diffusion methods in both accuracy and efficiency. Most of them formulate the problem directly as a denoising procedure in latent space, which is less effective because of the large latent search space. We propose to model bilateral optical flow explicitly with hierarchical diffusion models, which have a much smaller search space in the denoising procedure. Based on the flow diffusion model, we then use a flow-guided image synthesizer to produce the final result. We train the flow diffusion model and the image synthesizer end to end. Our method achieves state-of-the-art accuracy and is more than 10x faster than other diffusion-based methods. The project page is at: https://hfd-interpolation.github.io.
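As a rough illustration of the two-stage design described above, the sketch below mimics the pipeline with toy NumPy stand-ins: a coarse-to-fine loop that "denoises" bilateral flow fields at increasing resolutions, each level conditioned on the upsampled estimate from the coarser level, followed by a warp-and-blend synthesizer. All function names and the simplified denoising step are hypothetical placeholders for the paper's learned networks, not its actual implementation.

```python
import numpy as np

def backward_warp(frame, flow):
    """Sample `frame` at positions displaced by `flow` (nearest-neighbour).

    frame: (H, W, C) image; flow: (H, W, 2) per-pixel (dy, dx) displacement.
    """
    h, w = frame.shape[:2]
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    src_y = np.clip(np.round(ys + flow[..., 0]).astype(int), 0, h - 1)
    src_x = np.clip(np.round(xs + flow[..., 1]).astype(int), 0, w - 1)
    return frame[src_y, src_x]

def toy_denoise_step(noisy_flow, cond_flow, alpha=0.5):
    # Stand-in for one learned denoising step: pull the noisy sample
    # toward the conditioning (coarser-level) flow estimate.
    return alpha * noisy_flow + (1.0 - alpha) * cond_flow

def hierarchical_flow_diffusion(h, w, levels=3, steps=4, rng=None):
    """Toy coarse-to-fine bilateral flow estimation.

    Each level denoises a flow field at double the previous resolution,
    conditioned on the (upsampled, rescaled) coarser-level estimate, so the
    denoising search space per level stays small. Returns two flow fields
    (toward each input frame); with a zero prior they converge near zero.
    """
    rng = rng or np.random.default_rng(0)
    flows = []
    for _ in range(2):  # bilateral: one flow per input frame
        est = np.zeros((h // 2 ** (levels - 1), w // 2 ** (levels - 1), 2))
        for lvl in range(levels):
            if lvl > 0:
                # nearest-neighbour upsample; flow magnitudes double with size
                est = est.repeat(2, axis=0).repeat(2, axis=1) * 2.0
            sample = est + rng.normal(scale=1.0, size=est.shape)  # noisy init
            for _ in range(steps):
                sample = toy_denoise_step(sample, est)
            est = sample
        flows.append(est)
    return flows

def synthesize_middle_frame(frame0, frame1, flow_t0, flow_t1):
    # Flow-guided synthesis: warp both inputs toward time t and blend.
    w0 = backward_warp(frame0, flow_t0)
    w1 = backward_warp(frame1, flow_t1)
    return 0.5 * w0 + 0.5 * w1

# Usage on two constant 8x8 frames: the interpolated frame is their blend.
h, w = 8, 8
f0 = np.full((h, w, 3), 0.2)
f1 = np.full((h, w, 3), 0.8)
flow_t0, flow_t1 = hierarchical_flow_diffusion(h, w)
mid = synthesize_middle_frame(f0, f1, flow_t0, flow_t1)
```

The point of the hierarchy is visible even in this toy: each resolution level only has to correct a residual around the coarser estimate, rather than searching the full flow space at once.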