FocalPolicy: Frequency-Optimized Chunking and Locally Anchored Flow Matching for Coherent Visuomotor Policy

📅 2026-05-15

📈 Citations: 0

✨ Influential: 0

career value

214K/year

🤖 AI Summary

Existing visuomotor policies struggle to simultaneously achieve local precision and global coherence when generating long-horizon action trajectories, often resulting in discontinuities across action segments. To address this, this work proposes FocalPolicy, which innovatively integrates frequency-aware chunking with a locally anchored flow matching mechanism. Specifically, a forward-looking composite objective function jointly optimizes temporal alignment of local actions and structural consistency across chunks in the frequency domain. Additionally, a locally anchored sampling strategy is introduced to enhance the propagation efficiency of target signals within the flow matching framework. Experiments demonstrate that FocalPolicy significantly outperforms current methods across multiple visuomotor tasks, and its components can be effectively transferred to other baselines, confirming both its generality and efficacy.

📝 Abstract

Visuomotor policies aim to learn complex manipulation tasks from expert demonstrations. However, generating smooth and coherent trajectories remains challenging, as it requires balancing proximal precision with distal foresight. Existing approaches typically focus on optimizing intra-chunk action distributions, often neglecting the inter-chunk coherence. Consequently, inter-chunk discontinuities significantly impede the learning of coherent long-horizon actions. To overcome this limitation and achieve a synergetic balance between precision and foresight, we propose FocalPolicy, a foresight-aware visuomotor policy that combines Frequency-Optimized Chunking with Locally Anchored flow matching. We introduce a foresight composite objective that supervises time-domain alignment within the proximal actions while regularizing frequency-domain structure over multiple future action chunks to improve cross-chunk coherence. To efficiently learn complex action distributions, we design locally anchored campling to enhance target signal propagation efficiency during consistency flow matching training. Extensive experiments demonstrate that FocalPolicy outperforms existing approaches and confirm the generalizability of our modules to other baselines. Project website: https://focalpolicy.github.io/

Problem

Research questions and friction points this paper is trying to address.

visuomotor policy

action coherence

chunking

long-horizon manipulation

trajectory smoothness

Innovation

Methods, ideas, or system contributions that make the work stand out.

Frequency-Optimized Chunking

Locally Anchored Flow Matching

Visuomotor Policy

Cross-chunk Coherence

Foresight-aware Learning

🔎 Similar Papers

Learning Manipulation Skills through Robot Chain-of-Thought with Sparse Failure Guidance

2024-05-22arXiv.orgCitations: 1