SI-Diff: A Framework for Learning Search and High-Precision Insertion with a Force-Domain Diffusion Policy

📅 2026-05-12
📈 Citations: 0
Influential: 0
📄 PDF

career value

257K/year
🤖 AI Summary
This work addresses the challenge of unifying search and high-precision insertion behaviors in contact-rich assembly tasks, where relative pose uncertainty complicates joint modeling. To this end, the authors propose SI-Diff, a framework that leverages a force-domain diffusion strategy to jointly learn both behaviors and introduces a novel mode-conditioning mechanism enabling a single policy to adaptively switch between search and insertion modes. By integrating tactile and end-effector velocity observations, teacher–student imitation learning, and a new search teacher policy that generates diverse trajectories, SI-Diff significantly enhances generalization. Compared to the TacDiffusion baseline, it improves lateral (x–y) misalignment tolerance from 2 mm to 5 mm and demonstrates strong zero-shot transfer performance on unseen object geometries.
📝 Abstract
Contact-rich assembly is fundamental in robotics but poses significant challenges due to uncertainties in relative poses, such as misalignments and small clearances in peg-in-hole tasks. Existing approaches typically address search and high-precision insertion separately, because these tasks involve distinct action patterns. However, supporting both tasks within a single model, without switching models or weights, is desirable for intelligent assembly systems. In this work, we propose SI-Diff, a framework that learns both search and high-precision insertion through a force-domain diffusion policy. To this end, we introduce a new mode-conditioning mechanism that enables the policy to capture distinct action behaviors under a single framework. Moreover, we develop a new search teacher policy that can generate diverse trajectories. By training on successful and efficient demonstrations provided by the teacher policy, the model learns the mapping from tactile and end-effector velocity observations to effective action behaviors. We conduct thorough experiments to show that SI-Diff extends the tolerance to x-y misalignments from 2 mm to 5 mm compared to the state-of-the-art baseline, TacDiffusion, while also demonstrating strong zero-shot transferability to unseen shapes.
Problem

Research questions and friction points this paper is trying to address.

contact-rich assembly
peg-in-hole
search and insertion
relative pose uncertainty
unified policy
Innovation

Methods, ideas, or system contributions that make the work stand out.

force-domain diffusion policy
mode-conditioning mechanism
search and insertion
zero-shot transferability
contact-rich assembly
🔎 Similar Papers
2023-09-20IEEE transactions on circuits and systems for video technology (Print)Citations: 0
Yibo Liu
Yibo Liu
Research Scientist @ Epson
S
Stanko Oparnica
Epson Canada, Markham, Ontario L3R 6G3, Canada
S
Simon Shewchun-Jakaitis
Queen's University, Kingston, Ontario K7L 3N6, Canada; also with Epson Canada during internship
G
Guoyi Fu
Epson Canada, Markham, Ontario L3R 6G3, Canada
J
Jie Wang
Epson Canada, Markham, Ontario L3R 6G3, Canada
Jun Yang
Jun Yang
Epson Canada
SLAMRobot LearningComputer VisionMachine Learning
A
Anand Jagannathan
Epson Canada, Markham, Ontario L3R 6G3, Canada
T
Tony Hong-Yau Lo
Epson Canada, Markham, Ontario L3R 6G3, Canada