🤖 AI Summary
This work addresses the challenge of manipulating fragile, deformable containers—such as liquid-filled plastic cups—which require real-time grip-force modulation within an extremely narrow range to prevent slippage or irreversible deformation. The authors propose a reflexive control paradigm grounded in the statistical properties of tactile sensor noise, employing a static grasp-and-unload protocol to automatically calibrate thresholds. This approach establishes a three-channel closed-loop system encompassing slip prevention, adaptive release, and force protection. Notably, it operates without external force calibration, manual parameter tuning, or reliance on material-specific physical models, instead leveraging only the intrinsic sensor noise to deliver an interpretable, plug-and-play force-sensitive safety layer. Experiments demonstrate a 5/5 success rate in anti-deformation tasks (versus a baseline maximum of 1/5) and a 9/10 success rate in dynamic pouring tasks, substantially outperforming fixed-force baselines, which achieved 0/10.
📝 Abstract
Manipulating fragile deformable containers, such as disposable plastic cups filled with liquid, demands real-time grip-force adaptation within an extremely narrow force margin: insufficient force causes slip, while excessive force irreversibly deforms the thin wall. Existing approaches struggle to achieve such force-sensitive manipulation tasks. We propose a noise-statistics-based calibration-driven reflex control paradigm with vision-based tactile sensing: by analyzing the sensor's intrinsic noise characteristics (via a brief static-hold-and-unload protocol), we directly derive all controller thresholds, eliminating external force calibration, trial-and-error manual tuning, or material-specific physical models. Instantiating this paradigm, we present TactileReflex, a three-channel closed-loop controller that extracts three image-level proxies, shear intensity ($S_y$), contact intensity ($F_n$), and center of pressure ($C$), from dual visuo-tactile sensors and drives prioritized reflex channels at ~12 Hz for slip suppression, weight-adaptive release, and force protection. Each channel closes the loop directly on its proxy via noise-derived thresholds. Ablation demonstrates that only the full three-channel system is able to prevent irreversible container deformation (5/5 success vs. at most 1/5 for partial configurations). In a dynamic pouring task, fixed-effort baselines fail in all 10 attempts due to pose drift, while TactileReflex achieves 9/10 success across two water volumes. As a self-contained and interpretable controller, TactileReflex can serve as a plug-and-play safety layer beneath high-level manipulation pipelines, including haptic-free VR teleoperation and vision-language-action (VLA) policies.