CROSS-GAiT: Cross-Attention-Based Multimodal Representation Fusion for Parametric Gait Adaptation in Complex Terrains

📅 2024-09-25
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address poor dynamic gait adaptability and high energy consumption of quadrupedal robots navigating unknown complex terrains (e.g., tall vegetation, quicksand, rubble), this paper proposes a cross-modal fusion-based real-time adaptive gait control method. Methodologically, we introduce, for the first time, a cross-modal cross-attention mechanism that jointly encodes masked ViT visual features and temporally modeled IMU/joint dynamics features extracted via dilated causal convolution, thereby constructing a unified terrain-dynamics representation and enabling end-to-end mapping to foot-height and hip-abduction parameters. The framework supports zero-shot generalization and millisecond-level online adaptation. Experiments on the Vision 60 platform demonstrate a 7.04% reduction in IMU energy density, a 27.3% decrease in total joint torque, a 64.5% improvement in task success rate, a 4.91% reduction in traversal time, and a 4.48% increase in terrain classification accuracy.

Technology Category

Application Category

📝 Abstract
We present CROSS-GAiT, a novel algorithm for quadruped robots that uses Cross Attention to fuse terrain representations derived from visual and time-series inputs, including linear accelerations, angular velocities, and joint efforts. These fused representations are used to adjust the robot's step height and hip splay, enabling adaptive gaits that respond dynamically to varying terrain conditions. We generate these terrain representations by processing visual inputs through a masked Vision Transformer (ViT) encoder and time-series data through a dilated causal convolutional encoder. The cross-attention mechanism then selects and integrates the most relevant features from each modality, combining terrain characteristics with robot dynamics for better-informed gait adjustments. CROSS-GAiT uses the combined representation to dynamically adjust gait parameters in response to varying and unpredictable terrains. We train CROSS-GAiT on data from diverse terrains, including asphalt, concrete, brick pavements, grass, dense vegetation, pebbles, gravel, and sand. Our algorithm generalizes well and adapts to unseen environmental conditions, enhancing real-time navigation performance. CROSS-GAiT was implemented on a Ghost Robotics Vision 60 robot and extensively tested in complex terrains with high vegetation density, uneven/unstable surfaces, sand banks, deformable substrates, etc. We observe at least a 7.04% reduction in IMU energy density and a 27.3% reduction in total joint effort, which directly correlates with increased stability and reduced energy usage when compared to state-of-the-art methods. Furthermore, CROSS-GAiT demonstrates at least a 64.5% increase in success rate and a 4.91% reduction in time to reach the goal in four complex scenarios. Additionally, the learned representations perform 4.48% better than the state-of-the-art on a terrain classification task.
Problem

Research questions and friction points this paper is trying to address.

Fusing visual and time-series data for adaptive robot gaits
Adjusting gait parameters dynamically for complex terrains
Improving stability and energy efficiency in quadruped robots
Innovation

Methods, ideas, or system contributions that make the work stand out.

Cross-attention fuses visual and time-series inputs
Dynamically adjusts gait parameters for terrain adaptation
Uses masked ViT and dilated convolutional encoders
🔎 Similar Papers
No similar papers found.
Gershom Seneviratne
Gershom Seneviratne
University of Maryland
RoboticsMotion PlanningAutonomous NavigationPerception
Kasun Weerakoon
Kasun Weerakoon
Ph.D., University of Maryland, College Park, MD
RoboticsDeep Reinforcement LearningAutonomous Navigation
M
Mohamed Bashir Elnoor
Dept. of Electrical and Computer Engineering, University of Maryland, College Park, MD, USA
V
Vignesh Rajgopal
A. James Clark School of Engineering, University of Maryland, College Park, MD, USA
H
Harshavarthan Varatharajan
A. James Clark School of Engineering, University of Maryland, College Park, MD, USA
M
M. K. M. Jaffar
Dept. of Aerospace Engineering, University of Maryland, College Park, MD, USA
Jason Pusey
Jason Pusey
U.S. Army Research Laboratory (ARL)
Dinesh Manocha
Dinesh Manocha
Distinguished University Professor, University of Maryland at College Park
computer graphicsgeometric modelingmotion planningvirtual realityrobotics