ConPoSe: LLM-Guided Contact Point Selection for Scalable Cooperative Object Pushing

📅 2025-10-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Scalability of contact-point combination optimization in multi-robot object pushing tasks suffers from combinatorial explosion as robot count and object size increase. Method: This paper proposes ConPoSe—the first approach to leverage large language models (LLMs) for contact-point selection, exploiting their commonsense reasoning to guide search in high-dimensional combinatorial spaces, while integrating local search to ensure physical feasibility and convergence efficiency. Contribution/Results: ConPoSe overcomes computational bottlenecks of traditional analytical methods in large-scale scenarios. It is validated on cubic, cylindrical, and T-shaped objects. Experiments demonstrate significant improvements over both pure analytical baselines and LLM-only approaches in both contact-point quality and computational scalability, establishing a new paradigm for physics-aware, LLM-guided robotic coordination.

Technology Category

Application Category

📝 Abstract
Object transportation in cluttered environments is a fundamental task in various domains, including domestic service and warehouse logistics. In cooperative object transport, multiple robots must coordinate to move objects that are too large for a single robot. One transport strategy is pushing, which only requires simple robots. However, careful selection of robot-object contact points is necessary to push the object along a preplanned path. Although this selection can be solved analytically, the solution space grows combinatorially with the number of robots and object size, limiting scalability. Inspired by how humans rely on common-sense reasoning for cooperative transport, we propose combining the reasoning capabilities of Large Language Models with local search to select suitable contact points. Our LLM-guided local search method for contact point selection, ConPoSe, successfully selects contact points for a variety of shapes, including cuboids, cylinders, and T-shapes. We demonstrate that ConPoSe scales better with the number of robots and object size than the analytical approach, and also outperforms pure LLM-based selection.
Problem

Research questions and friction points this paper is trying to address.

Selecting robot-object contact points for cooperative pushing
Scaling contact point selection with robot numbers and object size
Combining LLM reasoning with local search for transport optimization
Innovation

Methods, ideas, or system contributions that make the work stand out.

LLM-guided local search for contact points
Combines reasoning with scalable optimization
Outperforms analytical and pure LLM approaches
N
Noah Steinkrüger
Department of Computer Science and Artificial Intelligence, University of Technology Nuremberg, Germany
N
Nisarga Nilavadi
Department of Computer Science and Artificial Intelligence, University of Technology Nuremberg, Germany
Wolfram Burgard
Wolfram Burgard
Professor of Computer Science, University of Technology Nuremberg
RoboticsArtificial IntelligenceAIMachine LearningComputer Vision
T
Tanja Katharina Kaiser
Department of Computer Science and Artificial Intelligence, University of Technology Nuremberg, Germany