Algorithmic Prompt Generation for Diverse Human-like Teaming and Communication with Large Language Models

๐Ÿ“… 2025-04-04
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF

career value

212K/year
๐Ÿค– AI Summary
Scaling human team behavioral data for human-AI collaborative decision-making remains challenging due to the difficulty of collecting diverse, high-quality human interaction traces. Method: This paper introduces the first algorithmic prompt-generation framework that integrates Quality-Diversity (QD) optimization with large language model (LLM) agentsโ€”requiring neither handcrafted prompts nor large-scale user studies. It automatically discovers prompt strategies that elicit multidimensional, human-like communication and coordination behaviors from LLMs in multi-step collaborative settings, synthesizing a broad spectrum of team behavioral patterns. Contribution/Results: Evaluated in a 54-participant user study, the generated behaviors accurately reproduce key human collaboration trends and uncover novel coordination patterns otherwise obscured by data sparsity. The approach significantly outperforms baseline methods in both behavioral diversity and fidelity to human behavior.

Technology Category

Application Category

๐Ÿ“ Abstract
Understanding how humans collaborate and communicate in teams is essential for improving human-agent teaming and AI-assisted decision-making. However, relying solely on data from large-scale user studies is impractical due to logistical, ethical, and practical constraints, necessitating synthetic models of multiple diverse human behaviors. Recently, agents powered by Large Language Models (LLMs) have been shown to emulate human-like behavior in social settings. But, obtaining a large set of diverse behaviors requires manual effort in the form of designing prompts. On the other hand, Quality Diversity (QD) optimization has been shown to be capable of generating diverse Reinforcement Learning (RL) agent behavior. In this work, we combine QD optimization with LLM-powered agents to iteratively search for prompts that generate diverse team behavior in a long-horizon, multi-step collaborative environment. We first show, through a human-subjects experiment (n=54 participants), that humans exhibit diverse coordination and communication behavior in this domain. We then show that our approach can effectively replicate trends from human teaming data and also capture behaviors that are not easily observed without collecting large amounts of data. Our findings highlight the combination of QD and LLM-powered agents as an effective tool for studying teaming and communication strategies in multi-agent collaboration.
Problem

Research questions and friction points this paper is trying to address.

Generating diverse human-like teaming behaviors using LLMs
Overcoming constraints of manual prompt design for diversity
Replicating human communication trends in multi-agent collaboration
Innovation

Methods, ideas, or system contributions that make the work stand out.

Combines QD optimization with LLM-powered agents
Generates diverse prompts for human-like teaming
Replicates human trends with synthetic behaviors
๐Ÿ”Ž Similar Papers
No similar papers found.