What Makes a Model Breathe? Understanding Reinforcement Learning Reward Function Design in Biomechanical User Simulation

📅 2025-03-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the lack of systematic guidance in designing reward functions for reinforcement learning (RL) in biomechanical human simulation. Methodologically, we develop a biomechanical simulation framework based on the Proximal Policy Optimization (PPO) algorithm, integrating a choice-reaction task paradigm and trajectory behavioral analysis to quantitatively disentangle the functional roles of three reward components: effort minimization, task completion, and target proximity. Our key contribution is the first identification of their interaction principles, leading to a novel “completion + proximity” dual-core reward paradigm: synergistic integration of these two components is essential for task success, whereas effort regularization is optional but helps suppress aberrant motions. Experiments demonstrate that this paradigm significantly improves trajectory fidelity and stability while lowering the design barrier for non-RL experts—advancing biomechanical simulation toward high-fidelity, deployable solutions.

Technology Category

Application Category

📝 Abstract
Biomechanical models allow for diverse simulations of user movements in interaction. Their performance depends critically on the careful design of reward functions, yet the interplay between reward components and emergent behaviours remains poorly understood. We investigate what makes a model"breathe"by systematically analysing the impact of rewarding effort minimisation, task completion, and target proximity on movement trajectories. Using a choice reaction task as a test-bed, we find that a combination of completion bonus and proximity incentives is essential for task success. Effort terms are optional, but can help avoid irregularities if scaled appropriately. Our work offers practical insights for HCI designers to create realistic simulations without needing deep reinforcement learning expertise, advancing the use of simulations as a powerful tool for interaction design and evaluation in HCI.
Problem

Research questions and friction points this paper is trying to address.

Understanding reward function design in biomechanical user simulations.
Analyzing impact of reward components on movement trajectories.
Providing insights for realistic simulations in HCI design.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Analyzes reward function impact on biomechanical models
Combines completion bonus and proximity for task success
Provides insights for realistic HCI simulation design
🔎 Similar Papers
No similar papers found.
Hannah Selder
Hannah Selder
ScaDS.AI Dresden/Leipzig
Florian Fischer
Florian Fischer
University of Cambridge
Reinforcement LearningComputational InteractionHCIBiomechanical SimulationMachine Learning
P
P. O. Kristensson
University of Cambridge, Cambridge, United Kingdom
A
A. Fleig
Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Leipzig University, Leipzig, Germany