🤖 AI Summary
This study addresses the limitations of traditional physiological metrics in assessing exercise intensity, which are often constrained by medications or equipment and pose risks for individuals with cardiovascular disease. To overcome these challenges, the authors propose AktivTalk—the first mobile application to digitize the clinical “talk test”—leveraging structured voice interactions to combine user self-assessment with automated classification of exercise intensity. The system extracts MFCC acoustic features and employs class-balancing strategies, cross-validation, and a lightweight neural network, achieving 90% accuracy in distinguishing high versus non-high intensity exercise among 20 participants. Results demonstrate that this approach offers high usability and practicality, significantly outperforming conventional guided assessments and establishing a safe, accessible new paradigm for monitoring exercise intensity.
📝 Abstract
Monitoring exercise intensity is critical for safe and effective physical activity, particularly for individuals with cardiovascular disease, where overexertion can pose serious risks. Although physiological measures such as heart rate are widely used for avoiding overexertion, they can be unreliable in certain cases, such as when affected by medication or when wearables are worn too loosely. We introduce AktivTalk, a mobile prototype that digitizes the clinically validated Talk Test to support voice-based, in-the-moment self-assessment of exertion. In a within-subject study with 20 participants, we collected exertion-labeled voice samples and found that AktivTalk was rated as highly usable and preferred over conductor-guided assessment. We further explored automated exertion classification from Talk Test speech. Using MFCC-based features with class balancing and cross-validation, a lightweight neural classifier achieved up to 90% accuracy for detecting high vs.non-high exertion from Talk Test recordings. This work highlights the potential of structured voice interactions for accessible exertion assessment and motivates future passive exertion monitoring from speech.