🤖 AI Summary
This study addresses the tension between interpretive depth and computational scalability in classroom dialogue analysis, along with temporal observation challenges such as low-frequency variables, estimation uncertainty, and variability in time-unit granularity. To reconcile these issues, the authors propose the AVVA framework, which integrates qualitative interpretation with quantitative modeling by leveraging verbatim transcripts and multimodal interaction data. Analytical rigor is ensured through a ten-step triangulation protocol. The framework innovatively incorporates a four-criterion stability assessment to identify variable relationship patterns—such as granularity-invariant or scale-specific behaviors—across temporal resolutions. Core methodological challenges are mitigated via base-rate filtering, bootstrap confidence intervals, and systematic time-unit analysis. Empirical validation on 23 hours of classroom recordings demonstrates that AVVA effectively transforms complex discourse into structured datasets, uncovering educationally meaningful interaction patterns.
📝 Abstract
Background: The classroom discourse analysis has been transformed by the growing use of audio-video multimodal data, which demands analytical methods that balance interpretive depth with computational scalability.
Methods: This study introduces the Audio Video Verbal Analysis (AVVA) framework, adapted from the Verbal Analysis method to integrate qualitative interpretation with quantitative modelling. Unlike fully multimodal learning analytics approaches, AVVA focuses on verbatim transcripts with essential interactional modalities.
Findings: The framework embeds triangulation as a core design strategy across ten methodological steps, strengthening validity and analytical rigour. A comprehensive validation scheme addresses fundamental challenges in temporal observational research: Phi Ceiling for low-frequency variables (via Base Rate Filtering), estimation uncertainty (via bootstrap confidence intervals), and the Modifiable Temporal Unit Problem, where measured associations depend on observational window size. Four-criterion stability assessment (sign consistency, confidence interval overlap, zero exclusion, magnitude stability) classifies variable pairs into interpretable patterns: grain-invariant, scale-specific, or multi-scale, etc. structures across temporal grain sizes. Its application to 23 hours of classroom recordings illustrates its practical viability and its potential to yield meaningful insights.
Contribution: The framework thus provides a scalable pathway for transforming rich classroom discourse into analysable datasets.