๐ค AI Summary
Current cognitive-linguistic impairment assessment methods rely on eye-tracking data or manual annotation of Content Units (CIUs) to construct spatio-semantic graphs, limiting clinical accessibility and scalability. This paper proposes the first end-to-end automated method for constructing spatio-semantic graphs: leveraging NLP-based semantic parsing and spatial relation modeling, integrated with cognitive-linguistic constraint rules, it directly extracts CIUs and generates graph structures from descriptive narratives of the Cookie Theft pictureโwithout requiring eye-tracking signals or manual annotations. The approach significantly improves deployment efficiency and clinical feasibility. Evaluated on distinguishing individuals with cognitive impairment from healthy controls, its automatically derived graph features achieve superior classification performance, with statistically significant gains over manually annotated baselines and enhanced discriminative power across clinical populations.
๐ Abstract
Existing methods for analyzing linguistic content from picture descriptions for assessment of cognitive-linguistic impairment often overlook the participant's visual narrative path, which typically requires eye tracking to assess. Spatio-semantic graphs are a useful tool for analyzing this narrative path from transcripts alone, however they are limited by the need for manual tagging of content information units (CIUs). In this paper, we propose an automated approach for estimation of spatio-semantic graphs (via automated extraction of CIUs) from the Cookie Theft picture commonly used in cognitive-linguistic analyses. The method enables the automatic characterization of the visual semantic path during picture description. Experiments demonstrate that the automatic spatio-semantic graphs effectively differentiate between cognitively impaired and unimpaired speakers. Statistical analyses reveal that the features derived by the automated method produce comparable results to the manual method, with even greater group differences between clinical groups of interest. These results highlight the potential of the automated approach for extracting spatio-semantic features in developing clinical speech models for cognitive impairment assessment.