SceneGram: Conceptualizing and Describing Tangrams in Scene Context

📅 2025-06-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study investigates how scene context shapes human conceptualization and linguistic reference to identical abstract shapes (Tangram figures) and evaluates whether multimodal large language models (MLLMs) exhibit comparable cognitive flexibility. Method: We introduce SceneGram—the first cross-scene, human-annotated dataset—featuring crowd-sourced conceptual descriptions of the same geometric shapes across diverse contextual scenes. Using rigorously controlled cross-scene experiments and human–model comparative analysis, we quantify contextual effects on naming preferences and conceptual expectations. Contribution/Results: We demonstrate that human conceptualization is strongly grounded in embodied context, whereas state-of-the-art MLLMs—even under prompt tuning—exhibit severe deficits in contextual sensitivity and conceptual variability. This work establishes a novel benchmark and theoretical framework for advancing cognitively interpretable vision-language modeling.

Technology Category

Application Category

📝 Abstract
Research on reference and naming suggests that humans can come up with very different ways of conceptualizing and referring to the same object, e.g. the same abstract tangram shape can be a"crab","sink"or"space ship". Another common assumption in cognitive science is that scene context fundamentally shapes our visual perception of objects and conceptual expectations. This paper contributes SceneGram, a dataset of human references to tangram shapes placed in different scene contexts, allowing for systematic analyses of the effect of scene context on conceptualization. Based on this data, we analyze references to tangram shapes generated by multimodal LLMs, showing that these models do not account for the richness and variability of conceptualizations found in human references.
Problem

Research questions and friction points this paper is trying to address.

How scene context influences human conceptualization of abstract shapes
Comparing human and AI model variability in shape conceptualization
Creating dataset to analyze context effects on object references
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dataset for tangram references in scenes
Analyzes scene context effect on conceptualization
Compares human and LLM tangram conceptualizations
🔎 Similar Papers
No similar papers found.