🤖 AI Summary
This study investigates how scene context shapes the way humans conceptualize and refer to identical abstract tangram shapes, and evaluates whether multimodal large language models (MLLMs) show comparable contextual flexibility. Method: The authors introduce SceneGram, a human-annotated dataset of crowd-sourced references to the same tangram shapes placed in diverse scene contexts, enabling systematic cross-scene analyses of naming preferences and conceptual expectations. Contribution/Results: Analyses of this data show that human conceptualization varies substantially with scene context, whereas references generated by state-of-the-art MLLMs fail to capture the richness and variability found in human responses. The dataset provides a resource for studying contextual effects on conceptualization in both humans and vision-language models.
📝 Abstract
Research on reference and naming suggests that humans can come up with very different ways of conceptualizing and referring to the same object, e.g. the same abstract tangram shape can be a "crab", "sink", or "space ship". Another common assumption in cognitive science is that scene context fundamentally shapes our visual perception of objects and conceptual expectations. This paper contributes SceneGram, a dataset of human references to tangram shapes placed in different scene contexts, allowing for systematic analyses of the effect of scene context on conceptualization. Based on this data, we analyze references to tangram shapes generated by multimodal LLMs, showing that these models do not account for the richness and variability of conceptualizations found in human references.