🤖 AI Summary
This study addresses the lack of region-specific alignment in social cognition modeling by extending brain-tuning to the multimodal (audio-visual) domain, focusing on the superior temporal sulcus (STS), a core hub for social processing. Methodologically, we fine-tune a deep neural model that learns joint audio-visual representations using fMRI responses recorded while participants watched *Friends*, targeting fine-grained activity prediction within the STS and its anatomically adjacent regions. Our contributions are threefold: (1) introducing the first multimodal brain-tuning paradigm; (2) achieving significantly improved fMRI activity prediction within the STS; and (3) demonstrating substantial gains on a real-world social semantic task, sarcasm detection in sitcoms, thereby validating the efficacy and generalizability of targeted, region-specific brain-tuning for modeling high-level social cognition.
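The summary does not include implementation details, but the brain-tuning setup it describes can be sketched as fitting a voxel-wise readout on top of the joint audio-visual encoder and backpropagating a regression loss on recorded STS responses into the encoder itself. The sketch below is a minimal illustration under assumed names: `VoxelReadout`, `brain_tuning_step`, and the plain MSE objective are illustrative choices, not the authors' confirmed implementation.

```python
import torch
import torch.nn as nn

class VoxelReadout(nn.Module):
    """Linear readout mapping joint audio-visual embeddings to STS voxel responses.
    (Hypothetical module for illustration; the paper's head may differ.)"""
    def __init__(self, embed_dim: int, n_voxels: int):
        super().__init__()
        self.linear = nn.Linear(embed_dim, n_voxels)

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        return self.linear(features)

def brain_tuning_step(encoder, readout, optimizer, av_clip, fmri_sts):
    """One brain-tuning step: predict STS fMRI responses for a paired
    audio-visual clip, then backpropagate the regression loss into the
    encoder so its representations shift toward the targeted region."""
    optimizer.zero_grad()
    features = encoder(av_clip)        # (batch, embed_dim) joint A/V embedding
    pred = readout(features)           # (batch, n_voxels) predicted responses
    loss = nn.functional.mse_loss(pred, fmri_sts)  # assumed loss; paper may use another
    loss.backward()
    optimizer.step()
    return loss.item()
```

Because the optimizer updates both the readout and the encoder, the model is "tuned to" the region rather than merely probed by it; freezing the encoder would instead reduce this to a standard encoding-model fit.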
📝 Abstract
Recent studies on audio models show that brain-tuning (fine-tuning models to better predict corresponding fMRI activity) improves brain alignment and increases performance on downstream semantic and audio tasks. We extend this approach to a multimodal audio-video model to enhance social cognition, targeting the superior temporal sulcus (STS), a key region for social processing, using fMRI responses recorded while subjects watched *Friends*. We find significant increases in brain alignment within the STS and an adjacent ROI, as well as improvements on a social cognition task related to the training data: sarcasm detection in sitcoms. In summary, our study extends brain-tuning to the multimodal domain, demonstrating gains on a downstream task after tuning to a relevant functional region.
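The abstract reports "brain alignment" without defining how it is scored. In this literature, alignment is typically measured by fitting a ridge encoding model from frozen model features to voxel responses and scoring held-out predictions per voxel; the snippet below sketches that convention and is an assumption here, not a detail stated in the abstract.

```python
import numpy as np
from sklearn.linear_model import RidgeCV
from scipy.stats import pearsonr

def brain_alignment(features_train, fmri_train, features_test, fmri_test):
    """Fit a ridge encoding model from model features to voxel responses,
    then score alignment as the mean per-voxel Pearson r on held-out data.
    (Standard convention in brain-alignment work; assumed, not confirmed.)"""
    model = RidgeCV(alphas=np.logspace(-2, 5, 8))  # alpha grid is an assumption
    model.fit(features_train, fmri_train)           # multi-target ridge fit
    pred = model.predict(features_test)             # (n_timepoints, n_voxels)
    voxel_r = [pearsonr(pred[:, v], fmri_test[:, v])[0]
               for v in range(fmri_test.shape[1])]
    return float(np.mean(voxel_r))
```

Under this reading, the paper's claim is that features from the brain-tuned model yield higher held-out correlations in the STS (and one adjacent ROI) than features from the untuned model.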