AI Summary
This work addresses the gap between cognitive modeling and language modeling by organizing the fourth BabyLM Challenge and Workshop, which focuses on data-efficient pretraining for language models that are both cognitively plausible and computationally efficient. The challenge introduces its first multilingual track, extending BabyLM to cross-linguistic settings, and the workshop incorporates related research directions such as cognitively inspired architectures, weak-model evaluation, and training-efficiency optimization. By encouraging low-resource, interpretable models grounded in human-like learning mechanisms, the initiative strengthens the interdisciplinary connection between cognitive science and artificial intelligence and helps build a research community around cognitively grounded language models.
Abstract
BabyLM aims to dissolve the boundaries between cognitive modeling and language modeling. We call both for workshop papers and for researchers to join the 4th BabyLM competition. As in previous years, we invite participants to take on the data-efficient pretraining challenge in the general track. This year, we also offer a new track: Multilingual.
We also welcome papers outside the competition on any relevant topic, including training efficiency, cognitively plausible research, weak-model evaluation, and more.