🤖 AI Summary
This study addresses inherent limitations in current LLM-driven AI agents—namely, poor interpretability, insufficient autonomous regulation, and weak embodied reasoning. To this end, we propose a Content-Centric Computational Cognition (C⁴) model that, for the first time, deeply integrates metacognitive capabilities into the C⁴ framework. Methodologically, we unify neural-symbolic processing, the Language-Empowered Intelligent Agent (LEIA) architecture, and cognitive robotics to enable synergistic modeling of symbolic logic and sub-symbolic learning. The resulting model significantly enhances agents’ perception, interpretation, storage, and dynamic utilization of environmental, self-, and other-related information. It improves self-monitoring, strategic reflection, and cross-domain knowledge transfer in dynamic tasks. Our work establishes a novel theoretical paradigm and provides key technical foundations for developing next-generation AI agents that are trustworthy, autonomous, and interpretable.
📝 Abstract
For AI agents to emulate human behavior, they must be able to perceive, meaningfully interpret, store, and use large amounts of information about the world, themselves, and other agents. Metacognition is a necessary component of all of these processes. In this paper, we briefly a) introduce content-centric computational cognitive (C4) modeling for next-generation AI agents; b) review the long history of developing C4 agents at RPI's LEIA (Language-Endowed Intelligent Agents) Lab; c) discuss our current work on extending LEIAs' cognitive capabilities to cognitive robotic applications developed using a neuro symbolic processing model; and d) sketch plans for future developments in this paradigm that aim to overcome underappreciated limitations of currently popular, LLM-driven methods in AI.