Factored Reasoning with Inner Speech and Persistent Memory for Evidence-Grounded Human-Robot Interaction

📅 2026-01-31
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work proposes JANUS, a cognitive architecture designed to address key challenges in conversational human–agent interaction—namely, difficulties in maintaining context, interpreting ambiguous requests, and ensuring verifiable responses. JANUS models interaction as a partially observable Markov decision process and integrates an inner speech mechanism with a persistent memory system through a modular controller. It uniquely combines inner speech, hierarchical memory, and explicit policy control to enable judgments of informational sufficiency, readiness for execution, and tool grounding. Semantic retrieval from memory and evidence-bundle constraints ensure responses are both faithful and auditable. Evaluated in a knowledge graph–driven dietary assistance scenario, module-level testing demonstrates high reference consistency and practical latency, validating the efficacy of hierarchical reasoning in long-term collaborative tasks.

Technology Category

Application Category

📝 Abstract
Dialogue-based human-robot interaction requires robot cognitive assistants to maintain persistent user context, recover from underspecified requests, and ground responses in external evidence, while keeping intermediate decisions verifiable. In this paper we introduce JANUS, a cognitive architecture for assistive robots that models interaction as a partially observable Markov decision process and realizes control as a factored controller with typed interfaces. To this aim, Janus (i) decomposes the overall behavior into specialized modules, related to scope detection, intent recognition, memory, inner speech, query generation, and outer speech, and (ii) exposes explicit policies for information sufficiency, execution readiness, and tool grounding. A dedicated memory agent maintains a bounded recent-history buffer, a compact core memory, and an archival store with semantic retrieval, coupled through controlled consolidation and revision policies. Models inspired by the notion of inner speech in cognitive theories provide a control-oriented internal textual flow that validates parameter completeness and triggers clarification before grounding, while a faithfulness constraint ties robot-to-human claims to an evidence bundle combining working context and retrieved tool outputs. We evaluate JANUS through module-level unit tests in a dietary assistance domain grounded on a knowledge graph, reporting high agreement with curated references and practical latency profiles. These results support factored reasoning as a promising path to scalable, auditable, and evidence-grounded robot assistance over extended interaction horizons.
Problem

Research questions and friction points this paper is trying to address.

human-robot interaction
persistent memory
evidence grounding
underspecified requests
verifiable reasoning
Innovation

Methods, ideas, or system contributions that make the work stand out.

Factored Reasoning
Inner Speech
Persistent Memory
Evidence Grounding
Cognitive Architecture
🔎 Similar Papers
No similar papers found.
V
Valerio Belcamino
Department of Engineering, University of Palermo, Viale delle Scienze, Bldg. 7, Palermo, 90128, Italy
M
Mariya Kilina
Department of Engineering, University of Palermo, Viale delle Scienze, Bldg. 7, Palermo, 90128, Italy
Alessandro Carfì
Alessandro Carfì
University of Genoa
roboticshuman robot interactionmachine learning
V
Valeria Seidita
Department of Engineering, University of Palermo, Viale delle Scienze, Bldg. 7, Palermo, 90128, Italy
Fulvio Mastrogiovanni
Fulvio Mastrogiovanni
University of Genoa, Istituto Italiano di Tecnologia
Cognitive SystemsCognitive RoboticsEmbodied CognitionEmbodied AIPhysical AI
Antonio Chella
Antonio Chella
Professor of Robotics, University of Palermo, Italy
Machine consciousnessRoboticsComputer VisionArtificial Intelligence