Standards for Belief Representations in LLMs

📅 2024-05-31
🏛️ Minds Mach.
📈 Citations: 5
✨ Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the lack of a unified theoretical foundation for belief representation in large language models (LLMs), which leads to inconsistency and opacity in their internal belief states. We propose the first formalized standard framework for belief representation, defining three core properties: semantic consistency, dynamic updatability, and causal traceability. Methodologically, we integrate doxastic logic (DoX), neuro-symbolic interfaces, inter-layer attention attribution, and counterfactual belief editing to enable verifiable modeling and controllable intervention of LLMs’ latent belief states. Evaluated on BELIEF-BENCH, our approach improves belief consistency accuracy by 32.7% and supports fine-grained belief injection and withdrawal. The framework establishes the first auditable, cross-task generalizable cognitive infrastructure for trustworthy AI, effectively breaking the “belief black box” limitation of LLMs.
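The summary describes the method only at a high level. As a rough, non-authoritative sketch of what a belief-consistency measurement could look like in practice, the snippet below fits a linear probe on hidden states and scores agreement across paraphrase pairs. Everything in it, including the function names `train_belief_probe` and `belief_consistency`, the least-squares probe, and the 0.5 threshold, is our illustrative assumption, not the paper's implementation or the BELIEF-BENCH protocol.

```python
import numpy as np

# Illustrative sketch only: a linear "belief probe" over hidden states and a
# paraphrase-consistency score. Names, shapes, and thresholds are assumptions,
# not the paper's method.

def train_belief_probe(hidden_states, labels):
    """Least-squares fit of a direction w so that h @ w approximates the label."""
    X = np.asarray(hidden_states, dtype=float)  # (n_statements, hidden_dim)
    y = np.asarray(labels, dtype=float)         # 1.0 = believed true, 0.0 = believed false
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    return w                                    # (hidden_dim,)

def belief_consistency(states_a, states_b, w, threshold=0.5):
    """Fraction of statement/paraphrase pairs the probe scores the same way."""
    beliefs_a = (np.asarray(states_a, dtype=float) @ w) > threshold
    beliefs_b = (np.asarray(states_b, dtype=float) @ w) > threshold
    return float(np.mean(beliefs_a == beliefs_b))

# Tiny synthetic demo: lightly perturbed copies of the same states should score high.
rng = np.random.default_rng(0)
H = rng.normal(size=(100, 16))
w = train_belief_probe(H, (H[:, 0] > 0).astype(float))
print(belief_consistency(H, H + 0.01 * rng.normal(size=H.shape), w))
```

A real evaluation would replace the synthetic arrays with hidden states extracted from the model on matched statement/paraphrase pairs.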

Problem

Research questions and friction points this paper is trying to address.

Lack of a unified theory of belief representation in LLMs.
Need for criteria that specify what counts as a belief-like representation in an LLM.
Traditional methods for attributing and measuring beliefs do not transfer straightforwardly to LLMs.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Proposes adequacy conditions for belief-like representations in LLMs.
Establishes four criteria: accuracy, coherence, uniformity, and use (sketched below).
Integrates philosophy and machine learning to ground belief measurement.
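The contribution here is conceptual rather than algorithmic, so the following is only a minimal sketch of how the four criteria might be wired into an evaluation harness. The class name, the [0, 1] scores, and the 0.8 threshold are all hypothetical choices of ours, not anything the authors prescribe.

```python
from dataclasses import dataclass

@dataclass
class AdequacyReport:
    """Hypothetical scorecard for the four adequacy conditions (scores in [0, 1])."""
    accuracy: float    # decoded beliefs track the truth of probed statements
    coherence: float   # decoded beliefs respect logical relations between statements
    uniformity: float  # one decoding scheme works across prompts, tasks, and layers
    use: float         # decoded beliefs help predict the model's downstream behavior

    def passes(self, threshold: float = 0.8) -> bool:
        """Treat a representation as belief-like only if every condition clears the bar."""
        return min(self.accuracy, self.coherence, self.uniformity, self.use) >= threshold

report = AdequacyReport(accuracy=0.91, coherence=0.84, uniformity=0.77, use=0.88)
print(report.passes())  # False: uniformity falls below the illustrative 0.8 bar
```

Taking the minimum rather than an average reflects a conjunctive reading of the criteria: a representation that fails any one condition should not count as belief-like.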
Daniel A. Herrmann
University of Groningen
B. A. Levinstein
University of Illinois at Urbana-Champaign