Line of Duty: Evaluating LLM Self-Knowledge via Consistency in Feasibility Boundaries

📅 2025-03-14

📈 Citations: 0

✨ Influential: 0

career value

175K/year

🤖 AI Summary

This work addresses the weak self-awareness of large language models (LLMs), particularly their unreliable judgment in identifying unanswerable questions. We propose the first self-knowledge assessment paradigm grounded in **self-determined feasibility boundary consistency**, overcoming the limitations of manually predefined unanswerability criteria. Methodologically, we introduce boundary-adaptive prompting to enable models to autonomously calibrate their capability boundaries, coupled with multi-dimensional consistency metrics, task-sensitive confidence balancing, and reproducible self-questioning modeling. Experimental results reveal that state-of-the-art models such as GPT-4o correctly identify their own capability boundaries only ~20% of the time; contextual understanding deficits are the primary cause of boundary misalignment. Our analysis systematically identifies two core weaknesses: temporal awareness and contextual comprehension. All code and evaluation datasets are publicly released.

Technology Category

Application Category

📝 Abstract

As LLMs grow more powerful, their most profound achievement may be recognising when to say"I don't know". Existing studies on LLM self-knowledge have been largely constrained by human-defined notions of feasibility, often neglecting the reasons behind unanswerability by LLMs and failing to study deficient types of self-knowledge. This study aims to obtain intrinsic insights into different types of LLM self-knowledge with a novel methodology: allowing them the flexibility to set their own feasibility boundaries and then analysing the consistency of these limits. We find that even frontier models like GPT-4o and Mistral Large are not sure of their own capabilities more than 80% of the time, highlighting a significant lack of trustworthiness in responses. Our analysis of confidence balance in LLMs indicates that models swing between overconfidence and conservatism in feasibility boundaries depending on task categories and that the most significant self-knowledge weaknesses lie in temporal awareness and contextual understanding. These difficulties in contextual comprehension additionally lead models to question their operational boundaries, resulting in considerable confusion within the self-knowledge of LLMs. We make our code and results available publicly at https://github.com/knowledge-verse-ai/LLM-Self_Knowledge_Eval

Problem

Research questions and friction points this paper is trying to address.

Evaluating LLM self-knowledge via feasibility boundaries.

Analyzing consistency in LLM self-set feasibility limits.

Identifying weaknesses in temporal and contextual understanding.

Innovation

Methods, ideas, or system contributions that make the work stand out.

Flexible self-set feasibility boundaries for LLMs

Analysis of consistency in LLM self-knowledge limits

Evaluation of confidence balance in LLM responses

🔎 Similar Papers

No similar papers found.