BIG5-TPoT: Predicting BIG Five Personality Traits, Facets, and Items Through Targeted Preselection of Texts

📅 2025-11-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Accurately predicting individual Big Five personality traits, facets, and item-level scores from large-scale generated text remains challenging due to LLM input-length constraints and semantic noise. Method: We propose a semantics-guided text preselection framework that filters raw text and extracts contextually relevant segments based on the semantic characteristics of each personality dimension, thereby enhancing alignment between input text and target personality constructs. The approach integrates a deep learning prediction model with fine-grained semantic similarity computation to enable end-to-end, goal-directed text selection. Contribution/Results: Evaluated on a stream-of-consciousness essay dataset, our method reduces mean absolute error by 12.7% and significantly improves prediction accuracy across all five traits, demonstrating both the effectiveness and generalizability of semantics-driven preselection in computational personality assessment.

Technology Category

Application Category

📝 Abstract
Predicting an individual's personalities from their generated texts is a challenging task, especially when the text volume is large. In this paper, we introduce a straightforward yet effective novel strategy called targeted preselection of texts (TPoT). This method semantically filters the texts as input to a deep learning model, specifically designed to predict a Big Five personality trait, facet, or item, referred to as the BIG5-TPoT model. By selecting texts that are semantically relevant to a particular trait, facet, or item, this strategy not only addresses the issue of input text limits in large language models but also improves the Mean Absolute Error and accuracy metrics in predictions for the Stream of Consciousness Essays dataset.
Problem

Research questions and friction points this paper is trying to address.

Predicting personality traits from large text volumes using semantic filtering
Addressing input text limits in language models for personality assessment
Improving prediction accuracy for Big Five traits through targeted text selection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Semantically filters texts for personality prediction
Uses deep learning model with targeted preselection
Improves prediction accuracy by selecting relevant texts
🔎 Similar Papers
No similar papers found.
T
Triet M. Le
The University of Maryland Applied Research Laboratory for Intelligence and Security (ARLIS)
Arjun Chandra
Arjun Chandra
Founder and CEO at brua.io
Natural ComputationMachine LearningSelf-aware ComputingEmergenceMechanism Design
C
C. Anton Rytting
The University of Maryland Applied Research Laboratory for Intelligence and Security (ARLIS)
V
Valerie Karuzis
The University of Maryland Applied Research Laboratory for Intelligence and Security (ARLIS)
V
Vladimir Rife
The University of Maryland Applied Research Laboratory for Intelligence and Security (ARLIS)
W
William A. Simpson
The University of Maryland Applied Research Laboratory for Intelligence and Security (ARLIS)