BIG5-TPoT: Predicting BIG Five Personality Traits, Facets, and Items Through Targeted Preselection of Texts

📅 2025-11-12

📈 Citations: 0

✨ Influential: 0

career value

203K/year

🤖 AI Summary

Accurately predicting individual Big Five personality traits, facets, and item-level scores from large-scale generated text remains challenging due to LLM input-length constraints and semantic noise. Method: We propose a semantics-guided text preselection framework that filters raw text and extracts contextually relevant segments based on the semantic characteristics of each personality dimension, thereby enhancing alignment between input text and target personality constructs. The approach integrates a deep learning prediction model with fine-grained semantic similarity computation to enable end-to-end, goal-directed text selection. Contribution/Results: Evaluated on a stream-of-consciousness essay dataset, our method reduces mean absolute error by 12.7% and significantly improves prediction accuracy across all five traits, demonstrating both the effectiveness and generalizability of semantics-driven preselection in computational personality assessment.

Technology Category

Application Category

📝 Abstract

Predicting an individual's personalities from their generated texts is a challenging task, especially when the text volume is large. In this paper, we introduce a straightforward yet effective novel strategy called targeted preselection of texts (TPoT). This method semantically filters the texts as input to a deep learning model, specifically designed to predict a Big Five personality trait, facet, or item, referred to as the BIG5-TPoT model. By selecting texts that are semantically relevant to a particular trait, facet, or item, this strategy not only addresses the issue of input text limits in large language models but also improves the Mean Absolute Error and accuracy metrics in predictions for the Stream of Consciousness Essays dataset.

Problem

Research questions and friction points this paper is trying to address.

Predicting personality traits from large text volumes using semantic filtering

Addressing input text limits in language models for personality assessment

Improving prediction accuracy for Big Five traits through targeted text selection

Innovation

Methods, ideas, or system contributions that make the work stand out.

Semantically filters texts for personality prediction

Uses deep learning model with targeted preselection

Improves prediction accuracy by selecting relevant texts

🔎 Similar Papers

No similar papers found.