🤖 AI Summary
Traditional sentiment analysis methods struggle with metaphorical expressions, stylistic ambiguity, and cross-lingual/cross-temporal affective variation in literary texts, while also exhibiting limited performance on low-resource languages; moreover, discrete label outputs from Transformer-based models hinder fine-grained affective modeling. This paper proposes a continuous sentiment scoring method based on concept vector projection—the first application of this technique to multilingual literary sentiment analysis—yielding real-valued sentiment scores rather than categorical labels. The model jointly optimizes multilingual literary semantic representations with human annotation distributions. Evaluated on English and Danish literary corpora, it significantly outperforms existing tools, achieving high alignment with human judgments (Pearson’s *r* > 0.92). The resulting continuous scores enable robust fine-grained modeling of sentiment arcs across narrative time.
📝 Abstract
Sentiment Analysis is widely used to quantify sentiment in text, but its application to literary texts poses unique challenges due to figurative language, stylistic ambiguity, as well as sentiment evocation strategies. Traditional dictionary-based tools often underperform, especially for low-resource languages, and transformer models, while promising, typically output coarse categorical labels that limit fine-grained analysis. We introduce a novel continuous sentiment scoring method based on concept vector projection, trained on multilingual literary data, which more effectively captures nuanced sentiment expressions across genres, languages, and historical periods. Our approach outperforms existing tools on English and Danish texts, producing sentiment scores whose distribution closely matches human ratings, enabling more accurate analysis and sentiment arc modeling in literature.