Multi-Stage Evolutionary Model Merging with Meta Data Driven Curriculum Learning for Sentiment-Specialized Large Language Modeling

๐Ÿ“… 2024-12-13
๐Ÿ›๏ธ Decision Support Systems
๐Ÿ“ˆ Citations: 1
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This work addresses the challenge that existing large language models struggle to balance accuracy and generalization in multi-task sentiment analysis. To this end, the authors propose a novel framework that integrates instruction tuning, an evolutionary algorithm-driven multi-stage model ensemble, and metadata-guided curriculum learning. This approach represents the first effort to combine task metadataโ€“driven curriculum learning with evolutionary model ensembling, enabling efficient customization for sentiment analysis tasks. Experimental results demonstrate that the proposed method significantly outperforms baseline large language models across multiple sentiment analysis subtasks, achieving notable improvements in both prediction accuracy and cross-task generalization performance.

Technology Category

Application Category

๐Ÿ“ Abstract
The emergence of large language models (LLMs) has significantly transformed natural language processing (NLP), enabling more generalized models to perform various tasks with minimal training. However, traditional sentiment analysis methods, which focus on individual tasks such as sentiment classification or aspect-based analysis, are not practical for real-world applications that usually require handling multiple tasks. While offering flexibility, LLMs in sentiment-specific tasks often fall short of the required accuracy. Techniques like fine-tuning and evolutionary model merging help integrate models into a unified framework, which can improve the learning performance while reducing computational costs. The use of task meta-data and curriculum learning to optimize learning processes remains underexplored, while sentiment analysis is a critical task in NLP that requires high accuracy and scalability across multiple subtasks. In this study, we propose a hybrid learning model called Multi-stage Evolutionary Model Merging with Meta data driven Curriculum Learning (MEM-MCL), to enhance the sentiment analysis in large language modeling. In particular, expert models are created through instruction tuning for specific sentiment tasks and then merged using evolutionary algorithms to form a unified model. The merging process is optimized with weak data to enhance performance across tasks. The curriculum learning is incorporated to provide a learning sequence based on task difficulty, improving knowledge extraction from LLMs. Experiment results demonstrate that the proposed MEM-MCL model outperforms conventional LLMs in a majority of sentiment analysis tasks, achieving superior results across various subtasks.
Problem

Research questions and friction points this paper is trying to address.

sentiment analysis
large language models
multi-task learning
model accuracy
scalability
Innovation

Methods, ideas, or system contributions that make the work stand out.

Evolutionary Model Merging
Curriculum Learning
Meta-data-driven
Sentiment-specialized LLM
Instruction Tuning
๐Ÿ”Ž Similar Papers
No similar papers found.