π€ AI Summary
Contemporary RAG-based question-answering systems suffer from insufficient global semantic understanding and generation outputs misaligned with human ethical norms and quality preferences. To address these challenges, we propose a graph-enhanced hierarchical QA framework. First, it constructs a hierarchical document graph to explicitly model cross-document semantic associations, enabling holistic contextual understanding. Second, it introduces a pattern-seeking preference optimization mechanism that aligns model outputs with human preferences via probabilistic matching constraints, emulating cognition-driven information synthesis. The method integrates retrieval-augmented generation, graph-structured modeling, semantic similarity measurement, and preference-aware optimization. Extensive experiments across six benchmark datasets demonstrate significant improvements in answer accuracy, answer consistency, and human evaluation scores. These results validate the frameworkβs dual advantages: deeper knowledge integration and enhanced generation controllability.
π Abstract
Recent advancements in retrieval-augmented generation (RAG) have enhanced large language models in question answering by integrating external knowledge. However, challenges persist in achieving global understanding and aligning responses with human ethical and quality preferences. To address these issues, we propose GraphMPA, a comprehensive graph-based framework with mode-seeking preference alignment. Our approach constructs a hierarchical document graph using a general similarity measurement, mimicking human cognitive processes for information understanding and synthesis. Additionally, we introduce mode-seeking preference optimization to better align model outputs with human preferences through probability-matching constraints. Extensive experiments on six datasets demonstrate the effectiveness of our href{https://github.com/tangquanwei/GraphMPA}{GraphMPA}.