PaperRegister: Boosting Flexible-grained Paper Search via Hierarchical Register Indexing

📅 2025-08-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing paper retrieval systems rely on abstract-based indexing, limiting their ability to support precise queries over fine-grained technical details—such as module configurations. To address this, we propose PaperRegister, the first framework introducing a hierarchical registration indexing mechanism. It constructs a multi-granularity index tree spanning topics, methods, components, and parameters via hierarchical text parsing, modular information extraction, and adaptive matching. The framework synergistically integrates offline index construction with online dynamic retrieval, overcoming the granularity bottleneck inherent in conventional abstract-only indexing. Experiments demonstrate that PaperRegister achieves state-of-the-art performance across both coarse- and fine-grained retrieval tasks, significantly outperforming baseline methods—especially for queries targeting technical specifications. The system exhibits strong practicality and scalability, enabling efficient, precise navigation of technical knowledge in scientific literature.

Technology Category

Application Category

📝 Abstract
Paper search is an important activity for researchers, typically involving using a query with description of a topic to find relevant papers. As research deepens, paper search requirements may become more flexible, sometimes involving specific details such as module configuration rather than being limited to coarse-grained topics. However, previous paper search systems are unable to meet these flexible-grained requirements, as these systems mainly collect paper abstracts to construct index of corpus, which lack detailed information to support retrieval by finer-grained queries. In this work, we propose PaperRegister, consisted of offline hierarchical indexing and online adaptive retrieval, transforming traditional abstract-based index into hierarchical index tree for paper search, thereby supporting queries at flexible granularity. Experiments on paper search tasks across a range of granularity demonstrate that PaperRegister achieves the state-of-the-art performance, and particularly excels in fine-grained scenarios, highlighting the good potential as an effective solution for flexible-grained paper search in real-world applications. Code for this work is in https://github.com/Li-Z-Q/PaperRegister.
Problem

Research questions and friction points this paper is trying to address.

Enables flexible-grained paper search via hierarchical indexing
Addresses limitations of abstract-based search systems
Supports queries with specific details and varying granularity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Hierarchical index tree for flexible search
Offline indexing and online adaptive retrieval
Supports queries at flexible granularity
🔎 Similar Papers
No similar papers found.