PAPER_TITLE

FIRST_AUTHOR_LAST, FIRST_AUTHOR_FIRST; SECOND_AUTHOR_LAST, SECOND_AUTHOR_FIRST

Beyond Perplexity: Let the Reader Select Retrieval Summaries via Spectrum Projection Score

Zhanghao Hu, Qinglin Zhu, Siya Qi, Yulan He, Hanqi Yan, Lin Gui,

King's College London, The Alan Turing Institute
AAAI 2026 Oral🌟

Huggingface Code arXiv

Overview of the xCompress framework. Retrieved passages are first compressed into summaries. An adaptive norm-guided filtering mechanism determines whether additional test-time sampling is necessary. If required, multiple summaries are sampled from the compressor LLM and evaluated using the Spectrum Projection Score (SPS). These summaries are first embedded via max-pooling, then projected onto the reader’s principal subspace of its parameter. The summary with the lowest SPS is selected as input to the reader; otherwise, the initial summary is used directly for answer generation.

Abstract

Large Language Models (LLMs) have shown improved generation performance through retrieval-augmented generation (RAG) following the retriever-reader paradigm, which supplements model inputs with externally retrieved knowledge. However, prior work often evaluates RAG holistically, assessing the retriever and reader jointly, making it difficult to isolate the true contribution of retrieval, particularly given the prompt sensitivity of LLMs used as readers. We move beyond perplexity and introduce Spectrum Projection Score (SPS), a lightweight, supervision-free metric that allows the reader to gauge the semantic alignment of a retrieved summary with its hidden representation by comparing the area formed by generated tokens from the summary, and the principal directions of subspace in the reader and to measure the relevance. Building on SPS we present xCompress, an inference-time controller framework that dynamically samples, ranks, and compresses retrieval summary candidates. Extensive experiments on five QA benchmarks with four open source LLMs show that SPS not only enhances performance across a range of tasks but also provides a principled perspective on the interaction between retrieval and generation.

Main Experiment

Video and Poster are coming soon!

BibTeX


@article{hu2025spectrum,
  title={Spectrum Projection Score: Aligning Retrieved Summaries with Reader Models in Retrieval-Augmented Generation},
  author={Hu, Zhanghao and Zhu, Qinglin and Qi, Siya and He, Yulan and Yan, Hanqi and Gui, Lin},
  journal={arXiv preprint arXiv:2508.05909},
  year={2025}
}

More Works from Our Lab

Paper Title 1

Paper Title 2

Paper Title 3

Beyond Perplexity: Let the Reader Select Retrieval Summaries via Spectrum Projection Score

Abstract

Main Experiment

BibTeX