Skip to content
Sign in

What is Reranking?

Reranking is the step of reordering an initial set of retrieved results or candidates using a more accurate but often slower model to improve relevance.

In data and retrieval pipelines, a fast first-stage retriever quickly returns a large pool of candidates. A second-stage reranker then scores and reorders only the top candidates with richer features or a heavier model.

This two-stage design balances speed and quality: the retriever handles scale while the reranker focuses compute on promising items, often using cross-encoders, gradient-boosted trees, or learned ranking models.

Reranking can incorporate user context, freshness signals, or business rules that were too expensive to apply during the initial retrieval.

Example

A search engine first uses BM25 to fetch the top 1,000 documents for a query, then applies a neural reranker to promote the 10 most relevant ones to the top of the result page.

Why it matters

Modern AI systems rely on reranking to deliver higher-quality search and recommendation results at interactive speeds, directly improving user satisfaction and engagement.

Frequently asked questions

Initial ranking (or retrieval) quickly finds many candidates; reranking reorders a smaller subset with a more sophisticated model for better precision.