DevRev Search is a passage retrieval benchmark designed for technical customer support, introduced to address two challenges in large-scale multi-tenant retrieval systems: the 'dark data' problem (vast user query logs with almost no curated relevance labels) and the operational cost of model updates, in particular the re-indexing required when query and document encoders are fine-tuned jointly. The benchmark is constructed by a fully automatic pipeline that uses fusion-based candidate generation to pool results from diverse sparse and dense retrievers, then applies an LLM-as-a-Judge for consistency filtering and relevance assignment. DevRev Search also proposes an 'Index-Preserving Adaptation' strategy in which only the query encoder is fine-tuned, via Low-Rank Adaptation (LoRA), so retrieval performance improves while the document index stays frozen and never needs re-encoding. The benchmark and its associated strategies are relevant to researchers and ML engineers working on efficient domain adaptation, information retrieval, and cost-effective model deployment in multi-tenant environments.
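The source does not specify which fusion method the candidate-generation pipeline uses; a common choice for pooling ranked lists from sparse and dense retrievers is reciprocal rank fusion (RRF). The sketch below is a minimal illustration of that idea, not the benchmark's actual pipeline; the function name, the `k=60` smoothing constant, and the toy document IDs are all assumptions.

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60):
    """Pool ranked candidate lists from multiple retrievers.

    rankings: list of ranked doc-id lists, one per retriever.
    k: smoothing constant; 60 is the conventional default for RRF.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # Each retriever contributes 1/(k + rank) for every doc it returns.
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first.
    return sorted(scores, key=scores.get, reverse=True)

sparse = ["d3", "d1", "d7"]   # hypothetical BM25 output
dense = ["d1", "d9", "d3"]    # hypothetical embedding-retriever output
pooled = reciprocal_rank_fusion([sparse, dense])
```

Documents ranked highly by several retrievers (here `d1` and `d3`) rise to the top of the pooled list, which is what makes fusion a reasonable way to assemble candidates before an LLM judge filters them.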
DevRev Search is a new benchmark for AI systems that help with technical customer support. It's designed to overcome common issues like not having enough labeled data and the high cost of updating AI models in systems used by many different companies. It uses smart techniques like AI judges and efficient model tuning to create a useful dataset and adaptation strategy.
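The "efficient model tuning" described above can be pictured with a small numpy sketch of index-preserving adaptation: the document index and the base query projection are frozen, and only a low-rank delta on the query side is trainable. All names and dimensions here are illustrative assumptions, not the benchmark's actual implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
dim, rank, n_docs = 8, 2, 5

# Frozen: the base query projection and the pre-built document index.
W = rng.normal(size=(dim, dim))
doc_index = rng.normal(size=(n_docs, dim))  # never re-encoded

# Trainable LoRA factors; one factor starts at zero so the
# low-rank delta A @ B is initially the zero matrix.
A = np.zeros((dim, rank))
B = rng.normal(size=(rank, dim))

def encode_query(q):
    # Effective weight is W + A @ B; at initialization the adapted
    # encoder behaves exactly like the frozen base encoder.
    return q @ (W + A @ B)

q = rng.normal(size=dim)
scores = doc_index @ encode_query(q)  # retrieval against the frozen index
```

Because only `A` and `B` are updated during fine-tuning, the document embeddings in `doc_index` never change, which is what removes the re-indexing cost the definition refers to.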