ARXIV:2604.02319 · LLM ROUTING · SUBMITTED 03 APR · 20:50 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

No Single Best Model for Diversity: Learning a Router for Sample Diversity

Yuhan Liu · Fangyuan Xu · Vishakh Padmakumar · Daphne Ippolito · Eunsol Choi · arXiv

A router that selects the best LLM for each query to maximize response diversity, outperforming single models.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A router that selects the best LLM for each query to maximize response diversity, outperforming single models.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A router that selects the best LLM for each query to maximize response diversity, outperforming single models. In this paper, we study methods to elicit a comprehensive set of valid responses.

METHOD

Full abstract

When posed with prompts that permit a large number of valid answers, comprehensively generating them is the first step towards satisfying a wide range of users. In this paper, we study methods to elicit a comprehensive set of valid responses. To evaluate this, we introduce \textbf{diversity coverage}, a metric that measures the total quality scores assigned to each \textbf{unique} answer in the predicted answer set relative to the best possible answer set with the same number of answers. Using this metric, we evaluate 18 LLMs, finding no single model dominates at generating diverse responses to a wide range of open-ended prompts. Yet, per each prompt, there exists a model that outperforms all other models significantly at generating a diverse answer set. Motivated by this finding, we introduce a router that predicts the best model for each query. On NB-Wildchat, our trained router outperforms the single best model baseline (26.3% vs $23.8%). We further show generalization to an out-of-domain dataset (NB-Curated) as well as different answer-generation prompting strategies. Our work lays foundation for studying generating comprehensive answers when we have access to a suite of models.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. We further show generalization to an out-of-domain dataset (NB-Curated) as well as different answer-generation prompting strategies. Code availability is flagged in the production record;…

WHY NOW

LLM Routing moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA router that selects the best LLM for each query to maximize response diversity, outperforming single models.

Evidence0 refs | 0 sources | 33% coverage

Blockerno shell-level blocker reported

Analysis summary

A router that selects the best LLM for each query to maximize response diversity, outperforming single models.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A router that selects the best LLM for each query to maximize response diversity, outperforming single models.

Segment

LLM Routing

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "2a149747-7bb9-42b6-8b1b-7d5bae5f6560", "arxiv_id": "2604.02319", "canonical_route": "/paper/no-single-best-model-for-diversity-learning-a-router-for-sample-diversity", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "no-single-best-model-for-diversity-learning-a-router-for-sample-diversity", "endpoints": { "paper_pack": "/api/v1/paper/no-single-best-model-for-diversity-learning-a-router-for-sample-diversity/paper-pack", "build_passport": "/api/v1/paper/no-single-best-model-for-diversity-learning-a-router-for-sample-diversity/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "No Single Best Model for Diversity: Learning a Router for Sample Diversity", "normalized_query": "2604.02319", "route": "/paper/no-single-best-model-for-diversity-learning-a-router-for-sample-diversity", "paper_ref": "no-single-best-model-for-diversity-learning-a-router-for-sample-diversity", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/no-single-best-model-for-diversity-learning-a-router-for-sample-diversity#webpage", "url": "https://sciencetostartup.com/paper/no-single-best-model-for-diversity-learning-a-router-for-sample-diversity", "name": "No Single Best Model for Diversity: Learning a Router for Sample Diversity", "description": "A router that selects the best LLM for each query to maximize response diversity, outperforming single models.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/no-single-best-model-for-diversity-learning-a-router-for-sample-diversity#scholarlyArticle", "headline": "No Single Best Model for Diversity: Learning a Router for Sample Diversity", "description": "A router that selects the best LLM for each query to maximize response diversity, outperforming single models.", "url": "https://sciencetostartup.com/paper/no-single-best-model-for-diversity-learning-a-router-for-sample-diversity", "sameAs": "https://arxiv.org/abs/2604.02319", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.02319" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-02T17:58:37.000Z", "author": [ { "@type": "Person", "name": "Yuhan Liu" }, { "@type": "Person", "name": "Fangyuan Xu" }, { "@type": "Person", "name": "Vishakh Padmakumar" }, { "@type": "Person", "name": "Daphne Ippolito" }, { "@type": "Person", "name": "Eunsol Choi" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Routing" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Routing", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "No Single Best Model for Diversity: Learning a Router for Sa", "item": "https://sciencetostartup.com/paper/no-single-best-model-for-diversity-learning-a-router-for-sample-diversity" } ] } ] }

Competitive landscape

A router that selects the best LLM for each query to maximize response diversity, outperforming single models.

Segment

LLM Routing

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

No Single Best Model for Diversity: Learning a Router for Sample Diversity

No Single Best Model for Diversity: Learning a Router for Sample Diversity

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline