ARXIV:2603.28052 · LLM HARNESS OPTIMIZATION · SUBMITTED 31 MAR · 20:20 UTC · FRESHNESS STALE

Verified Source: PDF linked · Verified PaperPack: citation fields available · Partial Proof: unverified proof status

Meta-Harness: End-to-End Optimization of Model Harnesses

Yoonho Lee · Roshen Nair · Qizheng Zhang · Kangwook Lee · Omar Khattab · Chelsea Finn · arXiv

Automate the creation of LLM harnesses to significantly improve performance and reduce context token usage.

Blocked on Code · Score 7.0 · Evidence unverified

Opportunity summary

Pain Automate the creation of LLM harnesses to significantly improve performance and reduce context token usage.

Evidence 76 refs | 4 sources | 50% coverage

Blocker Evidence unverified

PROBLEM

The performance of LLM systems depends not only on model weights but also on their harness. Yet harnesses are still designed largely by hand, and existing text optimizers are poorly matched to this setting because…

METHOD

Full abstract

The performance of large language model (LLM) systems depends not only on model weights, but also on their harness: the code that determines what information to store, retrieve, and present to the model. Yet harnesses are still designed largely by hand, and existing text optimizers are poorly matched to this setting because they compress feedback too aggressively. We introduce Meta-Harness, an outer-loop system that searches over harness code for LLM applications. It uses an agentic proposer that accesses the source code, scores, and execution traces of all prior candidates through a filesystem. On online text classification, Meta-Harness improves over a state-of-the-art context management system by 7.7 points while using 4x fewer context tokens. On retrieval-augmented math reasoning, a single discovered harness improves accuracy on 200 IMO-level problems by 4.7 points on average across five held-out models. On agentic coding, discovered harnesses surpass the best hand-engineered baselines on TerminalBench-2. Together, these results show that richer access to prior experience can enable automated harness engineering.
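The outer loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the agentic proposer is replaced here by a hand-written rule, the task is a toy "is the needed fact in the context" check, and every name (`TEMPLATE`, `evaluate`, `save_candidate`, `load_history`, `propose`, `search`) is hypothetical. The only idea carried over from the source is that each candidate's source code, score, and execution trace are written to a filesystem, and the proposer reads all of them back uncompressed before suggesting the next harness.

```python
import json
import tempfile
from pathlib import Path

# Hypothetical harness family: present the last k memory items to the model.
TEMPLATE = "def build_context(memory):\n    return memory[-{k}:]\n"


def evaluate(source, examples):
    """Run a harness on toy examples; return (accuracy, trace lines)."""
    namespace = {}
    exec(source, namespace)  # toy only: never exec untrusted code
    hits, trace = 0, []
    for memory, needed in examples:
        context = namespace["build_context"](memory)
        if needed in context:  # stand-in for "the model answers correctly"
            hits += 1
            trace.append("hit")
        else:
            # Record how far back the needed item was, counted from the end.
            depth = len(memory) - memory.index(needed)
            trace.append(f"miss depth={depth}")
    return hits / len(examples), trace


def save_candidate(root, idx, source, score, trace):
    """Persist one candidate's source, score, and trace in its own directory."""
    cand = root / f"cand_{idx:03d}"
    cand.mkdir()
    (cand / "harness.py").write_text(source)
    (cand / "result.json").write_text(json.dumps({"score": score, "trace": trace}))


def load_history(root):
    """Read every prior candidate back from disk, in creation order."""
    history = []
    for cand in sorted(root.glob("cand_*")):
        result = json.loads((cand / "result.json").read_text())
        result["source"] = (cand / "harness.py").read_text()
        history.append(result)
    return history


def propose(root):
    """Stub proposer (assumption: a fixed rule, not an LLM agent).

    Looks at the best-scoring candidate's trace and widens the context
    window just enough to cover its deepest miss. Returns None when the
    best candidate has no misses left.
    """
    best = max(load_history(root), key=lambda c: c["score"])
    depths = [int(line.split("=")[1]) for line in best["trace"] if line.startswith("miss")]
    return TEMPLATE.format(k=max(depths)) if depths else None


def search(examples, max_iters=5):
    """Outer loop: evaluate, persist, propose, repeat."""
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        source = TEMPLATE.format(k=1)
        for idx in range(max_iters):
            score, trace = evaluate(source, examples)
            save_candidate(root, idx, source, score, trace)
            source = propose(root)
            if source is None:
                break
        return load_history(root)


memory = [f"f{i}" for i in range(10)]
examples = [(memory, "f9"), (memory, "f7"), (memory, "f4")]
history = search(examples)
print([round(c["score"], 2) for c in history])
```

The point of the design is that `propose` sees raw traces rather than a single scalar or a compressed summary, which is the contrast the abstract draws with existing text optimizers. In a real system the rule inside `propose` would be an agent that can browse the candidate directories freely.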

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. On online text classification, Meta-Harness improves over a state-of-the-art context management system by 7.7 points while using 4x fewer context tokens.

WHY NOW

LLM Harness Optimization moved forward this cycle; last verified April 2026. Public score 7.0/10.

Opportunity summary

Score 7.0

Pain Automate the creation of LLM harnesses to significantly improve performance and reduce context token usage.

Evidence 76 refs | 4 sources | 50% coverage

Blocker no shell-level blocker reported

Analysis summary

Automate the creation of LLM harnesses to significantly improve performance and reduce context token usage.

Competitive landscape

Automate the creation of LLM harnesses to significantly improve performance and reduce context token usage.

Segment

LLM Harness Optimization

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified
