ARXIV:2602.21997 · AI-DRIVEN SOFTWARE TESTING · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Enhancing LLM-Based Test Generation by Eliminating Covered Code

arXiv

Automated test generation tool that enhances code coverage for complex Python projects by eliminating already-covered code parts.

Blocked on Code›Score7.0Evidence unverified

Opportunity summary

Pain Automated test generation tool that enhances code coverage for complex Python projects by eliminating already-covered code parts.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Automated test generation tool that enhances code coverage for complex Python projects by eliminating already-covered code parts. Recent advancements in Large Language Models (LLMs) have shown promise in improving test generation, particularly in achieving…

METHOD

Full abstract

Automated test generation is essential for software quality assurance, with coverage rate serving as a key metric to ensure thorough testing. Recent advancements in Large Language Models (LLMs) have shown promise in improving test generation, particularly in achieving higher coverage. However, while existing LLM-based test generation solutions perform well on small, isolated code snippets, they struggle when applied to complex methods under test. To address these issues, we propose a scalable LLM-based unit test generation method. Our approach consists of two key steps. The first step is context information retrieval, which uses both LLMs and static analysis to gather relevant contextual information associated with the complex methods under test. The second step, iterative test generation with code elimination, repeatedly generates unit tests for the code slice, tracks the achieved coverage, and selectively removes code segments that have already been covered. This process simplifies the testing task and mitigates issues arising from token limits or reduced reasoning effectiveness associated with excessively long contexts. Through comprehensive evaluations on open-source projects, our approach outperforms state-of-the-art LLM-based and search-based methods, demonstrating its effectiveness in achieving high coverage on complex methods.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Through comprehensive evaluations on open-source projects, our approach outperforms state-of-the-art LLM-based and search-based methods, demonstrating its effectiveness in achieving high coverage on complex methods.

WHY NOW

AI-Driven Software Testing moved forward this cycle; last verified April 2026. Public score 7.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainAutomated test generation tool that enhances code coverage for complex Python projects by eliminating already-covered code parts.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

Automated test generation tool that enhances code coverage for complex Python projects by eliminating already-covered code parts.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

Automated test generation tool that enhances code coverage for complex Python projects by eliminating already-covered code parts.

Segment

AI-Driven Software Testing

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

References(17)

Code-Aware Prompting: A Study of Coverage-Guided Test Generation in Regression Setting using LLM

2024Gabriel Ryan, Siddhartha Jain et al.

DeepSeek-Coder: When the Large Language Model Meets Programming - The Rise of Code Intelligence

2024Daya Guo, Qihao Zhu et al.

Enhancing LLM-based Test Generation for Hard-to-Cover Branches via Program Analysis

2024Chen Yang, Junjie Chen et al.

Abstract Syntax Tree for Programming Language Understanding and Representation: How Far Are We?

2023Weisong Sun, Chunrong Fang et al.

ChatUniTest: A Framework for LLM-Based Test Generation

2023Yinghao Chen, Zehao Hu et al.

No More Manual Tests? Evaluating and Improving ChatGPT for Unit Test Generation

2023Zhiqiang Yuan, Yiling Lou et al.

Large Language Models Can Be Easily Distracted by Irrelevant Context

2023Freda Shi, Xinyun Chen et al.

Pynguin: Automated Unit Test Generation for Python

2022Stephan Lukasczyk, G. Fraser

Unit Test Case Generation with Transformers

2020Michele Tufano, Dawn Drain et al.

Development of an Enhanced Automated Software Complexity Measurement System

2020Sanusi B.A

Hypothesis: A new approach to property-based testing

2019D. MacIver, Zac Hatfield-Dodds

State of the art: Dynamic symbolic execution for automated test generation

2013Ting Chen, Xiaosong Zhang et al.

Search-Based Software Testing: Past, Present and Future

2011Phil McMinn

KLEE: Unassisted and Automatic Generation of High-Coverage Tests for Complex Systems Programs

2008Cristian Cadar, Daniel Dunbar et al.

Randoop: feedback-directed random testing for Java

2007Carlos Pacheco, Michael D. Ernst

DART: directed automated random testing

2005Patrice Godefroid, Nils Klarlund et al.

coverage

1972M. Lemay

{ "contract_version": "paper-r2", "paper_id": "3f992fbd-f471-43ab-8cfd-c5d86ad91a61", "arxiv_id": "2602.21997", "canonical_route": "/paper/enhancing-llm-based-test-generation-by-eliminating-covered-code", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "enhancing-llm-based-test-generation-by-eliminating-covered-code", "endpoints": { "paper_pack": "/api/v1/paper/enhancing-llm-based-test-generation-by-eliminating-covered-code/paper-pack", "build_passport": "/api/v1/paper/enhancing-llm-based-test-generation-by-eliminating-covered-code/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Enhancing LLM-Based Test Generation by Eliminating Covered Code", "normalized_query": "2602.21997", "route": "/paper/enhancing-llm-based-test-generation-by-eliminating-covered-code", "paper_ref": "enhancing-llm-based-test-generation-by-eliminating-covered-code", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/enhancing-llm-based-test-generation-by-eliminating-covered-code#webpage", "url": "https://sciencetostartup.com/paper/enhancing-llm-based-test-generation-by-eliminating-covered-code", "name": "Enhancing LLM-Based Test Generation by Eliminating Covered Code", "description": "Automated test generation tool that enhances code coverage for complex Python projects by eliminating already-covered code parts.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/enhancing-llm-based-test-generation-by-eliminating-covered-code#scholarlyArticle", "headline": "Enhancing LLM-Based Test Generation by Eliminating Covered Code", "description": "Automated test generation tool that enhances code coverage for complex Python projects by eliminating already-covered code parts.", "url": "https://sciencetostartup.com/paper/enhancing-llm-based-test-generation-by-eliminating-covered-code", "sameAs": "https://arxiv.org/abs/2602.21997", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2602.21997" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-02-25T15:16:43.000Z", "author": [ { "@type": "Person", "name": "WeiZhe Xu", "affiliation": { "@type": "Organization", "name": "University of Notre Dame" } }, { "@type": "Person", "name": "Mengyu Liu", "affiliation": { "@type": "Organization", "name": "Washington State University" } }, { "@type": "Person", "name": "Fanxin Kong", "affiliation": { "@type": "Organization", "name": "University of Notre Dame" } } ], "citation": [ { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "46db418ac45d17f4381b26daab73e8e3e0728d99" }, "url": "https://www.semanticscholar.org/paper/46db418ac45d17f4381b26daab73e8e3e0728d99" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "1f2a20a6efaf83214861dddae4a38a83ae18fe32" }, "url": "https://www.semanticscholar.org/paper/1f2a20a6efaf83214861dddae4a38a83ae18fe32" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "cc839c6f78cbc305efc846472045a6371986587d" }, "url": "https://www.semanticscholar.org/paper/cc839c6f78cbc305efc846472045a6371986587d" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "8089453770cba202fb352ac4ed1f9cfd99058d69" }, "url": "https://www.semanticscholar.org/paper/8089453770cba202fb352ac4ed1f9cfd99058d69" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "4c552ad10efda5283f7b681f7dd2e532445259fc" }, "url": "https://www.semanticscholar.org/paper/4c552ad10efda5283f7b681f7dd2e532445259fc" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "3d68522abfadfc8ee6b7ec9edaaf91f1b2f38e5e" }, "url": "https://www.semanticscholar.org/paper/3d68522abfadfc8ee6b7ec9edaaf91f1b2f38e5e" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "eadcf2f484b2d326f4d32ba4a897b009e4de1784" }, "url": "https://www.semanticscholar.org/paper/eadcf2f484b2d326f4d32ba4a897b009e4de1784" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "dc3877e3058bc9c854e57770e596acf188d9b57e" }, "url": "https://www.semanticscholar.org/paper/dc3877e3058bc9c854e57770e596acf188d9b57e" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "4fc4898d15441d43e79298945bf2cf0d54ea5c77" }, "url": "https://www.semanticscholar.org/paper/4fc4898d15441d43e79298945bf2cf0d54ea5c77" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "02afb1a4a45cfea0c1aeffca4a441e6541a5d34b" }, "url": "https://www.semanticscholar.org/paper/02afb1a4a45cfea0c1aeffca4a441e6541a5d34b" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "1af20927ec4429b4374c3da4c23aeee713b3e60a" }, "url": "https://www.semanticscholar.org/paper/1af20927ec4429b4374c3da4c23aeee713b3e60a" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "0b93657965e506dfbd56fbc1c1d4b9666b1d01c8" }, "url": "https://www.semanticscholar.org/paper/0b93657965e506dfbd56fbc1c1d4b9666b1d01c8" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "feebec1b65cb45de2abcb787d22cf24acf86f51a" }, "url": "https://www.semanticscholar.org/paper/feebec1b65cb45de2abcb787d22cf24acf86f51a" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "3f6a6707d0e2a818a6a4359b253c32603abe3651" }, "url": "https://www.semanticscholar.org/paper/3f6a6707d0e2a818a6a4359b253c32603abe3651" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "467b322bb070288f4cec186bb8a0574e4634db66" }, "url": "https://www.semanticscholar.org/paper/467b322bb070288f4cec186bb8a0574e4634db66" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "8a2c54733489cf9d8408eb61679f762fcd77c619" }, "url": "https://www.semanticscholar.org/paper/8a2c54733489cf9d8408eb61679f762fcd77c619" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "dac109f8cbdb889ada41b382fd30075c1bf956e1" }, "url": "https://www.semanticscholar.org/paper/dac109f8cbdb889ada41b382fd30075c1bf956e1" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "AI-Driven Software Testing" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "AI-Driven Software Testing", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Enhancing LLM-Based Test Generation by Eliminating Covered C", "item": "https://sciencetostartup.com/paper/enhancing-llm-based-test-generation-by-eliminating-covered-code" } ] }, { "@type": "FAQPage", "mainEntity": [ { "@type": "Question", "name": "What is the startup potential of \"Enhancing LLM-Based Test Generation by Eliminating Covered C\"?", "acceptedAnswer": { "@type": "Answer", "text": "Automated test generation tool that enhances code coverage for complex Python projects by eliminating already-covered code parts." } }, { "@type": "Question", "name": "What products could be built from this research?", "acceptedAnswer": { "@type": "Answer", "text": "Create a SaaS platform that integrates with continuous integration and deployment (CI/CD) tools, providing real-time test generation and coverage analysis for developers working on complex Python codebases." } }, { "@type": "Question", "name": "What are the practical use cases?", "acceptedAnswer": { "@type": "Answer", "text": "Implement a software testing tool that integrates into existing development pipelines to automatically generate comprehensive unit tests for Python projects, particularly targeting complex methods." } }, { "@type": "Question", "name": "What industries could this research disrupt?", "acceptedAnswer": { "@type": "Answer", "text": "This approach could replace existing automated testing tools in software development environments by offering more effective coverage for complex codebases, reducing the overhead of writing manual tests." } } ] } ] }

Competitive landscape

Automated test generation tool that enhances code coverage for complex Python projects by eliminating already-covered code parts.

Segment

AI-Driven Software Testing

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

References(17)

Code-Aware Prompting: A Study of Coverage-Guided Test Generation in Regression Setting using LLM

2024Gabriel Ryan, Siddhartha Jain et al.

DeepSeek-Coder: When the Large Language Model Meets Programming - The Rise of Code Intelligence

2024Daya Guo, Qihao Zhu et al.

Enhancing LLM-based Test Generation for Hard-to-Cover Branches via Program Analysis

2024Chen Yang, Junjie Chen et al.

Abstract Syntax Tree for Programming Language Understanding and Representation: How Far Are We?

2023Weisong Sun, Chunrong Fang et al.

ChatUniTest: A Framework for LLM-Based Test Generation

2023Yinghao Chen, Zehao Hu et al.

No More Manual Tests? Evaluating and Improving ChatGPT for Unit Test Generation

2023Zhiqiang Yuan, Yiling Lou et al.

Large Language Models Can Be Easily Distracted by Irrelevant Context

2023Freda Shi, Xinyun Chen et al.

Pynguin: Automated Unit Test Generation for Python

2022Stephan Lukasczyk, G. Fraser

Unit Test Case Generation with Transformers

2020Michele Tufano, Dawn Drain et al.

Development of an Enhanced Automated Software Complexity Measurement System

2020Sanusi B.A

Hypothesis: A new approach to property-based testing

2019D. MacIver, Zac Hatfield-Dodds

State of the art: Dynamic symbolic execution for automated test generation

2013Ting Chen, Xiaosong Zhang et al.

Search-Based Software Testing: Past, Present and Future

2011Phil McMinn

KLEE: Unassisted and Automatic Generation of High-Coverage Tests for Complex Systems Programs

2008Cristian Cadar, Daniel Dunbar et al.

Randoop: feedback-directed random testing for Java

2007Carlos Pacheco, Michael D. Ernst

DART: directed automated random testing

2005Patrice Godefroid, Nils Klarlund et al.

coverage

1972M. Lemay

Enhancing LLM-Based Test Generation by Eliminating Covered Code

Enhancing LLM-Based Test Generation by Eliminating Covered Code

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(17)

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(17)

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline