ARXIV:2603.26664 · AI-ASSISTED CODING · SUBMITTED 31 MAR · 20:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Learning to Commit: Generating Organic Pull Requests via Online Repository Memory

Mo Li · L. H. Xu · Qitai Tan · Ting Cao · Yunxin Liu · arXiv

A tool that generates organic pull requests by learning from the history of a software repository.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A tool that generates organic pull requests by learning from the history of a software repository.

Evidence 33 refs | 3 sources | 67% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A tool that generates organic pull requests by learning from the history of a software repository. The root cause is not functional incorrectness but a lack of organicity: generated code ignores project-specific conventions, duplicates…

METHOD

Full abstract

Large language model (LLM)-based coding agents achieve impressive results on controlled benchmarks yet routinely produce pull requests that real maintainers reject. The root cause is not functional incorrectness but a lack of organicity: generated code ignores project-specific conventions, duplicates functionality already provided by internal APIs, and violates implicit architectural constraints accumulated over years of development. Simply exposing an agent to the latest repository snapshot is not enough: the snapshot reveals the final state of the codebase, but not the repository-specific change patterns by which that state was reached. We introduce Learning to Commit, a framework that closes this gap through Online Repository Memory. Given a repository with a strict chronological split, the agent performs supervised contrastive reflection on earlier commits: it blindly attempts to resolve each historical issue, compares its prediction against the oracle diff, and distils the gap into a continuously growing set of skills-reusable patterns capturing coding style, internal API usage, and architectural invariants. When a new PR description arrives, the agent conditions its generation on these accumulated skills, producing changes grounded in the project's own evolution rather than generic pretraining priors. Evaluation is conducted on genuinely future, merged pull requests that could not have been seen during the skill-building phase, and spans multiple dimensions including functional correctness, code-style consistency, internal API reuse rate, and modified-region plausibility. Experiments on an expert-maintained repository with rich commit history show that Online Repository Memory effectively improves organicity scores on held-out future tasks.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Large language model (LLM)-based coding agents achieve impressive results on controlled benchmarks yet routinely produce pull requests that real maintainers reject. Code availability is…

WHY NOW

AI-assisted Coding moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA tool that generates organic pull requests by learning from the history of a software repository.

Evidence33 refs | 3 sources | 67% coverage

Blockerno shell-level blocker reported

Analysis summary

A tool that generates organic pull requests by learning from the history of a software repository.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A tool that generates organic pull requests by learning from the history of a software repository.

Segment

AI-assisted Coding

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "07dfb54c-f694-4249-bd13-55d7e2cb332c", "arxiv_id": "2603.26664", "canonical_route": "/paper/learning-to-commit-generating-organic-pull-requests-via-online-repository-memory", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "learning-to-commit-generating-organic-pull-requests-via-online-repository-memory", "endpoints": { "paper_pack": "/api/v1/paper/learning-to-commit-generating-organic-pull-requests-via-online-repository-memory/paper-pack", "build_passport": "/api/v1/paper/learning-to-commit-generating-organic-pull-requests-via-online-repository-memory/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Learning to Commit: Generating Organic Pull Requests via Online Repository Memory", "normalized_query": "2603.26664", "route": "/paper/learning-to-commit-generating-organic-pull-requests-via-online-repository-memory", "paper_ref": "learning-to-commit-generating-organic-pull-requests-via-online-repository-memory", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/learning-to-commit-generating-organic-pull-requests-via-online-repository-memory#webpage", "url": "https://sciencetostartup.com/paper/learning-to-commit-generating-organic-pull-requests-via-online-repository-memory", "name": "Learning to Commit: Generating Organic Pull Requests via Online Repository Memory", "description": "A tool that generates organic pull requests by learning from the history of a software repository.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/learning-to-commit-generating-organic-pull-requests-via-online-repository-memory#scholarlyArticle", "headline": "Learning to Commit: Generating Organic Pull Requests via Online Repository Memory", "description": "A tool that generates organic pull requests by learning from the history of a software repository.", "url": "https://sciencetostartup.com/paper/learning-to-commit-generating-organic-pull-requests-via-online-repository-memory", "sameAs": "https://arxiv.org/abs/2603.26664", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.26664" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-27T17:58:56.000Z", "author": [ { "@type": "Person", "name": "Mo Li", "affiliation": { "@type": "Organization", "name": "Tsinghua University" } }, { "@type": "Person", "name": "L. H. Xu", "affiliation": { "@type": "Organization", "name": "Tsinghua University" } }, { "@type": "Person", "name": "Qitai Tan", "affiliation": { "@type": "Organization", "name": "Tsinghua University" } }, { "@type": "Person", "name": "Ting Cao", "affiliation": { "@type": "Organization", "name": "Tsinghua University" } }, { "@type": "Person", "name": "Yunxin Liu", "affiliation": { "@type": "Organization", "name": "Tsinghua University" } } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "AI-assisted Coding" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "AI-assisted Coding", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Learning to Commit: Generating Organic Pull Requests via Onl", "item": "https://sciencetostartup.com/paper/learning-to-commit-generating-organic-pull-requests-via-online-repository-memory" } ] }, { "@type": "FAQPage", "mainEntity": [ { "@type": "Question", "name": "What is the startup potential of \"Learning to Commit: Generating Organic Pull Requests via Onl\"?", "acceptedAnswer": { "@type": "Answer", "text": "A tool that generates organic pull requests by learning from the history of a software repository." } }, { "@type": "Question", "name": "What products could be built from this research?", "acceptedAnswer": { "@type": "Answer", "text": "This technology could be productized as an API or integration tool for development platforms like GitHub or GitLab, allowing AI coding assistants to improve their pull request acceptance rates by learning from previous commits." } }, { "@type": "Question", "name": "What are the practical use cases?", "acceptedAnswer": { "@type": "Answer", "text": "A GitHub plugin that helps developers generate pull requests that align with the coding style and architecture of specific repositories, reducing the time maintainers spend revising AI-generated code submissions." } }, { "@type": "Question", "name": "What industries could this research disrupt?", "acceptedAnswer": { "@type": "Answer", "text": "This solution could replace current AI coding tools that fail to consider historical context, reducing the need for manual revisions and increasing efficiency in software development processes." } } ] } ] }

Competitive landscape

A tool that generates organic pull requests by learning from the history of a software repository.

Segment

AI-assisted Coding

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Learning to Commit: Generating Organic Pull Requests via Online Repository Memory

Learning to Commit: Generating Organic Pull Requests via Online Repository Memory

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline