ARXIV:2604.18320 · MLLM SELF-EVOLUTION · SUBMITTED 21 APR · 20:33 UTC · FRESHNESS STALE

Verified · Source: PDF linked · Verified · PaperPack: citation fields available

EVE: Verifiable Self-Evolution of MLLMs via Executable Visual Transformations

Yongrui Heng · Chaoya Jiang · Han Yang · Shikun Zhang · Wei Ye · arXiv

EVE is a framework for verifiable MLLM self-evolution using executable visual transformations, generating diverse and challenging training data with verified ground truth.

Ship in 2-4 weeks · Score 8.0 · Evidence verified

Opportunity summary

Pain EVE is a framework for verifiable MLLM self-evolution using executable visual transformations, generating diverse and challenging training data with verified ground truth.

Evidence 0 refs | 4 sources | 83% coverage

Blocker Evidence verified


PROBLEM

EVE is a framework for verifiable MLLM self-evolution using executable visual transformations, generating diverse and challenging training data with verified ground truth. We contend that robust, continuous self-improvement requires not only deterministic external feedback…

METHOD

Full abstract

Self-evolution of multimodal large language models (MLLMs) remains a critical challenge: pseudo-label-based methods suffer from progressive quality degradation as model predictions drift, while template-based methods are confined to a static set of transformations that cannot adapt in difficulty or diversity. We contend that robust, continuous self-improvement requires not only deterministic external feedback independent of the model's internal certainty, but also a mechanism to perpetually diversify the training distribution. To this end, we introduce EVE (Executable Visual transformation-based self-Evolution), a novel framework that entirely bypasses pseudo-labels by harnessing executable visual transformations continuously enriched in both variety and complexity. EVE adopts a Challenger-Solver dual-policy architecture. The Challenger maintains and progressively expands a queue of visual transformation code examples, from which it synthesizes novel Python scripts to perform dynamic visual transformations. Executing these scripts yields VQA problems with absolute, execution-verified ground-truth answers, eliminating any reliance on model-generated supervision. A multi-dimensional reward system integrating semantic diversity and dynamic difficulty calibration steers the Challenger to enrich its code example queue while posing progressively more challenging tasks, preventing mode collapse and fostering reciprocal co-evolution between the two policies. Extensive experiments demonstrate that EVE consistently surpasses existing self-evolution methods, establishing a robust and scalable paradigm for verifiable MLLM self-evolution. The code is available at https://github.com/0001Henry/EVE .
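The execution-verified ground truth described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the grid-of-integers image, the rotation task, and the names `VQAProblem`, `rotate_cw`, `make_rotation_problem`, `solver_reward`, and `difficulty_reward` are all hypothetical, and the difficulty term is only one plausible shape for "dynamic difficulty calibration" (the abstract does not give a formula). The point it shows is that the answer comes from the parameter the script itself sampled and executed, so no model-generated label is involved.

```python
import random
from dataclasses import dataclass

# Hypothetical stand-in for an image: a grid of integer pixel values.
Image = list[list[int]]


@dataclass
class VQAProblem:
    original: Image
    transformed: Image
    question: str
    answer: str  # taken from the executed transformation, never from a model


def rotate_cw(image: Image, quarter_turns: int) -> Image:
    """Rotate the grid clockwise by 90 degrees, quarter_turns times."""
    for _ in range(quarter_turns % 4):
        # Reversing the rows and transposing is one clockwise quarter turn.
        image = [list(row) for row in zip(*image[::-1])]
    return image


def make_rotation_problem(image: Image, rng: random.Random) -> VQAProblem:
    """Apply a randomly chosen rotation; the sampled parameter is the answer."""
    quarter_turns = rng.choice([1, 2, 3])
    return VQAProblem(
        original=image,
        transformed=rotate_cw(image, quarter_turns),
        question="By how many degrees clockwise was the first image rotated?",
        answer=str(90 * quarter_turns),
    )


def solver_reward(prediction: str, problem: VQAProblem) -> float:
    """Deterministic check of a Solver prediction against the executed answer."""
    return 1.0 if prediction.strip() == problem.answer else 0.0


def difficulty_reward(solver_accuracy: float, target: float = 0.5) -> float:
    """Assumed difficulty term: highest when the Solver succeeds about
    `target` of the time, zero when the task is trivial or impossible."""
    return 1.0 - abs(solver_accuracy - target) / max(target, 1.0 - target)
```

In this sketch a Challenger would be rewarded for tasks the Solver gets right roughly half the time, which is one simple way to push toward progressively harder but still solvable problems. A symmetric input image would make the rotation answer ambiguous, so a real pipeline would need to filter such cases.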

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. Extensive experiments demonstrate that EVE consistently surpasses existing self-evolution methods, establishing a robust and scalable paradigm for verifiable MLLM self-evolution. A public repository is…

WHY NOW

MLLM Self-Evolution moved forward this cycle; last verified April 2026. Public score 8.0/10. Implementation evidence is present through a linked repository.


Opportunity summary

Score 8.0

Pain EVE is a framework for verifiable MLLM self-evolution using executable visual transformations, generating diverse and challenging training data with verified ground truth.

Evidence 0 refs | 4 sources | 83% coverage

Blocker no shell-level blocker reported

Analysis summary

EVE is a framework for verifiable MLLM self-evolution using executable visual transformations, generating diverse and challenging training data with verified ground truth.


Competitive landscape

EVE is a framework for verifiable MLLM self-evolution using executable visual transformations, generating diverse and challenging training data with verified ground truth.

Segment

MLLM Self-Evolution

Adoption evidence

Public code linked for build inspection

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified


