ARXIV:2602.23622 · COMPUTER VISION · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

Verified · Source: PDF linked | Partial · PaperPack: 3 of 4 citation fields filled | Missing · Missing fields: authors | Partial · Proof: unverified proof status

DLEBench: Evaluating Small-scale Object Editing Ability for Instruction-based Image Editing Model

arXiv

DLEBench offers a benchmark for evaluating and improving the performance of instruction-based image editing models on small objects.

Blocked on Code · Score 5.0 · Evidence unverified

Opportunity summary

Pain: DLEBench offers a benchmark for evaluating and improving the performance of instruction-based image editing models on small objects.

Evidence: 0 refs | 0 sources | 17% coverage

Blocker: Evidence unverified


PROBLEM

Instruction-based image editing models (IIEMs) demonstrate plausible adherence to instructions and strong reasoning ability on current benchmarks, but their ability to edit small objects remains underexplored, despite its importance for precise local editing and refining details in both real and generated images.

METHOD

Full abstract

Significant progress has been made in the field of Instruction-based Image Editing Models (IIEMs). However, while these models demonstrate plausible adherence to instructions and strong reasoning ability on current benchmarks, their ability to edit small objects remains underexplored, despite its importance for precise local editing and refining details in both real and generated images. In this paper, we introduce DeepLookEditBench (DLEBench), the first benchmark dedicated to assessing the abilities of IIEMs in editing small-scale objects. Specifically, we construct a challenging testbed comprising 1889 samples across seven instruction types. In these samples, target objects occupy only 1%-10% of the image area, covering complex scenarios such as partial occlusion and multi-object editing. To ensure robust evaluation on this benchmark, we propose an evaluation protocol with refined score rubrics to minimize subjectivity and ambiguity in two criteria: Instruction Following and Visual Consistency. This protocol also introduces a dual-mode evaluation framework (Tool-driven and Oracle-guided Modes) addressing the misalignment between LMM-as-a-Judge and human judgements on DLEBench. Empirical results on 10 IIEMs reveal significant performance gaps in small-scale object editing, highlighting the need for specialized benchmarks to advance this ability.
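The abstract's selection criterion (target objects occupying only 1%-10% of the image area) can be illustrated with a minimal sketch. This is not the authors' code: the `Box` helper, the function names, the use of a bounding box rather than a segmentation mask, and the inclusive treatment of the 1% and 10% boundaries are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box in pixel coordinates (hypothetical helper)."""
    x_min: int
    y_min: int
    x_max: int
    y_max: int

    def area(self) -> int:
        # Degenerate or inverted boxes count as zero area.
        width = max(0, self.x_max - self.x_min)
        height = max(0, self.y_max - self.y_min)
        return width * height


def area_ratio(box: Box, image_width: int, image_height: int) -> float:
    """Fraction of the image area covered by the box."""
    if image_width <= 0 or image_height <= 0:
        raise ValueError("image dimensions must be positive")
    return box.area() / (image_width * image_height)


def is_small_scale(
    box: Box,
    image_width: int,
    image_height: int,
    low: float = 0.01,   # 1% lower bound from the abstract
    high: float = 0.10,  # 10% upper bound from the abstract
) -> bool:
    """True if the target covers between low and high of the image.

    Inclusive bounds are an assumption; the abstract does not say
    whether the endpoints are included.
    """
    return low <= area_ratio(box, image_width, image_height) <= high


if __name__ == "__main__":
    # A 200x200 target in a 1000x1000 image covers 4% of the area.
    target = Box(100, 100, 300, 300)
    print(area_ratio(target, 1000, 1000), is_small_scale(target, 1000, 1000))
```

A mask-based variant would replace `Box.area()` with a count of foreground pixels; which of the two the benchmark uses is not stated in this record.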

RESULT

ScienceToStartup currently rates this 5.0/10 on the public viability pass. In the paper, empirical results on 10 IIEMs reveal significant performance gaps in small-scale object editing.

WHY NOW

Computer Vision moved forward this cycle; last verified April 2026. Public score 5.0/10.



Competitive landscape

Segment: Computer Vision

Adoption evidence: No public code link in the paper record yet

Commercial read: 5.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified


Source: https://arxiv.org/abs/2602.23622 (arXiv:2602.23622) · Published 2026-02-27T02:59:34.000Z

