ARXIV:2605.14534 · IMAGE/VIDEO EDITING EVALUATION · SUBMITTED 15 MAY · 20:12 UTC · FRESHNESS FRESH

Verified Source: PDF linked | Verified PaperPack: citation fields available | Partial Proof: unverified proof status

PROVE: A Perceptual RemOVal cohErence Benchmark for Visual Media

Fuhao Li · Shaofeng You · Jiagao Hu · Yu Liu · Yuxuan Chen · Zepeng Wang · Fei Wang · Daiguo Zhou · Jian Luan (arXiv)

PROVE is a new evaluation framework with perception-aligned metrics and a benchmark dataset for assessing object removal quality in visual media.

Ship in 2-4 weeks | Score 7.0 | Evidence unverified

Opportunity summary

Pain PROVE is a new evaluation framework with perception-aligned metrics and a benchmark dataset for assessing object removal quality in visual media.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified


PROBLEM

PROVE is a new evaluation framework with perception-aligned metrics and a benchmark dataset for assessing object removal quality in visual media. Full-reference metrics reward copy-paste behaviors over genuine erasure; no-reference metrics suffer from systematic…

METHOD

Full abstract

Evaluating object removal in images and videos remains challenging because the task is inherently one-to-many, yet existing metrics frequently disagree with human perception. Full-reference metrics reward copy-paste behaviors over genuine erasure; no-reference metrics suffer from systematic biases such as favoring blurry results; and global temporal metrics are insensitive to localized artifacts within edited regions. To address these limitations, we propose RC (Removal Coherence), a pair of perception-aligned metrics: RC-S, which measures spatial coherence via sliding-window feature comparison between masked and background regions, and RC-T, which measures temporal consistency via distribution tracking within shared restored regions across adjacent frames. To validate RC and support community benchmarking, we further introduce PROVE-Bench, a two-tier real-world benchmark comprising PROVE-M, an 80-video paired dataset with motion augmentation, and PROVE-H, a 100-video challenging subset without ground truth. Together, the RC metrics and PROVE-Bench form the PROVE (Perceptual RemOVal cohErence) evaluation framework for visual media. Experiments across diverse image and video benchmarks demonstrate that RC achieves substantially stronger alignment with human judgments than existing evaluation protocols. The code for the RC metrics and PROVE-Bench is publicly available at: https://github.com/xiaomi-research/prove/.
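The abstract describes RC-S and RC-T only at a high level, so the following is a minimal sketch of the two ideas and not the authors' implementation. Everything concrete here is an assumption made for illustration: raw grayscale intensities stand in for learned features, each window is summarized by its mean and standard deviation, histograms are compared with total-variation distance, and the names `spatial_incoherence`, `temporal_drift`, and `toy_frame` are hypothetical. The linked repository is the reference for the actual metrics.

```python
from math import dist
from statistics import mean, pstdev


def window_descriptors(image, mask, win=4, stride=2):
    """Describe each window by (mean, std) and split into masked / background."""
    height, width = len(image), len(image[0])
    masked, background = [], []
    for top in range(0, height - win + 1, stride):
        for left in range(0, width - win + 1, stride):
            cells = [(y, x) for y in range(top, top + win)
                     for x in range(left, left + win)]
            pixels = [image[y][x] for y, x in cells]
            covered = sum(1 for y, x in cells if mask[y][x])
            descriptor = (mean(pixels), pstdev(pixels))
            if covered == len(cells):    # window lies fully inside the removal mask
                masked.append(descriptor)
            elif covered == 0:           # window lies fully in untouched background
                background.append(descriptor)
    return masked, background


def spatial_incoherence(image, mask, win=4, stride=2):
    """Mean distance from each masked window to its closest background window."""
    masked, background = window_descriptors(image, mask, win, stride)
    if not masked or not background:
        raise ValueError("need at least one fully masked and one fully unmasked window")
    return mean(min(dist(m, b) for b in background) for m in masked)


def histogram(values, bins=8):
    """Normalized histogram for values assumed to lie in [0, 1]."""
    counts = [0] * bins
    for value in values:
        counts[min(int(value * bins), bins - 1)] += 1
    return [count / len(values) for count in counts]


def temporal_drift(frames, masks, bins=8):
    """Mean total-variation distance between adjacent frames' shared masked pixels."""
    drifts = []
    for t in range(len(frames) - 1):
        shared = [(y, x) for y, row in enumerate(masks[t])
                  for x, on in enumerate(row) if on and masks[t + 1][y][x]]
        if not shared:
            continue  # masks do not overlap, so this frame pair is skipped
        before = histogram([frames[t][y][x] for y, x in shared], bins)
        after = histogram([frames[t + 1][y][x] for y, x in shared], bins)
        drifts.append(0.5 * sum(abs(a - b) for a, b in zip(before, after)))
    if not drifts:
        raise ValueError("no adjacent frames share a masked region")
    return mean(drifts)


def toy_frame(fill, size=12, lo=4, hi=8, base=0.5):
    """Hypothetical test frame: flat background with a filled square [lo, hi)."""
    inside = lambda y, x: lo <= y < hi and lo <= x < hi
    image = [[fill if inside(y, x) else base for x in range(size)] for y in range(size)]
    mask = [[inside(y, x) for x in range(size)] for y in range(size)]
    return image, mask
```

In this sketch, a fill that matches the surrounding background yields a spatial score of zero, and a restored region whose intensity distribution jumps between adjacent frames yields a temporal score near one. Both scores are "lower is better" by construction here; the source does not state the orientation or range of the real RC-S and RC-T values.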

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Full-reference metrics reward copy-paste behaviors over genuine erasure; no-reference metrics suffer from systematic biases such as favoring blurry results; and global temporal metrics are…
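The abstract's headline result is that RC aligns more closely with human judgments than existing protocols. The source does not say which agreement statistic the paper uses; Spearman rank correlation is one common choice for this kind of comparison, and the sketch below assumes it. The function names and the score lists in the usage note are made up for illustration and are not results from the paper.

```python
from statistics import mean


def average_ranks(values):
    """1-based ranks, with tied values sharing the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1  # extend the group of tied values
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(metric_scores, human_scores):
    """Pearson correlation of the two rank vectors."""
    rx, ry = average_ranks(metric_scores), average_ranks(human_scores)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one input is constant")
    return cov / (var_x * var_y) ** 0.5
```

Calling `spearman([0.1, 0.4, 0.2, 0.8], [1, 3, 2, 5])` on these invented numbers gives a correlation of 1, because both lists order the four samples identically. A metric that ranks results the opposite way from annotators would score -1.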

WHY NOW

Image/Video Editing Evaluation moved forward this cycle; last verified May 2026. Public score 7.0/10. Implementation evidence is present through a linked repository.


Opportunity summary

Score 7.0

Pain PROVE is a new evaluation framework with perception-aligned metrics and a benchmark dataset for assessing object removal quality in visual media.

Evidence 0 refs | 0 sources | 0% coverage

Blocker no shell-level blocker reported

Analysis summary

PROVE is a new evaluation framework with perception-aligned metrics and a benchmark dataset for assessing object removal quality in visual media.


Competitive landscape

PROVE is a new evaluation framework with perception-aligned metrics and a benchmark dataset for assessing object removal quality in visual media.

Segment: Image/Video Editing Evaluation

Adoption evidence: Public code linked for build inspection

Commercial read: 7.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified


Paper metadata: arXiv 2605.14534 (https://arxiv.org/abs/2605.14534) · published 2026-05-14 08:16 UTC · authors: Fuhao Li, Shaofeng You, Jiagao Hu, Yu Liu, Yuxuan Chen, Zepeng Wang, Fei Wang, Daiguo Zhou, Jian Luan · code: https://github.com/xiaomi-research/prove · research domain: Image/Video Editing Evaluation

