ARXIV:2603.12264 · IMAGE EDITING · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

arXiv

GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.

Blocked on Code›Score7.0Evidence unverified

Opportunity summary

Pain GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains. In this work, we introduce GRADE, the first benchmark to assess discipline-informed knowledge and reasoning in image editing.

METHOD

Full abstract

Unified multimodal models target joint understanding, reasoning, and generation, but current image editing benchmarks are largely confined to natural images and shallow commonsense reasoning, offering limited assessment of this capability under structured, domain-specific constraints. In this work, we introduce GRADE, the first benchmark to assess discipline-informed knowledge and reasoning in image editing. GRADE comprises 520 carefully curated samples across 10 academic domains, spanning from natural science to social science. To support rigorous evaluation, we propose a multi-dimensional evaluation protocol that jointly assesses Discipline Reasoning, Visual Consistency, and Logical Readability. Extensive experiments on 20 state-of-the-art open-source and closed-source models reveal substantial limitations in current models under implicit, knowledge-intensive editing settings, leading to large performance gaps. Beyond quantitative scores, we conduct rigorous analyses and ablations to expose model shortcomings and identify the constraints within disciplinary editing. Together, GRADE pinpoints key directions for the future development of unified multimodal models, advancing the research on discipline-informed image editing and reasoning. Our benchmark and evaluation code are publicly released.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. To support rigorous evaluation, we propose a multi-dimensional evaluation protocol that jointly assesses Discipline Reasoning, Visual Consistency, and Logical Readability.

WHY NOW

Image Editing moved forward this cycle; last verified April 2026. Public score 7.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainGRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

ARXIV:2603.12264 · IMAGE EDITING · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

arXiv

GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.

Blocked on Code›Score7.0Evidence unverified

Opportunity summary

Pain GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

METHOD

Full abstract

RESULT

WHY NOW

Image Editing moved forward this cycle; last verified April 2026. Public score 7.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainGRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Paper Pack

10.48550/arXiv.2603.12264

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.

Abstract

Source availability

PDF linked

The paper record includes a public PDF URL.

Extraction status

Derived fallback

Read summaries are estimated from adjacent metadata, not verified extraction rows.

Proof status

unverified

0 refs; 0 sources; 17% coverage.

What was readable

linkedon filenot materializedderived fallback41 indexednot indexed

Derived fallback: Estimated from adjacent evidence; not verified from source.

Viability

7.0

Time to MVP

MVP estimate missing

Commercial

No commercial flags on file

Export

Preparing verified analysis

lens / founder

PROBLEM

METHOD

RESULT

WHY NOW

Image Editing moved forward this cycle; last verified April 2026. Public score 7.0/10.

Claim map

Abstract-backed public claims while anchored extraction refreshes.

Strong 0Mixed 0Weak 4

Evidencepartial
GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains. In this work, we introduce GRADE, the first benchmark to assess discipline-informed knowledge and reasoning in image editing.
Implicationpartial
Abstract-backed fallback claim; anchored extraction has not materialized a public claim row yet.
Verificationpartial
partial
Evidencepartial
Unified multimodal models target joint understanding, reasoning, and generation, but current image editing benchmarks are largely confined to natural images and shallow commonsense reasoning, offering limited assessment of this capability under structured, domain-specific constraints. In this work, we introduce GRADE, the first benchmark to assess discipline-informed knowledge and reasoning in image editing.
Implicationpartial
Abstract-backed fallback claim; anchored extraction has not materialized a public claim row yet.
Verificationpartial
partial
Evidencepartial
ScienceToStartup currently rates this 7.0/10 on the public viability pass. To support rigorous evaluation, we propose a multi-dimensional evaluation protocol that jointly assesses Discipline Reasoning, Visual Consistency, and Logical Readability.
Implicationpartial
Abstract-backed fallback claim; anchored extraction has not materialized a public claim row yet.
Verificationpartial
partial
Evidencepartial
Image Editing moved forward this cycle; last verified April 2026. Public score 7.0/10.
Implicationpartial
Abstract-backed fallback claim; anchored extraction has not materialized a public claim row yet.
Verificationpartial
partial

Constellation map

Paper-native neighborhood for concepts, methods, materials, markets, and competitors. Missing lanes stay labeled instead of disappearing behind commercialization gates.

Open full Signal Canvas

Concepts

not indexed

Methods

Materials

PDF linked

Markets

Image Editing

Competitors

not indexed

Competitive landscape

GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.

Segment

Image Editing

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Buzz

No indexed public discussion is attached to 2603.12264 yet. That is a visibility signal, not a blank module: the monitor is watching the public channels below.

Hacker News

Not indexed yet

Bluesky

Not indexed yet

PDF

Preview the source document here, or use the hero PDF action for a new tab.

References(41)

OpenAI GPT-5 System Card

2025Aaditya K. Singh, A. Fry et al.

Qwen3-VL Technical Report

2025Shuai Bai, Yuxuan Cai et al.

UniModel: A Visual-Only Framework for Unified Multimodal Understanding and Generation

2025Chi Zhang, Jiepeng Wang et al.

Large multimodal models evaluation: a survey

2025Zicheng Zhang, Junying Wang et al.

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

2025Weikang Shi, Aldrich Yu et al.

Seedream 4.0: Toward Next-generation Multimodal Image Generation

2025Yun Chen, Yu Gao et al.

GenExam: A Multidisciplinary Text-to-Image Exam

2025Zhaokai Wang, Penghao Yin et al.

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

2025Weiyun Wang, Zhangwei Gao et al.

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation

2025Kaiyue Sun, Rongyao Fang et al.

Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models

2025Gen Luo, Wenhan Dou et al.

MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning

2025Yuxuan Luo, Yuhui Yuan et al.

SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model

2025Yifan Chang, Yukang Feng et al.

ImgEdit: A Unified Image Editing Dataset and Benchmark

2025Yang Ye, Xianyi He et al.

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

2025Yongliang Wu, Zong-Lin Li et al.

Emerging Properties in Unified Multimodal Pretraining

2025Chaorui Deng, Deyao Zhu et al.

WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation

2025Daoan Zhang, Che Jiang et al.

In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer

2025Zechuan Zhang, Ji Xie et al.

Step1X-Edit: A Practical Framework for General Image Editing

2025Shiyu Liu, Yucheng Han et al.

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

2025Xiangyu Zhao, Peiyuan Zhang et al.

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

2025Yuwei Niu, Munan Ning et al.

Showing 20 of 41 references

CITED BY

No citing papers are indexed in the public S2S graph yet. This is an explicit zero-signal state, not a hidden lookup.

Foundation

Prior WorkETCHR: Editing To Clarify and Harness Reasoning

7.0

Prior WorkGEditBench v2: A Human-Aligned Benchmark for General Image Editing

7.0

Prior WorkDDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing

7.0

Prior WorkUniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

7.0

Prior WorkCREval: An Automated Interpretable Evaluation for Creative Image Manipulation under Complex Instructions

7.0

Prior WorkUniEditBench: A Unified and Cost-Effective Benchmark for Image and Video Editing via Distilled MLLMs

7.0

Prior WorkCRIT: Graph-Based Automatic Data Synthesis to Enhance Cross-Modal Multi-Hop Reasoning

7.0

Extension

Builds On ThisVisual Reasoning Benchmark: Evaluating Multimodal LLMs on Classroom-Authentic Visual Problems from Primary Education

4.0

Builds On ThisAutomated Benchmark Generation from Domain Guidelines Informed by Bloom's Taxonomy

5.0

Commercially relevant

none indexed

Conflicting

Competing ApproachInEdit-Bench: Benchmarking Intermediate Logical Pathways for Intelligent Image Editing Models

5.0

Related Resources

Owned Distribution

Subscribe to the weekly brief

Get the weekly shortlist of commercializable papers, benchmark movers, and proof receipts that matter for product execution.

Agent drawer

5 surfaces preserved for agents. Humans can ignore.

Developer contracts, payload previews, evidence maps, and run controls stay here instead of the Read, Build, and Track workspace.

Run context

Paper: 2603.12264
Route: /paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing
Active tab: read
Artifact: grade-benchmarking-discipline-informed-reasoning-in-image-editing

Available agents

Read extractor
Build planner
Track monitor
Competitive mapper
Related-paper scout

API/MCP endpoints

REST paper pack API/api/v1/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing/paper-pack
REST build passport API/api/v1/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing/build-passport
REST OpenAPI/api/openapi.json
MCP descriptor/api/mcp
MCP resourcesciencetostartup://surfaces/paper-workspace

Tool contracts

paper_packbuild_passportopportunity_kernelforesightsource_proofevidence_state

Payload preview

Inspect payload

{
  "contract_version": "paper-r2",
  "paper_id": "a068f298-3c4a-451e-9016-fed55d084b57",
  "arxiv_id": "2603.12264",
  "canonical_route": "/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing",
  "active_tab": "synced from current hash by the drawer client",
  "selected_artifact": "grade-benchmarking-discipline-informed-reasoning-in-image-editing",
  "endpoints": {
    "paper_pack": "/api/v1/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing/paper-pack",
    "build_passport": "/api/v1/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing/build-passport",
    "mcp_resource": "sciencetostartup://surfaces/paper-workspace"
  }
}

Schema validation

paper-r2 contract: present
JSON-LD twin: SSR emitted
OpenAPI path parity: /api/openapi.json
MCP resource parity: paper-workspace

Job trace

queued: drawer opened by user action
running: inspect or copy payload
succeeded: payload available in SSR
failed: route errors appear in evidence cards

Evidence map

sources used: page freshness, source proof anchors, JSON-LD
missing sources: exposed by PaperPack and EvidenceState chips
derived fallbacks: marked unverified before handoff

Page Freshness

Canonical route, proof status, last verified, refs, sources, and coverage.

Page Freshness

Paper proof surface

Canonical route: /paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing

stale

Proof freshness: stale
Proof status: unverified
Display score: 7/10
Last proof check: 2026-04-02
Score updated: 2026-04-02
Score fresh until: 2026-05-02
References: 0
Source count: 0
Coverage: 17%

This page is showing the last landed evidence receipt and score bundle because the latest proof data is outside the freshness window.

OpenAlex: pending — this preprint is not yet indexed by OpenAlex.

Agent Handoff

Endpoint list, payload shape, route context, and copyable handoff data.

Agent Handoff

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

Canonical ID grade-benchmarking-discipline-informed-reasoning-in-image-editing | Route /paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing

REST example

curl https://sciencetostartup.com/api/v1/agent-handoff/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing

MCP example

{
  "tool": "get_paper",
  "arguments": {
    "arxiv_id": "2603.12264"
  }
}

source_context

{
  "surface": "paper",
  "mode": "paper",
  "query": "GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing",
  "normalized_query": "2603.12264",
  "route": "/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing",
  "paper_ref": "grade-benchmarking-discipline-informed-reasoning-in-image-editing",
  "topic_slug": null,
  "benchmark_ref": null,
  "dataset_ref": null
}

Buildability Receipt

Verdict, compute envelope, blockers, signature state, and receipt links.

Paper proof page receipt window

Watch and verify: GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

/buildability/grade-benchmarking-discipline-informed-reasoning-in-image-editing

Watchwatch

Subject: GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

Verdict

Watch

Verdict is Watch because viability or proof quality is intermediate and should be re-evaluated before execution.

Time to first demo

Insufficient data

No first-demo timestamp, owner estimate, or elapsed demo receipt is attached to this surface.

Compute envelope

Structured compute envelope

Insufficient data

No data, compute, hardware, memory, latency, dependency, or serving requirement receipt is attached.

Evidence ids

Receipt path

/buildability/grade-benchmarking-discipline-informed-reasoning-in-image-editing

Paper ref

grade-benchmarking-discipline-informed-reasoning-in-image-editing

arXiv id

2603.12264

Freshness

Generated at

2026-04-02T02:30:40.136Z

Evidence freshness

stale

Last verification

2026-04-02T02:30:40.136Z

Sources

References

Coverage

17%

Hash state

Lineage hash

e5e47589b2b200fa34e564b3b3d53a5b2213c4e34078ff5f6b78466775f2afda

Canonical opportunity-kernel lineage hash.

Signature state

External signature

unsigned_external

No founder, registry, pilot, or production-adoption signature is attached to this receipt.

Verification

not_verified

Verification is blocked until an external signature is provided.

Blockers

Missing: repo_url
Missing: references
Missing: proof_status
Missing: distribution_readiness_scores
Missing: paper_extraction_scorecards
Unknown: distribution readiness has not been computed yet
Unknown: proof verification has not been recorded yet

Verification pending / evidence receipt incomplete

repo_url

references

Missing proof, requirement, signature, approval, adoption, or telemetry fields are blockers and must not be inferred.

Open receipt API receipt Build Loop Signal Canvas Proof divergence Divergence API Brier outcomes API

Source Proof anchors

Visual citations from the paper document graph.

JSON-LD twin

The application/ld+json payload rendered for agents.

{
  "@context": "https://schema.org",
  "@graph": [
    {
      "@type": "WebPage",
      "@id": "https://sciencetostartup.com/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing#webpage",
      "url": "https://sciencetostartup.com/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing",
      "name": "GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing",
      "description": "GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.",
      "isPartOf": {
        "@id": "https://sciencetostartup.com/#website"
      }
    },
    {
      "@type": "ScholarlyArticle",
      "@id": "https://sciencetostartup.com/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing#scholarlyArticle",
      "headline": "GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing",
      "description": "GRADE is a benchmark for assessing discipline-informed reasoning in image editing across various academic domains.",
      "url": "https://sciencetostartup.com/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing",
      "sameAs": "https://arxiv.org/abs/2603.12264",
      "identifier": {
        "@type": "PropertyValue",
        "propertyID": "arXiv",
        "value": "2603.12264"
      },
      "isAccessibleForFree": true,
      "isPartOf": {
        "@id": "https://sciencetostartup.com/#website"
      },
      "datePublished": "2026-03-12T17:59:52.000Z",
      "citation": [
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "3538aa7a4ffeb4e730c425e741f952f771153671"
          },
          "url": "https://www.semanticscholar.org/paper/3538aa7a4ffeb4e730c425e741f952f771153671"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "15538df854dd33351dfb5cefd7e8f3340c8936c3"
          },
          "url": "https://www.semanticscholar.org/paper/15538df854dd33351dfb5cefd7e8f3340c8936c3"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "cd3ed14c462318fb706a0403f5cdce14a5638825"
          },
          "url": "https://www.semanticscholar.org/paper/cd3ed14c462318fb706a0403f5cdce14a5638825"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "03a38a8e2dc74592b9e5d175d45445b2f1cfaf02"
          },
          "url": "https://www.semanticscholar.org/paper/03a38a8e2dc74592b9e5d175d45445b2f1cfaf02"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "50d23dda81f3908415af3048c78a768a95772111"
          },
          "url": "https://www.semanticscholar.org/paper/50d23dda81f3908415af3048c78a768a95772111"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "1071902c5444d32970620e47321b5d5c3ec9d819"
          },
          "url": "https://www.semanticscholar.org/paper/1071902c5444d32970620e47321b5d5c3ec9d819"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "9b2810399f99db32b4141855aeb636009236c066"
          },
          "url": "https://www.semanticscholar.org/paper/9b2810399f99db32b4141855aeb636009236c066"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "18d83103fb98905ccbce420987470eb2ea021187"
          },
          "url": "https://www.semanticscholar.org/paper/18d83103fb98905ccbce420987470eb2ea021187"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "fca6507f1f8fb076a9975b62d1ee75e867f134e3"
          },
          "url": "https://www.semanticscholar.org/paper/fca6507f1f8fb076a9975b62d1ee75e867f134e3"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "8860306cb19af658848014fec21bb136adab5439"
          },
          "url": "https://www.semanticscholar.org/paper/8860306cb19af658848014fec21bb136adab5439"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "062e2d1a32e835df7bcf4bd0cbb3a6da56cd4039"
          },
          "url": "https://www.semanticscholar.org/paper/062e2d1a32e835df7bcf4bd0cbb3a6da56cd4039"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "0ffad298bcca71add8cd0779a8be959d99e142d6"
          },
          "url": "https://www.semanticscholar.org/paper/0ffad298bcca71add8cd0779a8be959d99e142d6"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "b21b337b7328537ac56be069d0e9f71697dbac26"
          },
          "url": "https://www.semanticscholar.org/paper/b21b337b7328537ac56be069d0e9f71697dbac26"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "830662aa4657354f7cfc6f31e755e01d27c22727"
          },
          "url": "https://www.semanticscholar.org/paper/830662aa4657354f7cfc6f31e755e01d27c22727"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "406a577425c36e05163ee3bf448d65a6eb480ab3"
          },
          "url": "https://www.semanticscholar.org/paper/406a577425c36e05163ee3bf448d65a6eb480ab3"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "940e6bdbb26d572481518fedc15b3be440039b61"
          },
          "url": "https://www.semanticscholar.org/paper/940e6bdbb26d572481518fedc15b3be440039b61"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "56fa0dd32cbfa750ed18b1d0f2ab0e17d3601c6b"
          },
          "url": "https://www.semanticscholar.org/paper/56fa0dd32cbfa750ed18b1d0f2ab0e17d3601c6b"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "40d6dd6f2b140ad41450ed0ecbcf6089e955faaf"
          },
          "url": "https://www.semanticscholar.org/paper/40d6dd6f2b140ad41450ed0ecbcf6089e955faaf"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "a97bdbceb286f8dd0e8dec7ea3e70e4c7164fdd0"
          },
          "url": "https://www.semanticscholar.org/paper/a97bdbceb286f8dd0e8dec7ea3e70e4c7164fdd0"
        },
        {
          "@type": "ScholarlyArticle",
          "identifier": {
            "@type": "PropertyValue",
            "propertyID": "SemanticScholar",
            "value": "2440a4eec3fd1288c4ca84456ac2e96d7ba0d3c3"
          },
          "url": "https://www.semanticscholar.org/paper/2440a4eec3fd1288c4ca84456ac2e96d7ba0d3c3"
        }
      ],
      "additionalProperty": [
        {
          "@type": "PropertyValue",
          "propertyID": "viabilityScore",
          "value": 7
        },
        {
          "@type": "PropertyValue",
          "propertyID": "researchDomain",
          "value": "Image Editing"
        }
      ]
    },
    {
      "@type": "BreadcrumbList",
      "itemListElement": [
        {
          "@type": "ListItem",
          "position": 1,
          "name": "Home",
          "item": "https://sciencetostartup.com"
        },
        {
          "@type": "ListItem",
          "position": 2,
          "name": "Image Editing",
          "item": "https://sciencetostartup.com/topics"
        },
        {
          "@type": "ListItem",
          "position": 3,
          "name": "GRADE: Benchmarking Discipline-Informed Reasoning in Image E",
          "item": "https://sciencetostartup.com/paper/grade-benchmarking-discipline-informed-reasoning-in-image-editing"
        }
      ]
    }
  ]
}

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

GRADE: Benchmarking Discipline-Informed Reasoning in Image Editing

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(41)

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(41)

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline