ARXIV:2604.00093 · GENERATIVE IMAGING · SUBMITTED 02 APR · 21:00 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

RawGen: Learning Camera Raw Image Generation

Dongyoung Kim · Junyong Lee · Abhijith Punnappurath · Mahmoud Afifi · Sangmin Han · Alex Levinshtein · +1 at arXiv

A diffusion-based framework for generating camera raw images from text or sRGB inputs, enabling scalable synthetic data for downstream vision tasks.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A diffusion-based framework for generating camera raw images from text or sRGB inputs, enabling scalable synthetic data for downstream vision tasks.

Evidence 82 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A diffusion-based framework for generating camera raw images from text or sRGB inputs, enabling scalable synthetic data for downstream vision tasks. Although raw data is more faithful for low-level vision tasks, collecting large-scale raw…

METHOD

Full abstract

Cameras capture scene-referred linear raw images, which are processed by onboard image signal processors (ISPs) into display-referred 8-bit sRGB outputs. Although raw data is more faithful for low-level vision tasks, collecting large-scale raw datasets remains a major bottleneck, as existing datasets are limited and tied to specific camera hardware. Generative models offer a promising way to address this scarcity -- however, existing diffusion frameworks are designed to synthesize photo-finished sRGB images rather than physically consistent linear representations. This paper presents RawGen, to our knowledge the first diffusion-based framework enabling text-to-raw generation for arbitrary target cameras, alongside sRGB-to-raw inversion. RawGen leverages the generative priors of large-scale sRGB diffusion models to synthesize physically meaningful linear outputs, such as CIE XYZ or camera-specific raw representations, via specialized processing in latent and pixel spaces. To handle unknown and diverse ISP pipelines and photo-finishing effects in diffusion-model training data, we build a many-to-one inverse-ISP dataset where multiple sRGB renditions of the same scene generated using diverse ISP parameters are anchored to a common scene-referred target. Fine-tuning a conditional denoiser and specialized decoder on this dataset allows RawGen to obtain camera-centric linear reconstructions that effectively invert the rendering pipeline. We demonstrate RawGen's superior performance over traditional inverse-ISP methods that assume a fixed ISP. Furthermore, we show that augmenting training pipelines with RawGen's scalable, text-driven synthetic data can benefit downstream low-level vision tasks.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. We demonstrate RawGen's superior performance over traditional inverse-ISP methods that assume a fixed ISP. Code availability is flagged in the production record; the public…

WHY NOW

Generative Imaging moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA diffusion-based framework for generating camera raw images from text or sRGB inputs, enabling scalable synthetic data for downstream vision tasks.

Evidence82 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A diffusion-based framework for generating camera raw images from text or sRGB inputs, enabling scalable synthetic data for downstream vision tasks.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A diffusion-based framework for generating camera raw images from text or sRGB inputs, enabling scalable synthetic data for downstream vision tasks.

Segment

Generative Imaging

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "3df6dfc8-5645-4db1-9cd0-adb5dde66a6e", "arxiv_id": "2604.00093", "canonical_route": "/paper/rawgen-learning-camera-raw-image-generation", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "rawgen-learning-camera-raw-image-generation", "endpoints": { "paper_pack": "/api/v1/paper/rawgen-learning-camera-raw-image-generation/paper-pack", "build_passport": "/api/v1/paper/rawgen-learning-camera-raw-image-generation/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "RawGen: Learning Camera Raw Image Generation", "normalized_query": "2604.00093", "route": "/paper/rawgen-learning-camera-raw-image-generation", "paper_ref": "rawgen-learning-camera-raw-image-generation", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/rawgen-learning-camera-raw-image-generation#webpage", "url": "https://sciencetostartup.com/paper/rawgen-learning-camera-raw-image-generation", "name": "RawGen: Learning Camera Raw Image Generation", "description": "A diffusion-based framework for generating camera raw images from text or sRGB inputs, enabling scalable synthetic data for downstream vision tasks.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/rawgen-learning-camera-raw-image-generation#scholarlyArticle", "headline": "RawGen: Learning Camera Raw Image Generation", "description": "A diffusion-based framework for generating camera raw images from text or sRGB inputs, enabling scalable synthetic data for downstream vision tasks.", "url": "https://sciencetostartup.com/paper/rawgen-learning-camera-raw-image-generation", "sameAs": "https://arxiv.org/abs/2604.00093", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.00093" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-31T18:12:48.000Z", "author": [ { "@type": "Person", "name": "Dongyoung Kim" }, { "@type": "Person", "name": "Junyong Lee" }, { "@type": "Person", "name": "Abhijith Punnappurath" }, { "@type": "Person", "name": "Mahmoud Afifi" }, { "@type": "Person", "name": "Sangmin Han" }, { "@type": "Person", "name": "Alex Levinshtein" }, { "@type": "Person", "name": "Michael S. Brown" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Generative Imaging" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Generative Imaging", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "RawGen: Learning Camera Raw Image Generation", "item": "https://sciencetostartup.com/paper/rawgen-learning-camera-raw-image-generation" } ] } ] }

Competitive landscape

A diffusion-based framework for generating camera raw images from text or sRGB inputs, enabling scalable synthetic data for downstream vision tasks.

Segment

Generative Imaging

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

RawGen: Learning Camera Raw Image Generation

RawGen: Learning Camera Raw Image Generation

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline