ARXIV:2603.11804 · REMOTE SENSING VLMS · SUBMITTED 19 MAR · 21:31 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

OSM-based Domain Adaptation for Remote Sensing VLMs

arXiv

OSMDA is a self-contained domain adaptation framework for Vision-Language Models that eliminates the need for costly external annotations by leveraging OpenStreetMap data.

Blocked on Code›Score8.0Evidence unverified

Opportunity summary

Pain OSMDA is a self-contained domain adaptation framework for Vision-Language Models that eliminates the need for costly external annotations by leveraging OpenStreetMap data.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

OSMDA is a self-contained domain adaptation framework for Vision-Language Models that eliminates the need for costly external annotations by leveraging OpenStreetMap data. Prevailing pseudo-labeling pipelines address this gap by distilling knowledge from large frontier…

METHOD

Full abstract

Vision-Language Models (VLMs) adapted to remote sensing rely heavily on domain-specific image-text supervision, yet high-quality annotations for satellite and aerial imagery remain scarce and expensive to produce. Prevailing pseudo-labeling pipelines address this gap by distilling knowledge from large frontier models, but this dependence on large teachers is costly, limits scalability, and caps achievable performance at the ceiling of the teacher. We propose OSMDA: a self-contained domain adaptation framework that eliminates this dependency. Our key insight is that a capable base VLM can serve as its own annotation engine: by pairing aerial images with rendered OpenStreetMap (OSM) tiles, we leverage optical character recognition and chart comprehension capabilities of the model to generate captions enriched by OSM's vast auxiliary metadata. The model is then fine-tuned on the resulting corpus with satellite imagery alone, yielding OSMDA-VLM, a domain-adapted VLM that requires no manual labeling and no stronger external model. We conduct exhaustive evaluations spanning 10 benchmarks across image-text-to-text tasks and comparing against 9 competitive baselines. When equally mixed with real data, our method achieves state-of-the-art results, while being substantially cheaper to train than teacher-dependent alternatives. These results suggest that, given a strong foundation model, alignment with crowd-sourced geographic data is a practical and scalable path towards remote sensing domain adaptation. Dataset and model weights will be made publicly available.

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. When equally mixed with real data, our method achieves state-of-the-art results, while being substantially cheaper to train than teacher-dependent alternatives.

WHY NOW

Remote Sensing VLMs moved forward this cycle; last verified April 2026. Public score 8.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score8.0

PainOSMDA is a self-contained domain adaptation framework for Vision-Language Models that eliminates the need for costly external annotations by leveraging OpenStreetMap data.

Evidence0 refs | 0 sources | 33% coverage

Blockermissing authors

Analysis summary

OSMDA is a self-contained domain adaptation framework for Vision-Language Models that eliminates the need for costly external annotations by leveraging OpenStreetMap data.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

OSMDA is a self-contained domain adaptation framework for Vision-Language Models that eliminates the need for costly external annotations by leveraging OpenStreetMap data.

Segment

Remote Sensing VLMs

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "f0ae76b2-2b24-473c-b45e-f01615c32faa", "arxiv_id": "2603.11804", "canonical_route": "/paper/osm-based-domain-adaptation-for-remote-sensing-vlms", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "osm-based-domain-adaptation-for-remote-sensing-vlms", "endpoints": { "paper_pack": "/api/v1/paper/osm-based-domain-adaptation-for-remote-sensing-vlms/paper-pack", "build_passport": "/api/v1/paper/osm-based-domain-adaptation-for-remote-sensing-vlms/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "OSM-based Domain Adaptation for Remote Sensing VLMs", "normalized_query": "2603.11804", "route": "/paper/osm-based-domain-adaptation-for-remote-sensing-vlms", "paper_ref": "osm-based-domain-adaptation-for-remote-sensing-vlms", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/osm-based-domain-adaptation-for-remote-sensing-vlms#webpage", "url": "https://sciencetostartup.com/paper/osm-based-domain-adaptation-for-remote-sensing-vlms", "name": "OSM-based Domain Adaptation for Remote Sensing VLMs", "description": "OSMDA is a self-contained domain adaptation framework for Vision-Language Models that eliminates the need for costly external annotations by leveraging OpenStreetMap data.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/osm-based-domain-adaptation-for-remote-sensing-vlms#scholarlyArticle", "headline": "OSM-based Domain Adaptation for Remote Sensing VLMs", "description": "OSMDA is a self-contained domain adaptation framework for Vision-Language Models that eliminates the need for costly external annotations by leveraging OpenStreetMap data.", "url": "https://sciencetostartup.com/paper/osm-based-domain-adaptation-for-remote-sensing-vlms", "sameAs": "https://arxiv.org/abs/2603.11804", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.11804" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-12T11:08:30.000Z", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 8 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Remote Sensing VLMs" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Remote Sensing VLMs", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "OSM-based Domain Adaptation for Remote Sensing VLMs", "item": "https://sciencetostartup.com/paper/osm-based-domain-adaptation-for-remote-sensing-vlms" } ] } ] }

Competitive landscape

OSMDA is a self-contained domain adaptation framework for Vision-Language Models that eliminates the need for costly external annotations by leveraging OpenStreetMap data.

Segment

Remote Sensing VLMs

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

OSM-based Domain Adaptation for Remote Sensing VLMs

OSM-based Domain Adaptation for Remote Sensing VLMs

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline