ARXIV:2603.16606 · CROSS-LINGUAL SENTENCE EMBEDDINGS · SUBMITTED 19 MAR · 21:31 UTC · FRESHNESS STALE

Verified: Source: PDF linked | Partial: PaperPack: 3 of 4 citation fields filled | Missing: Missing fields: authors | Partial: Proof: unverified proof status

Omnilingual SONAR: Cross-Lingual and Cross-Modal Sentence Embeddings Bridging Massively Multilingual Text and Speech

arXiv

OmniSONAR offers an unprecedented omnilingual cross-modal embedding solution for multilingual translation and search applications.

Blocked on Code | Score 9.0 | Evidence unverified

Opportunity summary

Pain OmniSONAR offers an unprecedented omnilingual cross-modal embedding solution for multilingual translation and search applications.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified

PROBLEM

OmniSONAR offers an unprecedented omnilingual cross-modal embedding solution for multilingual translation and search applications. We introduce OmniSONAR, a new family of omnilingual, cross-lingual and cross-modal sentence embedding models that natively embed text, speech, code,…

METHOD

Full abstract

Cross-lingual sentence encoders typically cover only a few hundred languages and often trade downstream quality for stronger alignment, limiting their adoption. We introduce OmniSONAR, a new family of omnilingual, cross-lingual and cross-modal sentence embedding models that natively embed text, speech, code, and mathematical expressions in a single semantic space, while delivering state-of-the-art downstream performance at the scale of thousands of languages, from high-resource to extremely low-resource varieties. To reach this scale without representation collapse, we use progressive training. We first learn a strong foundational space for 200 languages with an LLM-initialized encoder-decoder, combining token-level decoding with a novel split-softmax contrastive loss and synthetic hard negatives. Building on this foundation, we expand to several thousand language varieties via a two-stage teacher-student encoder distillation framework. Finally, we demonstrate the cross-modal extensibility of this space by seamlessly mapping 177 spoken languages into it. OmniSONAR halves cross-lingual similarity search error on the 200-language FLORES dataset and reduces error by a factor of 15 on the 1,560-language BIBLE benchmark. It also enables strong translation, outperforming NLLB-3B on multilingual benchmarks and exceeding prior models (including much larger LLMs) by 15 chrF++ points on BIBLE translation from 1,560 languages into English. OmniSONAR also performs strongly on MTEB and XLCoST. For speech, OmniSONAR achieves a 43% lower similarity-search error and reaches 97% of SeamlessM4T speech-to-text quality, despite being zero-shot for translation (trained only on ASR data). Finally, by training Spectrum, an encoder-decoder LM that processes OmniSONAR embedding sequences, exclusively on English text, we unlock high-performance transfer to thousands of languages and speech for complex downstream tasks.
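The similarity-search error figures quoted in the abstract can be made concrete with a minimal sketch. It assumes plain cosine nearest-neighbour retrieval over aligned sentence pairs, where source sentence i is a translation of target sentence i. The paper's exact scoring is not given in this record and may differ (for example, a margin-based variant). The function names and the toy vectors are hypothetical and stand in for real encoder outputs.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search_error_rate(src: List[Vector], tgt: List[Vector]) -> float:
    """Fraction of source sentences whose nearest target is NOT their aligned pair.

    Assumption: src[i] and tgt[i] embed the same sentence in two languages
    (or two modalities), so a perfect shared space gives an error of 0.0.
    """
    if not src or len(src) != len(tgt):
        raise ValueError("src and tgt must be non-empty and the same length")
    errors = 0
    for i, s in enumerate(src):
        # Retrieve the index of the most similar target embedding.
        best = max(range(len(tgt)), key=lambda j: cosine(s, tgt[j]))
        if best != i:
            errors += 1
    return errors / len(src)


# Toy, hand-made embeddings (hypothetical): the second pair is misaligned.
src = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
tgt = [[0.9, 0.1], [0.8, 0.2], [0.7, 0.7]]
print(round(search_error_rate(src, tgt), 3))  # → 0.333
```

Under this definition, "halving the error" means the fraction of mis-retrieved pairs drops by half on the same evaluation set; the metric says nothing about how close the correct pair was when retrieval failed.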

RESULT

ScienceToStartup currently rates this 9.0/10 on the public viability pass. The abstract reports that 177 spoken languages are mapped into the shared embedding space, demonstrating its cross-modal extensibility.

WHY NOW

Cross-Lingual Sentence Embeddings moved forward this cycle; last verified April 2026. Public score 9.0/10.

Opportunity summary

Score 9.0

Pain OmniSONAR offers an unprecedented omnilingual cross-modal embedding solution for multilingual translation and search applications.

Evidence 0 refs | 0 sources | 33% coverage

Blocker missing authors

Competitive landscape

OmniSONAR offers an unprecedented omnilingual cross-modal embedding solution for multilingual translation and search applications.

Segment

Cross-Lingual Sentence Embeddings

Adoption evidence

No public code link in the paper record yet

Commercial read

9.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Authors (all affiliated with Meta): Paul-Ambroise Duquenne, João Maria Janeiro, Pere-Lluís Huguet Cabot, Ioannis Tsiamas, Yen Meng, Vivek Iyer, Guillem Ramírez, Loic Barrault, Belen Alastruey, Yu-An Chung, Marta R. Costa-Jussa, David Dale, Kevin Heffernan, Jaehyeong Jo, Artyom Kozhevnikov, Alexandre Mourachko, Christophe Ropers, Holger Schwenk

Published: 2026-03-17T14:47:35.000Z

arXiv: https://arxiv.org/abs/2603.16606

Page: https://sciencetostartup.com/paper/omnilingual-sonar-cross-lingual-and-cross-modal-sentence-embeddings-bridging-massively-multilingual-text-and-speech

FAQ

What is the startup potential of "Omnilingual SONAR: Cross-Lingual and Cross-Modal Sentence Em"?

OmniSONAR offers an unprecedented omnilingual cross-modal embedding solution for multilingual translation and search applications.

What products could be built from this research?

The product can be developed as a cloud-based API service that provides real-time translation and text-to-speech capabilities over a vast number of languages, capitalizing on the extensive coverage and enhanced performance of OmniSONAR.

What are the practical use cases?

A commercial application could be a universal translation service that operates seamlessly across 4,200 languages, providing both text and voice translation capabilities, targeted at global enterprises and government bodies.

What industries could this research disrupt?

OmniSONAR could replace existing multilingual solutions by offering improved accuracy and broader language support, making it a preferred tool for providers that rely on seamless language translation and transcription.
