ARXIV:2603.16184 · MULTILINGUAL ASR · SUBMITTED 19 MAR · 20:22 UTC · FRESHNESS STALE

Source: PDF linked (verified) | PaperPack: 3 of 4 citation fields filled (partial) | Missing fields: authors | Proof: partial proof status

Polyglot-Lion: Efficient Multilingual ASR for Singapore via Balanced Fine-Tuning of Qwen3-ASR

arXiv

Polyglot-Lion offers efficient multilingual ASR tailored for Singapore's diverse languages at a fraction of the cost of larger models.

Blocked on Code | Score 8.0 | Evidence partial

Opportunity summary

Pain: Polyglot-Lion offers efficient multilingual ASR tailored for Singapore's diverse languages at a fraction of the cost of larger models.

Evidence: 0 refs | 0 sources | 50% coverage

Blocker: Evidence partial

PROBLEM

Polyglot-Lion offers efficient multilingual ASR tailored for Singapore's diverse languages at a fraction of the cost of larger models. Our models are obtained by fine-tuning Qwen3-ASR-0.6B and Qwen3-ASR-1.7B exclusively on publicly available speech corpora,…

METHOD

Full abstract

We present Polyglot-Lion, a family of compact multilingual automatic speech recognition (ASR) models tailored for the linguistic landscape of Singapore, covering English, Mandarin, Tamil, and Malay. Our models are obtained by fine-tuning Qwen3-ASR-0.6B and Qwen3-ASR-1.7B exclusively on publicly available speech corpora, using a balanced sampling strategy that equalizes the number of training utterances per language and deliberately omits language-tag conditioning so that the model learns to identify languages implicitly from audio. On 12 benchmarks spanning the four target languages, Polyglot-Lion-1.7B achieves an average error rate of 14.85, competitive with MERaLiON-2-10B-ASR (14.32), a model 6x larger, while incurring a training cost of $81 on a single RTX PRO 6000 GPU compared to $18,862 for the 128-GPU baseline. Inference throughput is approximately 20x faster than MERaLiON at 0.10 s/sample versus 2.02 s/sample. These results demonstrate that linguistically balanced fine-tuning of moderate-scale pretrained models can yield deployment-ready multilingual ASR at a fraction of the cost of larger specialist systems.
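The balanced sampling idea in the abstract can be illustrated with a minimal sketch. The abstract only says that the number of training utterances per language is equalized; it does not say how. The version below assumes downsampling every language to the size of the smallest one, and the function name `balance_by_language` and the `(language, utterance_id)` input format are hypothetical, not taken from the paper or its repository.

```python
import random
from collections import defaultdict


def balance_by_language(utterances, seed=0):
    """Return a shuffled subset with the same number of utterances per language.

    `utterances` is a list of (language, utterance_id) pairs.
    Assumption: balance by downsampling to the smallest language;
    the paper may use a different scheme (e.g. upsampling or capping).
    """
    by_lang = defaultdict(list)
    for lang, utt in utterances:
        by_lang[lang].append(utt)

    # Every language is cut down to the size of the smallest one.
    target = min(len(items) for items in by_lang.values())

    rng = random.Random(seed)
    balanced = []
    for lang in sorted(by_lang):
        for utt in rng.sample(by_lang[lang], target):
            # The language label is kept here only for bookkeeping. With no
            # language-tag conditioning, it would not be fed to the model.
            balanced.append((lang, utt))

    rng.shuffle(balanced)
    return balanced


if __name__ == "__main__":
    # Toy, made-up corpus sizes purely for illustration.
    sizes = {"en": 5, "zh": 3, "ta": 2, "ms": 4}
    corpus = [(lang, f"{lang}-{i}") for lang, n in sizes.items() for i in range(n)]
    for lang, utt in balance_by_language(corpus):
        print(lang, utt)
```

With the toy sizes above, each of the four languages contributes two utterances, so the balanced set has eight items. Downsampling discards data from the larger corpora; whether that trade-off matches the authors' setup is not stated in the abstract.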

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. On 12 benchmarks spanning the four target languages, Polyglot-Lion-1.7B achieves an average error rate of 14.85, competitive with MERaLiON-2-10B-ASR (14.32) - a model 6x…
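The headline comparisons can be checked with a few lines of arithmetic. The sketch below uses only the figures quoted in the full abstract; the parameter counts (1.7B and 10B) are read off the model names and are an assumption about nominal size, and the dictionary keys and the `compare` helper are hypothetical names chosen for illustration.

```python
# Figures as quoted in the abstract. Parameter counts are nominal values
# inferred from the model names (an assumption, not a reported measurement).
POLYGLOT_LION_1_7B = {
    "params_b": 1.7,
    "avg_error": 14.85,
    "train_cost_usd": 81,
    "sec_per_sample": 0.10,
}
MERALION_2_10B = {
    "params_b": 10.0,
    "avg_error": 14.32,
    "train_cost_usd": 18862,
    "sec_per_sample": 2.02,
}


def compare(small, large):
    """Compute size, cost, and speed ratios plus the error-rate gap."""
    return {
        "size_ratio": large["params_b"] / small["params_b"],
        "cost_ratio": large["train_cost_usd"] / small["train_cost_usd"],
        "speedup": large["sec_per_sample"] / small["sec_per_sample"],
        "error_gap_points": small["avg_error"] - large["avg_error"],
        "error_gap_relative": (small["avg_error"] - large["avg_error"])
        / large["avg_error"],
    }


if __name__ == "__main__":
    for name, value in compare(POLYGLOT_LION_1_7B, MERALION_2_10B).items():
        print(f"{name}: {value:.3f}")
```

Under these inputs the larger model is about 5.9x the size (rounded to 6x in the abstract), about 233x the training cost, and about 20.2x slower per sample, while the smaller model's average error rate is 0.53 points higher (roughly 3.7% relative).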

WHY NOW

Multilingual ASR moved forward this cycle; last verified April 2026. Public score 8.0/10. Implementation evidence is present through a linked repository.

Opportunity summary

Score: 8.0

Pain: Polyglot-Lion offers efficient multilingual ASR tailored for Singapore's diverse languages at a fraction of the cost of larger models.

Evidence: 0 refs | 0 sources | 50% coverage

Blocker: missing authors

Competitive landscape

Segment: Multilingual ASR

Adoption evidence: Public code linked for build inspection

Commercial read: 8.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified

arXiv: https://arxiv.org/abs/2603.16184

Published: 2026-03-17

Code repository: https://github.com/knoveleng/polyglot-lion

Research domain: Multilingual ASR

FAQ

What products could be built from this research?

Now is the time because businesses in Southeast Asia are rapidly digitizing and facing increasing customer expectations for multilingual support, while existing ASR solutions are either too expensive, too slow, or not tailored to local language mixes, creating a gap for cost-effective, deployment-ready models.

What are the practical use cases?

A voice-based customer support system for a Singaporean bank that automatically transcribes and routes calls in English, Mandarin, Tamil, and Malay, reducing wait times and improving service quality for non-English speakers.
