ARXIV:2603.26246 · ASR · SUBMITTED 30 MAR · 21:58 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Shashi Kumar · Esaú Villatoro-Tello · Sergio Burdisso · Kadri Hacioglu · Thibault Bañeras-Roux · Hasindri Watawana · +4 at arXiv

A method to compress conversational audio context for improved LLM-based speech recognition, reducing computational cost.

Blocked on Code›Score4.0Evidence unverified

Opportunity summary

Pain A method to compress conversational audio context for improved LLM-based speech recognition, reducing computational cost.

Evidence 14 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A method to compress conversational audio context for improved LLM-based speech recognition, reducing computational cost. In this work, we study whether multimodal context from prior turns improves LLM-based ASR and how to represent that…

METHOD

Full abstract

Standard LLM-based speech recognition systems typically process utterances in isolation, limiting their ability to leverage conversational context. In this work, we study whether multimodal context from prior turns improves LLM-based ASR and how to represent that context efficiently. We find that, after supervised multi-turn training, conversational context mainly helps with the recognition of contextual entities. However, conditioning on raw context is expensive because the prior-turn audio token sequence grows rapidly with conversation length. To address this, we propose Abstract Compression, which replaces the audio portion of prior turns with a fixed number of learned latent tokens while retaining corresponding transcripts explicitly. On both in-domain and out-of-domain test sets, the compressed model recovers part of the gains of raw-context conditioning with a smaller prior-turn audio footprint. We also provide targeted analyses of the compression setup and its trade-offs.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. In this work, we study whether multimodal context from prior turns improves LLM-based ASR and how to represent that context efficiently.

WHY NOW

ASR moved forward this cycle; last verified April 2026. Public score 4.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score4.0

PainA method to compress conversational audio context for improved LLM-based speech recognition, reducing computational cost.

Evidence14 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A method to compress conversational audio context for improved LLM-based speech recognition, reducing computational cost.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A method to compress conversational audio context for improved LLM-based speech recognition, reducing computational cost.

Segment

ASR

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "29d55d2b-56ce-4f6e-a9a6-8f82f7691e5b", "arxiv_id": "2603.26246", "canonical_route": "/paper/distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr", "endpoints": { "paper_pack": "/api/v1/paper/distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr/paper-pack", "build_passport": "/api/v1/paper/distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR", "normalized_query": "2603.26246", "route": "/paper/distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr", "paper_ref": "distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr#webpage", "url": "https://sciencetostartup.com/paper/distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr", "name": "Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR", "description": "A method to compress conversational audio context for improved LLM-based speech recognition, reducing computational cost.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr#scholarlyArticle", "headline": "Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR", "description": "A method to compress conversational audio context for improved LLM-based speech recognition, reducing computational cost.", "url": "https://sciencetostartup.com/paper/distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr", "sameAs": "https://arxiv.org/abs/2603.26246", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.26246" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-27T10:09:30.000Z", "author": [ { "@type": "Person", "name": "Shashi Kumar" }, { "@type": "Person", "name": "Esaú Villatoro-Tello" }, { "@type": "Person", "name": "Sergio Burdisso" }, { "@type": "Person", "name": "Kadri Hacioglu" }, { "@type": "Person", "name": "Thibault Bañeras-Roux" }, { "@type": "Person", "name": "Hasindri Watawana" }, { "@type": "Person", "name": "Dairazalia Sanchez-Cortes" }, { "@type": "Person", "name": "Srikanth Madikeri" }, { "@type": "Person", "name": "Petr Motlicek" }, { "@type": "Person", "name": "Andreas Stolcke" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 4 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "ASR" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "ASR", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Distilling Conversations: Abstract Compression of Conversati", "item": "https://sciencetostartup.com/paper/distilling-conversations-abstract-compression-of-conversational-audio-context-for-llm-based-asr" } ] } ] }

Competitive landscape

A method to compress conversational audio context for improved LLM-based speech recognition, reducing computational cost.

Segment

ASR

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline