ARXIV:2604.02119 · LLM COMPRESSION · SUBMITTED 03 APR · 20:50 UTC · FRESHNESS STALE

Verified Source: PDF linked · Verified PaperPack: citation fields available · Partial Proof: unverified proof status

AA-SVD: Anchored and Adaptive SVD for Large Language Model Compression

Atul Kumar Sinha · François Fleuret · arXiv

A framework for compressing large language models without retraining by accounting for input distribution shifts and refining transformer blocks end-to-end.

Blocked on Code · Score 5.0 · Evidence unverified

Opportunity summary

Pain: A framework for compressing large language models without retraining by accounting for input distribution shifts and refining transformer blocks end-to-end.

Evidence: 0 refs | 0 sources | 33% coverage

Blocker: Evidence unverified

PROBLEM

A framework for compressing large language models without retraining by accounting for input distribution shifts and refining transformer blocks end-to-end. Unlike existing factorization-based approaches that optimize only on the original inputs, ignoring distribution shifts…

METHOD

Full abstract

We introduce a fast low-rank factorization-based framework for compressing large language models that enables rapid compression of billion-parameter models without retraining. Unlike existing factorization-based approaches that optimize only on the original inputs, ignoring distribution shifts from upstream compression and thus propagating errors forward, or those that rely only on shifted inputs and risk drifting away from the original outputs, our approach accounts for both. Beyond individual layer compression, we further refine each transformer block end-to-end, minimizing block-level output distortion and allowing compressed layers to jointly compensate for accumulated errors. By anchoring each compressed layer to the original outputs while explicitly modeling input distribution shifts, our method finds a low-rank approximation that maintains functional equivalence with the original model. Experiments on large language models show that our method consistently outperforms existing SVD-based baselines across compression ratios, with the advantage becoming increasingly pronounced at aggressive compression budgets, where competing methods degrade substantially or collapse entirely, offering a practical solution for efficient, large-scale model deployment.
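One way to read the abstract's "anchored and adaptive" objective is as a reduced-rank regression: fit a rank-limited matrix so that the compressed layer, fed the shifted inputs, reproduces the original layer's outputs on the original inputs. The NumPy sketch below illustrates that reading only. It is not taken from the paper, and the function name `anchored_low_rank`, the `ridge` term, and the toy data are all assumptions made for illustration. The block-level end-to-end refinement described in the abstract is not covered.

```python
import numpy as np


def anchored_low_rank(W, X, X_shift, rank, ridge=1e-6):
    """Hypothetical helper: return factors (A, B), with A @ B of rank <= `rank`,
    that approximately minimise ||W @ X - (A @ B) @ X_shift||_F.

    W       : (d_out, d_in) original weight matrix
    X       : (d_in, n) inputs seen by the original model
    X_shift : (d_in, n) inputs seen once upstream layers are compressed
    """
    Y = W @ X                                   # anchor: original outputs
    G = X_shift @ X_shift.T                     # Gram matrix of shifted inputs
    # Small ridge (an assumption) keeps G safely positive definite.
    G = G + ridge * (np.trace(G) / G.shape[0]) * np.eye(G.shape[0])
    evals, evecs = np.linalg.eigh(G)
    G_inv_sqrt = (evecs / np.sqrt(evals)) @ evecs.T   # G^{-1/2}
    M = Y @ X_shift.T @ G_inv_sqrt              # whitened cross-term
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    A = U[:, :rank] * s[:rank]                  # (d_out, rank)
    B = Vt[:rank] @ G_inv_sqrt                  # (rank, d_in)
    return A, B


# Toy comparison against plain truncated SVD of W (synthetic data, not from the paper).
rng = np.random.default_rng(0)
W = rng.standard_normal((8, 6))
scales = np.array([3.0, 2.0, 1.0, 0.5, 0.2, 0.1])[:, None]
X = scales * rng.standard_normal((6, 500))           # anisotropic inputs
X_shift = X + 0.1 * rng.standard_normal((6, 500))    # simulated upstream drift

A, B = anchored_low_rank(W, X, X_shift, rank=3)
U, s, Vt = np.linalg.svd(W, full_matrices=False)
W_plain = (U[:, :3] * s[:3]) @ Vt[:3]

err_anchored = np.linalg.norm(W @ X - A @ B @ X_shift)
err_plain = np.linalg.norm(W @ X - W_plain @ X_shift)
print("anchored:", err_anchored, "plain SVD:", err_plain)
```

Storing `A` and `B` costs `rank * (d_out + d_in)` parameters instead of `d_out * d_in`, which is where the compression comes from. Any ranking of methods on real models should be taken from the paper's experiments, not from this toy comparison.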

RESULT

ScienceToStartup currently rates this 5.0/10 on the public viability pass.

WHY NOW

LLM Compression moved forward this cycle; last verified April 2026. Public score 5.0/10.

Competitive landscape

Segment: LLM Compression

Adoption evidence: No public code link in the paper record yet

Commercial read: 5.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified
