ARXIV:2604.04440 · LLM TRAINING · SUBMITTED 07 APR · 20:14 UTC · FRESHNESS UNKNOWN

Source: PDF linked (Verified) · PaperPack: citation fields available (Verified) · Proof: unverified proof status (Partial)

Training Transformers in Cosine Coefficient Space

Mohamed Amine Bergach · arXiv

A novel method for training transformers by parameterizing weights in cosine coefficient space, reducing parameter count while maintaining performance.

Blocked on Code · Score 3.0 · Evidence unverified

Opportunity summary

Pain A novel method for training transformers by parameterizing weights in cosine coefficient space, reducing parameter count while maintaining performance.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified


PROBLEM

A novel method for training transformers by parameterizing weights in cosine coefficient space, reducing parameter count while maintaining performance. At each forward pass the full weight matrix is reconstructed via the inverse DCT; gradients…

METHOD

Full abstract

We parameterize the weight matrices of a transformer in the two-dimensional discrete cosine transform (DCT) domain, retaining only the lowest-frequency coefficients. At each forward pass the full weight matrix is reconstructed via the inverse DCT; gradients propagate through the reconstruction to update the spectral coefficients directly. On character-level language modeling (Shakespeare, 1M characters), a 4-layer transformer trained from scratch in this representation matches the perplexity of the standard parameterization (6.1 vs. 6.1) while storing 52% of the parameters. At 4× compression (29% of parameters), the model reaches perplexity 6.9, outperforming a low-rank baseline (perplexity 8.8 at 21% of parameters) at a comparable reduction. The method requires no architectural changes, no pre-trained checkpoint, and no auxiliary loss. It reduces to replacing each nn.Linear with a drop-in spectral layer that stores K DCT coefficients instead of n × m weights.
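The reconstruction step described in the abstract can be illustrated with a minimal sketch in plain Python. It assumes the retained low-frequency coefficients form the top-left block of the 2D DCT spectrum and that the orthonormal DCT-II convention is used; the abstract does not confirm either choice. The function names (dct_basis, reconstruct_weight, coeff_grad) and the sizes in the usage lines are hypothetical and are not taken from the paper.

```python
import math

def dct_basis(n):
    """Orthonormal DCT-II matrix C (n x n); row k is the k-th cosine basis vector."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows

def matmul(a, b):
    """Plain list-of-lists matrix product."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(a):
    return [list(col) for col in zip(*a)]

def reconstruct_weight(coeffs, n, m):
    """Inverse 2D DCT of a zero-padded coefficient block: W = C_n^T S C_m.

    Assumption: coeffs is the top-left (lowest-frequency) block of the spectrum.
    """
    kr, kc = len(coeffs), len(coeffs[0])
    padded = [[coeffs[u][v] if u < kr and v < kc else 0.0 for v in range(m)]
              for u in range(n)]
    return matmul(matmul(transpose(dct_basis(n)), padded), dct_basis(m))

def coeff_grad(weight_grad, kr, kc):
    """Chain rule through the reconstruction: dL/dS = C_n (dL/dW) C_m^T,
    cropped to the retained kr x kc block."""
    n, m = len(weight_grad), len(weight_grad[0])
    full = matmul(matmul(dct_basis(n), weight_grad), transpose(dct_basis(m)))
    return [row[:kc] for row in full[:kr]]

# Hypothetical sizes: 4 stored coefficients reconstruct a 4 x 6 weight matrix.
coeffs = [[1.0, 0.5], [0.25, 0.0]]
W = reconstruct_weight(coeffs, n=4, m=6)
```

Because the basis is orthonormal, the gradient with respect to the stored coefficients is the forward 2D DCT of the weight gradient restricted to the retained block, so applying coeff_grad to W itself recovers coeffs. In an autograd framework this backward step would come for free from differentiating the reconstruction.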

RESULT

ScienceToStartup currently rates this paper 3.0/10 on the public viability pass. The method reduces to replacing each nn.Linear with a drop-in spectral layer that stores K DCT coefficients instead of n × m weights.
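The storage saving from keeping K coefficients instead of n × m weights is simple arithmetic, sketched below. The helper names and all layer sizes are hypothetical and are not the paper's configuration; the sketch also assumes that any parameters outside the spectral layers are stored densely.

```python
def spectral_param_fraction(n, m, k):
    """Fraction of an n x m weight matrix that is stored when keeping k coefficients."""
    if not 0 < k <= n * m:
        raise ValueError("k must be between 1 and n * m")
    return k / (n * m)

def model_param_fraction(layers, uncompressed=0):
    """Stored / dense parameter count for a list of (n, m, k) spectral layers.

    uncompressed counts parameters that stay dense (an assumption of this sketch).
    """
    dense = uncompressed + sum(n * m for n, m, _ in layers)
    stored = uncompressed + sum(k for _, _, k in layers)
    return stored / dense

# Hypothetical example: one 128 x 128 layer keeping 4096 coefficients.
layer_fraction = spectral_param_fraction(128, 128, 4096)

# Four such layers plus 8192 dense parameters.
model_fraction = model_param_fraction([(128, 128, 4096)] * 4, uncompressed=8192)
```

In this hypothetical setup each layer stores a quarter of its dense parameters, while the whole-model fraction is one third because the dense parameters are not compressed.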

WHY NOW

LLM Training moved forward this cycle; last verified April 2026. Public score 3.0/10.


Opportunity summary

Score 3.0

Pain A novel method for training transformers by parameterizing weights in cosine coefficient space, reducing parameter count while maintaining performance.

Evidence 0 refs | 0 sources | 0% coverage

Blocker no shell-level blocker reported


Competitive landscape

A novel method for training transformers by parameterizing weights in cosine coefficient space, reducing parameter count while maintaining performance.

Segment: LLM Training

Adoption evidence: No public code link in the paper record yet

Commercial read: 3.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified

Source: https://arxiv.org/abs/2604.04440 · Published 2026-04-06 05:39:31 UTC · Page: https://sciencetostartup.com/paper/training-transformers-in-cosine-coefficient-space
