ARXIV:2605.15871 · AI ARCHITECTURES · SUBMITTED 18 MAY · 20:27 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design

Alberto Pepe · Chien-Yu Lin · Despoina Magka · Bilge Acun · Yannan Nellie Wu · Anton Protopopov · +2 at arXiv

AIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2.

Ship in 2-4 weeks›Score9.0Evidence unverified

Opportunity summary

Pain AIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2.

Evidence 0 refs | 4 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

AIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2. We introduce a dual-framework approach: AIRA-Compose for high-level architecture search, and AIRA-Design for low-level mechanistic implementation.

METHOD

Full abstract

Toward recursive self-improvement, we investigate LLM agents autonomously designing foundation models beyond standard Transformers. We introduce a dual-framework approach: AIRA-Compose for high-level architecture search, and AIRA-Design for low-level mechanistic implementation. AIRA-Compose uses 11 agents to explore fundamental computational primitives under a 24-hour budget. Agents evaluate million-parameter candidates, extrapolating top designs to 350M, 1B, and 3B scales. This yields 14 architectures across two families: AIRAformers (Transformer-based) and AIRAhybrids (Transformer-Mamba). Pre-trained at 1B scale, these consistently outperform Llama 3.2 and Composer-found baselines. On downstream tasks, AIRAformer-D and AIRAhybrid-D improve accuracy by 2.4% and 3.8% over Llama 3.2. Furthermore, AIRA-Compose finds models with highly efficient scaling frontiers: AIRAformer-C scales 54% and 71% faster than Llama 3.2 and Composer's best Transformer, while AIRAhybrid-C outscales Nemotron-2 by 23% and Composer's best hybrid by 37%. AIRA-Design tasks 20 agents with writing novel attention mechanisms for long-range dependencies and high-performing training scripts. On the Long Range Arena benchmark, agent-designed architectures reach within 2.3% and 2.6% of human state-of-the-art on document matching and text classification. On the Autoresearch benchmark, Greedy Opus 4.5 achieves 0.968 validation bits-per-byte under a fixed time budget, surpassing the published minimum. Together, these frameworks show AI agents can autonomously discover architectures and algorithmic optimizations matching or surpassing hand-designed baselines. This establishes a powerful paradigm for discovering next-generation foundation models, marking a clear step toward recursive self-improvement.

RESULT

ScienceToStartup currently rates this 9.0/10 on the public viability pass. On downstream tasks, AIRAformer-D and AIRAhybrid-D improve accuracy by 2.4% and 3.8% over Llama 3.2. A public repository is linked, so build verification can…

WHY NOW

AI Architectures moved forward this cycle; last verified May 2026. Public score 9.0/10. Implementation evidence is present through a linked repository.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score9.0

PainAIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2.

Evidence0 refs | 4 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

AIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

AIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2.

Segment

AI Architectures

Adoption evidence

Public code linked for build inspection

Commercial read

9.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "98b377b6-e53f-4351-8395-313052a1f44b", "arxiv_id": "2605.15871", "canonical_route": "/paper/agentic-discovery-of-neural-architectures-aira-compose-and-aira-design", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "agentic-discovery-of-neural-architectures-aira-compose-and-aira-design", "endpoints": { "paper_pack": "/api/v1/paper/agentic-discovery-of-neural-architectures-aira-compose-and-aira-design/paper-pack", "build_passport": "/api/v1/paper/agentic-discovery-of-neural-architectures-aira-compose-and-aira-design/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design", "normalized_query": "2605.15871", "route": "/paper/agentic-discovery-of-neural-architectures-aira-compose-and-aira-design", "paper_ref": "agentic-discovery-of-neural-architectures-aira-compose-and-aira-design", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/agentic-discovery-of-neural-architectures-aira-compose-and-aira-design#webpage", "url": "https://sciencetostartup.com/paper/agentic-discovery-of-neural-architectures-aira-compose-and-aira-design", "name": "Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design", "description": "AIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/agentic-discovery-of-neural-architectures-aira-compose-and-aira-design#scholarlyArticle", "headline": "Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design", "description": "AIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2.", "url": "https://sciencetostartup.com/paper/agentic-discovery-of-neural-architectures-aira-compose-and-aira-design", "sameAs": "https://arxiv.org/abs/2605.15871", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.15871" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-15T11:40:41.000Z", "author": [ { "@type": "Person", "name": "Alberto Pepe", "affiliation": { "@type": "Organization", "name": "FAIR at Meta" } }, { "@type": "Person", "name": "Chien-Yu Lin", "affiliation": { "@type": "Organization", "name": "FAIR at Meta" } }, { "@type": "Person", "name": "Despoina Magka", "affiliation": { "@type": "Organization", "name": "FAIR at Meta" } }, { "@type": "Person", "name": "Bilge Acun", "affiliation": { "@type": "Organization", "name": "FAIR at Meta" } }, { "@type": "Person", "name": "Yannan Nellie Wu", "affiliation": { "@type": "Organization", "name": "FAIR at Meta" } }, { "@type": "Person", "name": "Anton Protopopov", "affiliation": { "@type": "Organization", "name": "FAIR at Meta" } }, { "@type": "Person", "name": "Carole-Jean Wu", "affiliation": { "@type": "Organization", "name": "FAIR at Meta" } }, { "@type": "Person", "name": "Yoram Bachrach", "affiliation": { "@type": "Organization", "name": "FAIR at Meta" } } ], "codeRepository": "https://github.com/facebookresearch/repo", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 9 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "AI Architectures" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code, repo url" } ] }, { "@type": "SoftwareSourceCode", "@id": "https://sciencetostartup.com/paper/agentic-discovery-of-neural-architectures-aira-compose-and-aira-design#software", "name": "Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design - Source Code", "description": "AIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2.", "codeRepository": "https://github.com/facebookresearch/repo", "url": "https://github.com/facebookresearch/repo" }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "AI Architectures", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Agentic Discovery of Neural Architectures: AIRA-Compose and ", "item": "https://sciencetostartup.com/paper/agentic-discovery-of-neural-architectures-aira-compose-and-aira-design" } ] }, { "@type": "FAQPage", "mainEntity": [ { "@type": "Question", "name": "What is the startup potential of \"Agentic Discovery of Neural Architectures: AIRA-Compose and \"?", "acceptedAnswer": { "@type": "Answer", "text": "AIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2." } }, { "@type": "Question", "name": "What products could be built from this research?", "acceptedAnswer": { "@type": "Answer", "text": "The productization would involve offering an AI architecture optimization service or toolkit that companies in AI research and development can integrate into their model development pipelines." } }, { "@type": "Question", "name": "What are the practical use cases?", "acceptedAnswer": { "@type": "Answer", "text": "AIRA-Compose and AIRA-Design could be used to optimize AI architectures for companies developing advanced machine learning models, enabling more efficient and effective designs tailored for specific computational constraints and performance targets." } }, { "@type": "Question", "name": "What industries could this research disrupt?", "acceptedAnswer": { "@type": "Answer", "text": "These frameworks can replace traditional human-driven neural architecture design processes, potentially leading to more innovative AI models developed with less manual intervention and greater adaptive capabilities." } } ] } ] }

Competitive landscape

AIRA-Compose and AIRA-Design autonomously generate novel AI architectures surpassing existing models like Llama 3.2.

Segment

AI Architectures

Adoption evidence

Public code linked for build inspection

Commercial read

9.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design

Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline