ARXIV:2603.01712 · LLM FINE-TUNING · SUBMITTED 17 MAR · 19:46 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsErrorProof: failed

FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

arXiv

FT-Dojo automates LLM fine-tuning with agents to streamline domain-specific model optimization.

Blocked on Code›Score8.0Evidence failed

Opportunity summary

Pain FT-Dojo automates LLM fine-tuning with agents to streamline domain-specific model optimization.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence failed

Open Build Read PDF Signal Canvas Track

PROBLEM

FT-Dojo automates LLM fine-tuning with agents to streamline domain-specific model optimization. Despite growing interest in autonomous machine learning, no prior work has tackled end-to-end LLM fine-tuning with agents.

METHOD

Full abstract

Fine-tuning large language models for vertical domains remains a labor-intensive and expensive process, requiring domain experts to curate data, configure training, and iteratively diagnose model behavior. Despite growing interest in autonomous machine learning, no prior work has tackled end-to-end LLM fine-tuning with agents. Can LLM-based agents automate this complete process? We frame this as a substantially open problem: agents must navigate an open-ended search space spanning data curation from diverse data sources, processing with complex tools, building a training pipeline, and iteratively refining their approach based on evaluation outcomes in rapidly growing logs--an overall scenario far more intricate than existing benchmarks. To study this question, we introduce FT-Dojo, an interactive environment comprising 13 tasks across 5 domains. We further develop FT-Agent, an autonomous system that mirrors human experts by leveraging evaluation-driven feedback to iteratively diagnose failures and refine fine-tuning strategies. Experiments on FT-Dojo demonstrate that purpose-built fine-tuning agents significantly outperform general-purpose alternatives, with FT-Agent achieving the best performance on 10 out of 13 tasks across all five domains. Ablations show that the approach generalizes effectively to 3B models, with additional insights on data scaling trade-offs and backbone sensitivity. Case analyses reveal that agents can recover from failures through cumulative learning from historical experience, while also exposing fundamental limitations in causal reasoning--highlighting both the promise and current boundaries of autonomous LLM fine-tuning.

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. Experiments on FT-Dojo demonstrate that purpose-built fine-tuning agents significantly outperform general-purpose alternatives, with FT-Agent achieving the best performance on 10 out of 13 tasks…

WHY NOW

LLM Fine-Tuning moved forward this cycle; last verified April 2026. Public score 8.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score8.0

PainFT-Dojo automates LLM fine-tuning with agents to streamline domain-specific model optimization.

Evidence0 refs | 0 sources | 33% coverage

Blockermissing authors

Analysis summary

FT-Dojo automates LLM fine-tuning with agents to streamline domain-specific model optimization.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsErrorProof: failed

Competitive landscape

FT-Dojo automates LLM fine-tuning with agents to streamline domain-specific model optimization.

Segment

LLM Fine-Tuning

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "0f140142-3969-4267-bcd9-af7ebd420ad0", "arxiv_id": "2603.01712", "canonical_route": "/paper/ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents", "endpoints": { "paper_pack": "/api/v1/paper/ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents/paper-pack", "build_passport": "/api/v1/paper/ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents", "normalized_query": "2603.01712", "route": "/paper/ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents", "paper_ref": "ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents#webpage", "url": "https://sciencetostartup.com/paper/ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents", "name": "FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents", "description": "FT-Dojo automates LLM fine-tuning with agents to streamline domain-specific model optimization.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents#scholarlyArticle", "headline": "FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents", "description": "FT-Dojo automates LLM fine-tuning with agents to streamline domain-specific model optimization.", "url": "https://sciencetostartup.com/paper/ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents", "sameAs": "https://arxiv.org/abs/2603.01712", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.01712" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-02T10:37:11.000Z", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 8 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Fine-Tuning" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Fine-Tuning", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Ag", "item": "https://sciencetostartup.com/paper/ft-dojo-towards-autonomous-llm-fine-tuning-with-language-agents" } ] } ] }

Competitive landscape

FT-Dojo automates LLM fine-tuning with agents to streamline domain-specific model optimization.

Segment

LLM Fine-Tuning

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline