ARXIV:2603.07978 · AGENTS · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

OSExpert: Computer-Use Agents Learning Professional Skills via Exploration

arXiv

OSExpert enhances computer-use agents with a GUI-based exploration algorithm, achieving near-expert performance and closing the efficiency gap with humans, making it a valuable tool for automating complex digital tasks.

Blocked on Code›Score8.0Evidence unverified

Opportunity summary

Pain OSExpert enhances computer-use agents with a GUI-based exploration algorithm, achieving near-expert performance and closing the efficiency gap with humans, making it a valuable tool for automating complex digital tasks.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

METHOD

Full abstract

General-purpose computer-use agents have shown impressive performance across diverse digital environments. However, our new benchmark, OSExpert-Eval, indicates they remain far less helpful than human experts. Although inference-time scaling enables adaptation, these agents complete complex tasks inefficiently with degraded performance, transfer poorly to unseen UIs, and struggle with fine-grained action sequences. To solve the problem, we introduce a GUI-based depth-first search (GUI-DFS) exploration algorithm to comprehensively explore and verify an environment's unit functions. The agent then exploits compositionality between unit skills to self-construct a curriculum for composite tasks. To support fine-grained actions, we curate a database of action primitives for agents to discover during exploration; these are saved as a skill set once the exploration is complete. We use the learned skills to improve the agent's performance and efficiency by (1) enriching agents with ready-to-use procedural knowledge, allowing them to plan only once for long trajectories and generate accurate actions, and (2) enabling them to end inference-time scaling earlier by realizing their boundary of capabilities. Extensive experiments show that our environment-learned agent takes a meaningful step toward expert-level computer use, achieving a around 20 percent performance gain on OSExpert-Eval and closing the efficiency gap to humans by around 80 percent

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. Although inference-time scaling enables adaptation, these agents complete complex tasks inefficiently with degraded performance, transfer poorly to unseen UIs, and struggle with fine-grained action…

WHY NOW

Agents moved forward this cycle; last verified April 2026. Public score 8.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score8.0

PainOSExpert enhances computer-use agents with a GUI-based exploration algorithm, achieving near-expert performance and closing the efficiency gap with humans, making it a valuable tool for automating complex digital tasks.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

Segment

Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "e71bbb0e-0196-49d5-936a-5afd81cef003", "arxiv_id": "2603.07978", "canonical_route": "/paper/osexpert-computer-use-agents-learning-professional-skills-via-exploration", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "osexpert-computer-use-agents-learning-professional-skills-via-exploration", "endpoints": { "paper_pack": "/api/v1/paper/osexpert-computer-use-agents-learning-professional-skills-via-exploration/paper-pack", "build_passport": "/api/v1/paper/osexpert-computer-use-agents-learning-professional-skills-via-exploration/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "OSExpert: Computer-Use Agents Learning Professional Skills via Exploration", "normalized_query": "2603.07978", "route": "/paper/osexpert-computer-use-agents-learning-professional-skills-via-exploration", "paper_ref": "osexpert-computer-use-agents-learning-professional-skills-via-exploration", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/osexpert-computer-use-agents-learning-professional-skills-via-exploration#webpage", "url": "https://sciencetostartup.com/paper/osexpert-computer-use-agents-learning-professional-skills-via-exploration", "name": "OSExpert: Computer-Use Agents Learning Professional Skills via Exploration", "description": "OSExpert enhances computer-use agents with a GUI-based exploration algorithm, achieving near-expert performance and closing the efficiency gap with humans, making it a valuable tool for automating complex digital tasks.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/osexpert-computer-use-agents-learning-professional-skills-via-exploration#scholarlyArticle", "headline": "OSExpert: Computer-Use Agents Learning Professional Skills via Exploration", "description": "OSExpert enhances computer-use agents with a GUI-based exploration algorithm, achieving near-expert performance and closing the efficiency gap with humans, making it a valuable tool for automating complex digital tasks.", "url": "https://sciencetostartup.com/paper/osexpert-computer-use-agents-learning-professional-skills-via-exploration", "sameAs": "https://arxiv.org/abs/2603.07978", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.07978" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-09T05:27:56.000Z", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 8 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Agents" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Agents", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "OSExpert: Computer-Use Agents Learning Professional Skills v", "item": "https://sciencetostartup.com/paper/osexpert-computer-use-agents-learning-professional-skills-via-exploration" } ] } ] }

Competitive landscape

Segment

Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

OSExpert: Computer-Use Agents Learning Professional Skills via Exploration

OSExpert: Computer-Use Agents Learning Professional Skills via Exploration

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline