ARXIV:2602.22190 · NATIVE GUI AGENTS · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

arXiv

GUI-Libra creates more intelligent and efficient GUI agents for enhancing user experience across web and mobile applications.

Blocked on Code›Score7.0Evidence unverified

Opportunity summary

Pain GUI-Libra creates more intelligent and efficient GUI agents for enhancing user experience across web and mobile applications.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

GUI-Libra creates more intelligent and efficient GUI agents for enhancing user experience across web and mobile applications. This gap stems from two limitations: a shortage of high-quality, action-aligned reasoning data, and the direct adoption…

METHOD

Full abstract

Open-source native GUI agents still lag behind closed-source systems on long-horizon navigation tasks. This gap stems from two limitations: a shortage of high-quality, action-aligned reasoning data, and the direct adoption of generic post-training pipelines that overlook the unique challenges of GUI agents. We identify two fundamental issues in these pipelines: (i) standard SFT with CoT reasoning often hurts grounding, and (ii) step-wise RLVR-tyle training faces partial verifiability, where multiple actions can be correct but only a single demonstrated action is used for verification. This makes offline step-wise metrics weak predictors of online task success. In this work, we present GUI-Libra, a tailored training recipe that addresses these challenges. First, to mitigate the scarcity of action-aligned reasoning data, we introduce a data construction and filtering pipeline and release a curated 81K GUI reasoning dataset. Second, to reconcile reasoning with grounding, we propose action-aware SFT that mixes reasoning-then-action and direct-action data and reweights tokens to emphasize action and grounding. Third, to stabilize RL under partial verifiability, we identify the overlooked importance of KL regularization in RLVR and show that a KL trust region is critical for improving offline-to-online predictability; we further introduce success-adaptive scaling to downweight unreliable negative gradients. Across diverse web and mobile benchmarks, GUI-Libra consistently improves both step-wise accuracy and end-to-end task completion. Our results suggest that carefully designed post-training and data curation can unlock significantly stronger task-solving capabilities without costly online data collection. We release our dataset, code, and models to facilitate further research on data-efficient post-training for reasoning-capable GUI agents.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Third, to stabilize RL under partial verifiability, we identify the overlooked importance of KL regularization in RLVR and show that a KL trust region…

WHY NOW

Native GUI Agents moved forward this cycle; last verified April 2026. Public score 7.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainGUI-Libra creates more intelligent and efficient GUI agents for enhancing user experience across web and mobile applications.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

GUI-Libra creates more intelligent and efficient GUI agents for enhancing user experience across web and mobile applications.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

GUI-Libra creates more intelligent and efficient GUI agents for enhancing user experience across web and mobile applications.

Segment

Native GUI Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "b4b877da-0972-42ef-9eec-2f4cd63e1c72", "arxiv_id": "2602.22190", "canonical_route": "/paper/gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl", "endpoints": { "paper_pack": "/api/v1/paper/gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl/paper-pack", "build_passport": "/api/v1/paper/gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL", "normalized_query": "2602.22190", "route": "/paper/gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl", "paper_ref": "gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl#webpage", "url": "https://sciencetostartup.com/paper/gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl", "name": "GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL", "description": "GUI-Libra creates more intelligent and efficient GUI agents for enhancing user experience across web and mobile applications.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl#scholarlyArticle", "headline": "GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL", "description": "GUI-Libra creates more intelligent and efficient GUI agents for enhancing user experience across web and mobile applications.", "url": "https://sciencetostartup.com/paper/gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl", "sameAs": "https://arxiv.org/abs/2602.22190", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2602.22190" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-02-25T18:34:57.000Z", "author": [ { "@type": "Person", "name": "Rui Yang", "affiliation": { "@type": "Organization", "name": "UIUC" } }, { "@type": "Person", "name": "Qianhui Wu", "affiliation": { "@type": "Organization", "name": "Microsoft" } }, { "@type": "Person", "name": "Zhaoyang Wang", "affiliation": { "@type": "Organization", "name": "UNC-Chapel Hill" } } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Native GUI Agents" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Native GUI Agents", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "GUI-Libra: Training Native GUI Agents to Reason and Act with", "item": "https://sciencetostartup.com/paper/gui-libra-training-native-gui-agents-to-reason-and-act-with-action-aware-supervision-and-partially-verifiable-rl" } ] }, { "@type": "FAQPage", "mainEntity": [ { "@type": "Question", "name": "What is the startup potential of \"GUI-Libra: Training Native GUI Agents to Reason and Act with\"?", "acceptedAnswer": { "@type": "Answer", "text": "GUI-Libra creates more intelligent and efficient GUI agents for enhancing user experience across web and mobile applications." } }, { "@type": "Question", "name": "What products could be built from this research?", "acceptedAnswer": { "@type": "Answer", "text": "GUI-Libra could be productized as an API for software developers to integrate advanced GUI interaction capabilities into their applications, enhancing user experience while reducing manual effort." } }, { "@type": "Question", "name": "What are the practical use cases?", "acceptedAnswer": { "@type": "Answer", "text": "A virtual assistant for web and mobile platforms that can perform complex, task-oriented interactions like booking tickets or managing emails with high precision and minimal user input." } }, { "@type": "Question", "name": "What industries could this research disrupt?", "acceptedAnswer": { "@type": "Answer", "text": "GUI-Libra could replace existing GUI interaction frameworks that lack sophisticated reasoning capabilities, offering improved automation and interaction accuracy." } } ] } ] }

Competitive landscape

GUI-Libra creates more intelligent and efficient GUI agents for enhancing user experience across web and mobile applications.

Segment

Native GUI Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline