ARXIV:2605.00642 · GUI INTERACTION · SUBMITTED 04 MAY · 20:21 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding

Yan Zhang · Daiqing Wu · Huawen Shen · Yu Zhou · Can Ma · arXiv

Develop an AI tool using on-policy self-distillation to automate GUI interactions, enhancing productivity for software developers.

Ship in 2-4 weeks›Score5.0Evidence unverified

Opportunity summary

Pain Develop an AI tool using on-policy self-distillation to automate GUI interactions, enhancing productivity for software developers.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Develop an AI tool using on-policy self-distillation to automate GUI interactions, enhancing productivity for software developers. Recent reinforcement learning methods (e.g., GRPO) have achieved strong performance, but they rely on expensive multiple rollouts and…

METHOD

Full abstract

Graphical User Interface (GUI) grounding maps natural language instructions to the visual coordinates of target elements and serves as a core capability for autonomous GUI agents. Recent reinforcement learning methods (e.g., GRPO) have achieved strong performance, but they rely on expensive multiple rollouts and suffer from sparse signals on hard samples. These limitations make on-policy self-distillation (OPSD), which provides dense token-level supervision from a single rollout, a promising alternative. However, its applicability to GUI grounding remains unexplored. In this paper, we present GUI-SD, the first OPSD framework tailored for GUI grounding. First, it constructs a visually enriched privileged context for the teacher using a target bounding box and a Gaussian soft mask, providing informative guidance without leaking exact coordinates. Second, it employs entropy-guided distillation, which adaptively weights tokens based on digit significance and teacher confidence, concentrating optimization on the most impactful and reliable positions. Extensive experiments on six representative GUI grounding benchmarks show that GUI-SD consistently outperforms GRPO-based methods and naive OPSD in both accuracy and training efficiency. Code and training data are available at https://zhangyan-ucas.github.io/GUI-SD/.

RESULT

ScienceToStartup currently rates this 5.0/10 on the public viability pass. Extensive experiments on six representative GUI grounding benchmarks show that GUI-SD consistently outperforms GRPO-based methods and naive OPSD in both accuracy and training efficiency.…

WHY NOW

GUI Interaction moved forward this cycle; last verified May 2026. Public score 5.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score5.0

PainDevelop an AI tool using on-policy self-distillation to automate GUI interactions, enhancing productivity for software developers.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

Develop an AI tool using on-policy self-distillation to automate GUI interactions, enhancing productivity for software developers.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Develop an AI tool using on-policy self-distillation to automate GUI interactions, enhancing productivity for software developers.

Segment

GUI Interaction

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "9c7b38c0-542f-4234-af30-09e401df728f", "arxiv_id": "2605.00642", "canonical_route": "/paper/learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding", "endpoints": { "paper_pack": "/api/v1/paper/learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding/paper-pack", "build_passport": "/api/v1/paper/learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding", "normalized_query": "2605.00642", "route": "/paper/learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding", "paper_ref": "learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding#webpage", "url": "https://sciencetostartup.com/paper/learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding", "name": "Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding", "description": "Develop an AI tool using on-policy self-distillation to automate GUI interactions, enhancing productivity for software developers.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding#scholarlyArticle", "headline": "Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding", "description": "Develop an AI tool using on-policy self-distillation to automate GUI interactions, enhancing productivity for software developers.", "url": "https://sciencetostartup.com/paper/learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding", "sameAs": "https://arxiv.org/abs/2605.00642", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.00642" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-01T13:23:26.000Z", "author": [ { "@type": "Person", "name": "Yan Zhang", "affiliation": { "@type": "Organization", "name": "Institute of Information Engineering, Chinese Academy of Sciences" } }, { "@type": "Person", "name": "Daiqing Wu", "affiliation": { "@type": "Organization", "name": "Institute of Information Engineering, Chinese Academy of Sciences" } }, { "@type": "Person", "name": "Huawen Shen", "affiliation": { "@type": "Organization", "name": "Institute of Information Engineering, Chinese Academy of Sciences" } }, { "@type": "Person", "name": "Yu Zhou", "affiliation": { "@type": "Organization", "name": "College of Computer Science, Nankai University" } }, { "@type": "Person", "name": "Can Ma", "affiliation": { "@type": "Organization", "name": "Institute of Information Engineering, Chinese Academy of Sciences" } } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 5 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "GUI Interaction" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "GUI Interaction", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Learn where to Click from Yourself: On-Policy Self-Distillat", "item": "https://sciencetostartup.com/paper/learn-where-to-click-from-yourself-on-policy-self-distillation-for-gui-grounding" } ] }, { "@type": "FAQPage", "mainEntity": [ { "@type": "Question", "name": "What is the startup potential of \"Learn where to Click from Yourself: On-Policy Self-Distillat\"?", "acceptedAnswer": { "@type": "Answer", "text": "Develop an AI tool using on-policy self-distillation to automate GUI interactions, enhancing productivity for software developers." } }, { "@type": "Question", "name": "What products could be built from this research?", "acceptedAnswer": { "@type": "Answer", "text": "To productize, focus on building a tool or an API that integrates with popular software suites to automate mundane user tasks, providing time savings for users." } }, { "@type": "Question", "name": "What are the practical use cases?", "acceptedAnswer": { "@type": "Answer", "text": "Create an AI tool that can learn and automate repetitive tasks in any application with a graphical user interface, increasing efficiency for software testers and designers." } }, { "@type": "Question", "name": "What industries could this research disrupt?", "acceptedAnswer": { "@type": "Answer", "text": "This could replace manual scripting in software automation, reducing the need for specialized programming skills to automate repetitive tasks." } } ] } ] }

Competitive landscape

Develop an AI tool using on-policy self-distillation to automate GUI interactions, enhancing productivity for software developers.

Segment

GUI Interaction

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding

Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline