ARXIV:2603.08533 · MOBILE GUI AUTOMATION · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

Verified · Source: PDF linked
Partial · PaperPack: 3 of 4 citation fields filled
Missing · Missing fields: authors
Partial · Proof: unverified proof status

SecAgent: Efficient Mobile GUI Agent with Semantic Context

arXiv

SecAgent is a 3B-scale mobile GUI agent that automates smartphone tasks. It relies on a semantic context mechanism, which condenses past screenshots and actions into short natural language summaries, and on a new human-verified Chinese mobile GUI dataset.

Blocked on Code · Score 8.0 · Evidence unverified

Opportunity summary

Pain Existing mobile GUI agents lack high-quality multilingual datasets, particularly for non-English ecosystems, and represent interaction history inefficiently.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified


PROBLEM

Mobile GUI agents powered by multimodal large language models have shown promise in automating complex smartphone tasks. However, existing approaches face two critical limitations: the scarcity of high-quality multilingual datasets, particularly for non-English ecosystems, and inefficient history representation methods.

METHOD

Full abstract

Mobile Graphical User Interface (GUI) agents powered by multimodal large language models have demonstrated promising capabilities in automating complex smartphone tasks. However, existing approaches face two critical limitations: the scarcity of high-quality multilingual datasets, particularly for non-English ecosystems, and inefficient history representation methods. To address these challenges, we present SecAgent, an efficient mobile GUI agent at 3B scale. We first construct a human-verified Chinese mobile GUI dataset with 18k grounding samples and 121k navigation steps across 44 applications, along with a Chinese navigation benchmark featuring multi-choice action annotations. Building upon this dataset, we propose a semantic context mechanism that distills history screenshots and actions into concise, natural language summaries, significantly reducing computational costs while preserving task-relevant information. Through supervised and reinforcement fine-tuning, SecAgent outperforms similar-scale baselines and achieves performance comparable to 7B-8B models on our and public navigation benchmarks. We will open-source the training dataset, benchmark, model, and code to advance research in multilingual mobile GUI automation.
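The abstract describes the semantic context mechanism only at a high level: past screenshots and actions are distilled into concise natural language summaries. The sketch below illustrates that idea under stated assumptions and is not the authors' implementation. The names `Step`, `SemanticContext`, `build_model_input`, the action syntax, and the `max_steps` cap are all hypothetical, and the summaries are supplied by the caller rather than generated by a model.

```python
from dataclasses import dataclass, field


@dataclass
class Step:
    """One past interaction, stored as text only (no screenshot is kept)."""
    action: str   # hypothetical action syntax, e.g. "tap(search_box)"
    summary: str  # short natural language digest of that step


@dataclass
class SemanticContext:
    """Text-only history that stands in for a stack of past screenshots."""
    task: str
    steps: list[Step] = field(default_factory=list)
    max_steps: int = 8  # assumed cap on retained history, not from the paper

    def record(self, action: str, summary: str) -> None:
        """Append a step and drop the oldest entries beyond the cap."""
        self.steps.append(Step(action, summary))
        self.steps = self.steps[-self.max_steps:]

    def render(self) -> str:
        """Serialize the task and history into a compact prompt fragment."""
        lines = [f"Task: {self.task}", "History:"]
        if not self.steps:
            lines.append("  (none)")
        for i, step in enumerate(self.steps, start=1):
            lines.append(f"  {i}. {step.summary} -> {step.action}")
        return "\n".join(lines)


def build_model_input(ctx: SemanticContext, current_screenshot: str) -> dict:
    """Pair the text history with only the current screenshot."""
    return {"text": ctx.render(), "images": [current_screenshot]}


if __name__ == "__main__":
    ctx = SemanticContext(task="Order a coffee")
    ctx.record("tap(search_box)", "Home screen of the shop app was open")
    ctx.record("type('latte')", "Search box was focused")
    print(build_model_input(ctx, "step_3.png")["text"])
```

The design point is that the model input carries one image per step regardless of how long the episode runs, while the history grows only by one short line of text per step.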

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. Through supervised and reinforcement fine-tuning, SecAgent outperforms similar-scale baselines and achieves performance comparable to 7B-8B models on our and public navigation benchmarks.

WHY NOW

Mobile GUI Automation moved forward this cycle; last verified April 2026. Public score 8.0/10.




Competitive landscape


Segment: Mobile GUI Automation
Adoption evidence: No public code link in the paper record yet
Commercial read: 8.0/10 public viability
Direct: not classified
Adjacent: not classified
Substitute: not classified
Unknown: not classified

arXiv: https://arxiv.org/abs/2603.08533 · Published: 2026-03-09 16:04 UTC · Page: https://sciencetostartup.com/paper/secagent-efficient-mobile-gui-agent-with-semantic-context

