ARXIV:2603.09046 · MOBILE AI SECURITY · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

arXiv

FlexServe is a fast and secure LLM serving system for mobile devices that optimizes resource isolation for enhanced performance.

Blocked on Code›Score4.0Evidence unverified

Opportunity summary

Pain FlexServe is a fast and secure LLM serving system for mobile devices that optimizes resource isolation for enhanced performance.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

FlexServe is a fast and secure LLM serving system for mobile devices that optimizes resource isolation for enhanced performance. During LLM inference, both model weights and user data are valuable, and attackers may even…

METHOD

Full abstract

Device-side Large Language Models (LLMs) have witnessed explosive growth, offering higher privacy and availability compared to cloud-side LLMs. During LLM inference, both model weights and user data are valuable, and attackers may even compromise the OS kernel to steal them. ARM TrustZone is the de facto hardware-based isolation technology on mobile devices, used to protect sensitive applications from a compromised OS. However, protecting LLM inference with TrustZone incurs significant overhead due to its inflexible isolation of memory and the NPU. To address these challenges, this paper introduces FlexServe, a fast and secure LLM serving system for mobile devices. It first introduces a Flexible Resource Isolation mechanism to construct Flexible Secure Memory (Flex-Mem) and Flexible Secure NPU (Flex-NPU). Both memory pages and the NPU can be efficiently switched between unprotected and protected modes. Based on these mechanisms, FlexServe designs a fast and secure LLM inference framework within TrustZone's secure world. The LLM-Aware Memory Management and Secure Inference Pipeline are introduced to accelerate inference. A Multi-Model Scheduler is proposed to optimize multi-model workflows. We implement a prototype of FlexServe and compare it with two TrustZone-based strawman designs. The results show that FlexServe achieves an average $10.05\times$ speedup in Time to First Token (TTFT) compared to the strawman, and an average $2.44\times$ TTFT speedup compared to an optimized strawman with pipeline and secure NPU enabled. For multi-model agent workflows, the end-to-end speedup is up to $24.30\times$ and $4.05\times$ compared to the strawman and optimized strawman, respectively.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. The results show that FlexServe achieves an average $10.05\times$ speedup in Time to First Token (TTFT) compared to the strawman, and an average $2.44\times$…

WHY NOW

Mobile AI Security moved forward this cycle; last verified April 2026. Public score 4.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score4.0

PainFlexServe is a fast and secure LLM serving system for mobile devices that optimizes resource isolation for enhanced performance.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

FlexServe is a fast and secure LLM serving system for mobile devices that optimizes resource isolation for enhanced performance.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

FlexServe is a fast and secure LLM serving system for mobile devices that optimizes resource isolation for enhanced performance.

Segment

Mobile AI Security

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "e7690060-5f09-49cd-8146-7619f40100ea", "arxiv_id": "2603.09046", "canonical_route": "/paper/flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation", "endpoints": { "paper_pack": "/api/v1/paper/flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation/paper-pack", "build_passport": "/api/v1/paper/flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation", "normalized_query": "2603.09046", "route": "/paper/flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation", "paper_ref": "flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation#webpage", "url": "https://sciencetostartup.com/paper/flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation", "name": "FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation", "description": "FlexServe is a fast and secure LLM serving system for mobile devices that optimizes resource isolation for enhanced performance.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation#scholarlyArticle", "headline": "FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation", "description": "FlexServe is a fast and secure LLM serving system for mobile devices that optimizes resource isolation for enhanced performance.", "url": "https://sciencetostartup.com/paper/flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation", "sameAs": "https://arxiv.org/abs/2603.09046", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.09046" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-10T00:31:25.000Z", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 4 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Mobile AI Security" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Mobile AI Security", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "FlexServe: A Fast and Secure LLM Serving System for Mobile D", "item": "https://sciencetostartup.com/paper/flexserve-a-fast-and-secure-llm-serving-system-for-mobile-devices-with-flexible-resource-isolation" } ] } ] }

Competitive landscape

FlexServe is a fast and secure LLM serving system for mobile devices that optimizes resource isolation for enhanced performance.

Segment

Mobile AI Security

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline