ARXIV:2603.23781 · LLM SECURITY ANALYSIS · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Leveraging Large Language Models for Trustworthiness Assessment of Web Applications

Oleksandr Yarotskyi · José D'Abruzzo Pereira · João R. Campos · arXiv

Leveraging LLMs to automate the trustworthiness assessment of web applications by verifying adherence to secure coding practices.

Blocked on Code›Score5.0Evidence unverified

Opportunity summary

Pain Leveraging LLMs to automate the trustworthiness assessment of web applications by verifying adherence to secure coding practices.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Leveraging LLMs to automate the trustworthiness assessment of web applications by verifying adherence to secure coding practices. However, "trust" assessment remains an open problem as existing techniques primarily focus on detecting known vulnerabilities or…

METHOD

Full abstract

The widespread adoption of web applications has made their security a critical concern and has increased the need for systematic ways to assess whether they can be considered trustworthy. However, "trust" assessment remains an open problem as existing techniques primarily focus on detecting known vulnerabilities or depend on manual evaluation, which limits their scalability; therefore, evaluating adherence to secure coding practices offers a complementary, pragmatic perspective by focusing on observable development behaviors. In practice, the identification and verification of secure coding practices are predominantly performed manually, relying on expert knowledge and code reviews, which is time-consuming, subjective, and difficult to scale. This study presents an empirical methodology to automate the trustworthiness assessment of web applications by leveraging Large Language Models (LLMs) to verify adherence to secure coding practices. We conduct a comparative analysis of prompt engineering techniques across five state-of-the-art LLMs, ranging from baseline zero-shot classification to prompts enriched with semantic definitions, structural context derived from call graphs, and explicit instructional guidance. Furthermore, we propose an extension of a hierarchical Quality Model (QM) based on the Logic Score of Preference (LSP), in which LLM outputs are used to populate the model's quality attributes and compute a holistic trustworthiness score. Experimental results indicate that excessive structural context can introduce noise, whereas rule-based instructional prompting improves assessment reliability. The resulting trustworthiness score allows discriminating between secure and vulnerable implementations, supporting the feasibility of using LLMs for scalable and context-aware trust assessment.

RESULT

ScienceToStartup currently rates this 5.0/10 on the public viability pass. Experimental results indicate that excessive structural context can introduce noise, whereas rule-based instructional prompting improves assessment reliability.

WHY NOW

LLM Security Analysis moved forward this cycle; last verified April 2026. Public score 5.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score5.0

PainLeveraging LLMs to automate the trustworthiness assessment of web applications by verifying adherence to secure coding practices.

Evidence0 refs | 0 sources | 17% coverage

Blockerno shell-level blocker reported

Analysis summary

Leveraging LLMs to automate the trustworthiness assessment of web applications by verifying adherence to secure coding practices.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Leveraging LLMs to automate the trustworthiness assessment of web applications by verifying adherence to secure coding practices.

Segment

LLM Security Analysis

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "b9995bf9-5668-4dca-9c76-0dd1b50a5f44", "arxiv_id": "2603.23781", "canonical_route": "/paper/leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications", "endpoints": { "paper_pack": "/api/v1/paper/leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications/paper-pack", "build_passport": "/api/v1/paper/leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Leveraging Large Language Models for Trustworthiness Assessment of Web Applications", "normalized_query": "2603.23781", "route": "/paper/leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications", "paper_ref": "leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications#webpage", "url": "https://sciencetostartup.com/paper/leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications", "name": "Leveraging Large Language Models for Trustworthiness Assessment of Web Applications", "description": "Leveraging LLMs to automate the trustworthiness assessment of web applications by verifying adherence to secure coding practices.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications#scholarlyArticle", "headline": "Leveraging Large Language Models for Trustworthiness Assessment of Web Applications", "description": "Leveraging LLMs to automate the trustworthiness assessment of web applications by verifying adherence to secure coding practices.", "url": "https://sciencetostartup.com/paper/leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications", "sameAs": "https://arxiv.org/abs/2603.23781", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.23781" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-24T23:33:54.000Z", "author": [ { "@type": "Person", "name": "Oleksandr Yarotskyi" }, { "@type": "Person", "name": "José D'Abruzzo Pereira" }, { "@type": "Person", "name": "João R. Campos" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 5 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Security Analysis" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Security Analysis", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Leveraging Large Language Models for Trustworthiness Assessm", "item": "https://sciencetostartup.com/paper/leveraging-large-language-models-for-trustworthiness-assessment-of-web-applications" } ] } ] }

Competitive landscape

Leveraging LLMs to automate the trustworthiness assessment of web applications by verifying adherence to secure coding practices.

Segment

LLM Security Analysis

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Leveraging Large Language Models for Trustworthiness Assessment of Web Applications

Leveraging Large Language Models for Trustworthiness Assessment of Web Applications

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline