ARXIV:2605.31167 · LLM EVALUATION · SUBMITTED 01 JUN · 20:20 UTC · FRESHNESS STALE

Verified Source: PDF linked · Verified PaperPack: citation fields available · Partial Proof: unverified proof status

LLM-FACETS: A Privacy-Preserving Framework for Evaluating LLM Transparency and Accountability

Tom Lucas · Alessio Buscemi · Alfredo Capozucca · German Castignani · Barbara Delacroix · arXiv

LLM-FACETS is a privacy-preserving, open-source framework with a browser interface for evaluating LLM transparency and accountability for diverse stakeholders.

Ship in 2-4 weeks · Score 8.0 · Evidence unverified

Opportunity summary

Pain: LLM-FACETS is a privacy-preserving, open-source framework with a browser interface for evaluating LLM transparency and accountability for diverse stakeholders.

Evidence: 0 refs | 4 sources | 67% coverage

Blocker: Evidence unverified

PROBLEM

Auditing LLMs remains inaccessible to non-technical practitioners: existing tools require programming expertise and non-trivial…

METHOD

Full abstract

Assessing whether Large Language Model outputs are factually grounded, epistemically calibrated, and methodologically reproducible is a prerequisite for responsible AI deployment. Yet auditing LLMs remains inaccessible to non-technical practitioners: existing tools require programming expertise and non-trivial environment setup, and cloud-hosted platforms transmit evaluation data to external services, creating barriers for the domain experts and compliance officers who are legally responsible for AI oversight. We introduce LLM-FACETS (LLM FActuality Cross-EvaluaTion System): an open-source framework with a browser-accessible interface and a plugin architecture, structured around three practitioner profiles (technical experts, domain experts, compliance officers) that mirror the stakeholder categories identified in the EU AI Act and the NIST AI Risk Management Framework. The architecture makes data flows explicit: deterministic metrics (BLEU, ROUGE, BERTScore) run entirely within the self-hosted server with no outbound transmission, while LLM-judge metrics contact external APIs explicitly, with users retaining full credential control. The framework operationalizes transparency through three mechanisms: token-level log-probability visualization for epistemic uncertainty, multi-judge consensus to mitigate judge bias, and RAG Triad metrics (Faithfulness, Answer Relevance, Context Relevance) to detect and localize hallucinations. The plugin architecture allows any new metric or dataset to be integrated without modifying the evaluation pipeline. The open-source implementation enables cross-checking across multiple metrics targeting the same property, ensuring reproducibility and decoupling AI accountability from the teams building the systems assessed. We verify the framework through cross-validation of 18 metric implementations against canonical reference libraries.
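To make the plugin and data-flow ideas in the abstract concrete, here is a minimal Python sketch of a metric registry in which each plugin declares whether it sends data outside the server. This is an illustration under stated assumptions, not the LLM-FACETS implementation: the names `MetricPlugin`, `register`, `evaluate`, and `data_flow_report` are hypothetical, and `unigram_precision` is a toy stand-in for a deterministic metric, not BLEU or ROUGE.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Signature every metric plugin follows in this sketch (hypothetical).
MetricFn = Callable[[str, str], float]


@dataclass(frozen=True)
class MetricPlugin:
    name: str
    fn: MetricFn
    external: bool  # True if the metric would send data to an outside API


REGISTRY: Dict[str, MetricPlugin] = {}


def register(name: str, external: bool = False) -> Callable[[MetricFn], MetricFn]:
    """Decorator that adds a metric to the registry without editing the pipeline."""
    def wrap(fn: MetricFn) -> MetricFn:
        if name in REGISTRY:
            raise ValueError(f"metric {name!r} is already registered")
        REGISTRY[name] = MetricPlugin(name, fn, external)
        return fn
    return wrap


@register("unigram_precision")
def unigram_precision(candidate: str, reference: str) -> float:
    """Toy local metric: fraction of candidate tokens that appear in the reference."""
    cand = candidate.lower().split()
    ref = set(reference.lower().split())
    if not cand:
        return 0.0
    return sum(tok in ref for tok in cand) / len(cand)


@register("stub_llm_judge", external=True)
def stub_llm_judge(candidate: str, reference: str) -> float:
    """Placeholder for an LLM-judge metric; a real plugin would call an API here."""
    raise NotImplementedError("no external call is made in this sketch")


def evaluate(candidate: str, reference: str,
             allow_external: bool = False) -> Dict[str, float]:
    """Run every registered metric, skipping external ones unless the caller opts in."""
    results: Dict[str, float] = {}
    for plugin in REGISTRY.values():
        if plugin.external and not allow_external:
            continue
        results[plugin.name] = plugin.fn(candidate, reference)
    return results


def data_flow_report() -> Dict[str, List[str]]:
    """List which registered metrics stay local and which would leave the server."""
    report: Dict[str, List[str]] = {"local": [], "external": []}
    for plugin in REGISTRY.values():
        report["external" if plugin.external else "local"].append(plugin.name)
    return report
```

Calling `evaluate("the cat sat", "the cat slept")` runs only the local metric, and `data_flow_report()` lists the judge stub under `external`. Making the opt-in a parameter of the pipeline, rather than of each plugin, is one way to keep outbound calls visible to the person running the evaluation.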

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. The open-source implementation enables cross-checking across multiple metrics targeting the same property, ensuring reproducibility and decoupling AI accountability from the teams building the systems…
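One way to picture cross-checking several metrics that target the same property is to flag the samples on which those metrics disagree. The Python sketch below assumes every metric returns a per-sample score on a common 0 to 1 scale; the function name `flag_disagreements`, the metric names in the usage note, and the 0.2 tolerance are hypothetical choices for illustration and are not taken from the paper.

```python
from typing import Dict, List


def flag_disagreements(scores: Dict[str, List[float]],
                       tolerance: float = 0.2) -> List[int]:
    """Return indices of samples where metrics for one property disagree.

    `scores` maps a metric name to its per-sample scores. All metrics are
    assumed to score the same samples on the same 0-1 scale (an assumption
    of this sketch). A sample is flagged when the gap between its highest
    and lowest score exceeds `tolerance`.
    """
    lengths = {len(values) for values in scores.values()}
    if len(lengths) != 1:
        raise ValueError("every metric must score the same set of samples")
    n_samples = lengths.pop()

    flagged: List[int] = []
    for i in range(n_samples):
        sample_scores = [values[i] for values in scores.values()]
        if max(sample_scores) - min(sample_scores) > tolerance:
            flagged.append(i)
    return flagged
```

For example, with `{"judge_faithfulness": [0.9, 0.5, 0.1], "overlap_faithfulness": [0.85, 0.9, 0.15]}` only the second sample (index 1) is flagged, since its two scores differ by 0.4. Flagged samples are candidates for manual review, not proof that either metric is wrong.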

WHY NOW

LLM Evaluation moved forward this cycle; last verified June 2026. Public score 8.0/10. Implementation evidence is present through a linked repository.

Competitive landscape

Segment: LLM Evaluation

Adoption evidence: Public code linked for build inspection

Commercial read: 8.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified

Published: 2026-05-29 · arXiv: https://arxiv.org/abs/2605.31167 · Code repository: https://github.com/Scriptor-Group/AIMVi
