ARXIV:2605.13773 · LLM UNDERSTANDING · SUBMITTED 14 MAY · 20:10 UTC · FRESHNESS FRESH

Verified Source: PDF linked · Verified PaperPack: citation fields available · Partial Proof: unverified proof status

(How) Do Large Language Models Understand High-Level Message Sequence Charts?

Mohammad Reza Mousavi · arXiv

This paper evaluates the semantic understanding of LLMs on High-Level Message Sequence Charts, revealing modest capabilities with significant struggles in complex reasoning tasks.

Blocked on Code · Score 2.0 · Evidence unverified

Opportunity summary

Pain This paper evaluates the semantic understanding of LLMs on High-Level Message Sequence Charts, revealing modest capabilities with significant struggles in complex reasoning tasks.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified


PROBLEM

This paper evaluates the semantic understanding of LLMs on High-Level Message Sequence Charts, revealing modest capabilities with significant struggles in complex reasoning tasks. It is, however, unclear whether these tasks are performed consistently with…

METHOD

Full abstract

Large Language Models (LLMs) are being employed widely to automate tasks across the software development life-cycle. It is, however, unclear whether these tasks are performed consistently with respect to the semantics of the artefacts being handled. This question is particularly under-researched concerning architectural design specification. In this paper, we address this question for High-Level Message Sequence Charts (HMSCs). These are visual models with a rigorous formal semantics that have been used for various purposes, including as a foundation for Sequence Diagrams in the Unified Modelling Language (UML). We examine whether LLMs "understand" the semantics of HMSCs by examining three LLMs (Gemini-3, GPT-5.4, and Qwen-3.6) on how they perform 129 semantic tasks ranging from querying basic semantic constructs in HMSCs (i.e., events and their ordering) to semantic-preserving abstractions and compositions, and calculating the set of traces and trace-equivalent labelled transition systems. The results show that LLMs only have a modest understanding of the formal semantics of HMSCs (ca. 52% overall accuracy), with great variability across different semantic concepts: while LLMs seem to understand the basic semantic concepts of MSCs (ca. 88% accuracy), they struggle with semantic reasoning in tasks involving abstraction and composition (ca. 36% accuracy) and traces and LTSs (ca. 42% accuracy). In particular, all three LLMs struggle with the notions of co-region and explicit causal dependencies and never employed them in semantic-preserving transformations.
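To make "events and their ordering" and "the set of traces" concrete, here is a minimal sketch, not taken from the paper, that enumerates the traces of one small basic MSC. It assumes the usual partial-order reading of MSCs: events on one instance line occur top to bottom, and every send precedes its matching receive. The chart, the instance names A, B, C, the message names m1, m2, and the helpers `precedence_pairs`, `traces`, and `fmt` are all hypothetical and chosen only for illustration.

```python
from itertools import permutations

# Hypothetical basic MSC (illustrative, not from the paper):
#   instance A sends m1 to B, then sends m2 to C.
# An event is (instance, kind, message); "!" is a send, "?" is a receive.
INSTANCE_ORDER = {
    "A": [("A", "!", "m1"), ("A", "!", "m2")],
    "B": [("B", "?", "m1")],
    "C": [("C", "?", "m2")],
}


def precedence_pairs(instance_order):
    """Return all events and the (before, after) constraints of the chart."""
    events = [e for evs in instance_order.values() for e in evs]
    pairs = set()
    # Events on the same instance line are ordered top to bottom.
    for evs in instance_order.values():
        pairs.update(zip(evs, evs[1:]))
    # Every send precedes the receive of the same message.
    for s in events:
        for r in events:
            if s[1] == "!" and r[1] == "?" and s[2] == r[2]:
                pairs.add((s, r))
    return events, pairs


def traces(instance_order):
    """Enumerate every total order of events that respects the constraints."""
    events, pairs = precedence_pairs(instance_order)
    result = []
    for perm in permutations(events):
        pos = {e: i for i, e in enumerate(perm)}
        if all(pos[a] < pos[b] for a, b in pairs):
            result.append(perm)
    return result


def fmt(event):
    """Render an event as, for example, 'A!m1'."""
    return "".join(event)


if __name__ == "__main__":
    for t in traces(INSTANCE_ORDER):
        print(" ".join(fmt(e) for e in t))
```

Under these assumptions the chart has three traces, including one in which m2 is received before m1, since nothing orders the two receives. Brute-force enumeration over permutations is only workable for very small charts; it is used here for clarity rather than efficiency. A co-region could be modelled in this sketch by dropping the top-to-bottom constraints between the events it encloses.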

RESULT

ScienceToStartup currently rates this 2.0/10 on the public viability pass. The results show that LLMs only have a modest understanding of the formal semantics of HMSCs (ca. 52% overall accuracy).

WHY NOW

LLM Understanding moved forward this cycle; last verified May 2026. Public score 2.0/10.


Opportunity summary

Score 2.0

Pain This paper evaluates the semantic understanding of LLMs on High-Level Message Sequence Charts, revealing modest capabilities with significant struggles in complex reasoning tasks.

Evidence 0 refs | 0 sources | 0% coverage

Blocker no shell-level blocker reported


Competitive landscape

This paper evaluates the semantic understanding of LLMs on High-Level Message Sequence Charts, revealing modest capabilities with significant struggles in complex reasoning tasks.

Segment

LLM Understanding

Adoption evidence

No public code link in the paper record yet

Commercial read

2.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

