ARXIV:2605.07517 · RAG SYSTEMS · SUBMITTED 11 MAY · 20:41 UTC · FRESHNESS STALE

Source: PDF linked (Verified) · PaperPack: citation fields available (Verified) · Proof: unverified proof status (Partial)

LARAG: Link-Aware Retrieval Strategy for RAG Systems in Hyperlinked Technical Documentation

Giorgia Bolognesi · Claudio Estatico · Ulderico Fugacci · Isabella Mastroianni · Claudio Muselli · Luca Oneto · arXiv

A lightweight retrieval strategy for RAG that leverages hyperlink topology in technical documentation to improve answer quality and efficiency.

Ship in 2-4 weeks · Score 7.0 · Evidence unverified

Opportunity summary

Pain: A lightweight retrieval strategy for RAG that leverages hyperlink topology in technical documentation to improve answer quality and efficiency.

Evidence: 0 refs | 3 sources | 50% coverage

Blocker: Evidence unverified


PROBLEM

Standard embedding-based retrievers treat naturally structured corpora, such as technical manuals, as flat collections…

METHOD

Full abstract

Retrieval-Augmented Generation (RAG) enhances the factual grounding of Large Language Models by conditioning their outputs on external documents. However, standard embedding-based retrievers treat naturally structured corpora, such as technical manuals, as flat collections of passages, thereby overlooking the hyperlink topology that users rely on when navigating such content. We introduce LARAG (Link-Aware RAG): a lightweight, link-aware retrieval strategy that leverages the author-defined hyperlink structure already present in HTML documentation, encoding hyperlink relations as metadata in the chunk representations and exploiting them to perform a form of graph-like retrieval of locally relevant content. In a benchmark of twenty expert-designed queries over Rulex Platform technical documentation and four prompting strategies, LARAG consistently improves answer quality, achieving the highest BERTScore F1, while retrieving fewer chunks and generating fewer tokens than a baseline RAG architecture used for comparison. These results show that directly leveraging the existing hyperlink topology of technical documentation, even without explicit graph construction or inference, enables an implicit form of graph-like retrieval that yields a more faithful and efficient RAG pipeline, providing better grounding at lower cost.
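The abstract describes the idea only at a high level, so the following is a minimal sketch of one way link-aware retrieval could work, not the authors' implementation. It assumes one chunk per HTML page, that each `href` equals the target chunk's id, a bag-of-words cosine score standing in for a real embedding model, and a one-hop expansion with a hypothetical `min_score` threshold. All names (`Chunk`, `make_chunk`, `link_aware_retrieve`) and the sample pages are illustrative.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass, field
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects visible text and href targets from one HTML fragment."""

    def __init__(self):
        super().__init__()
        self.text_parts = []
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_data(self, data):
        self.text_parts.append(data)


@dataclass
class Chunk:
    chunk_id: str
    text: str
    links: list = field(default_factory=list)  # hyperlink relations kept as metadata


def make_chunk(chunk_id, html):
    """Build a chunk whose metadata records the pages it links to."""
    parser = LinkExtractor()
    parser.feed(html)
    text = " ".join(" ".join(parser.text_parts).split())
    return Chunk(chunk_id, text, parser.links)


def bow(text):
    # Assumption: a bag-of-words vector stands in for a learned embedding.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def link_aware_retrieve(query, chunks, k=1, min_score=0.3):
    """Pick the top-k seed chunks, then follow their links one hop,
    keeping only linked chunks that are themselves relevant to the query."""
    q = bow(query)
    scores = {cid: cosine(q, bow(c.text)) for cid, c in chunks.items()}
    seeds = sorted(scores, key=scores.get, reverse=True)[:k]
    selected = list(seeds)
    for cid in seeds:
        for target in chunks[cid].links:
            if target in chunks and target not in selected and scores[target] >= min_score:
                selected.append(target)
    return selected


# Hypothetical documentation pages, invented for illustration.
pages = {
    "tasks.html": '<p>Tasks overview. Configure an <a href="import.html">import task</a> '
                  'or an <a href="export.html">export task</a>.</p>',
    "import.html": "<p>The import task loads a dataset from a file.</p>",
    "export.html": "<p>The export task writes results to a database.</p>",
    "license.html": "<p>License terms and dataset usage conditions.</p>",
}
chunks = {pid: make_chunk(pid, html) for pid, html in pages.items()}
result = link_aware_retrieve("how do I configure an import task for a dataset", chunks)
print(result)
```

In this toy corpus the overview page is the seed, and only the linked page that also matches the query is pulled in. The unrelated linked page and the unlinked page are left out, which is one plausible reading of "graph-like retrieval of locally relevant content" without building an explicit graph.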

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. In a benchmark of twenty expert-designed queries over Rulex Platform technical documentation and four prompting strategies, LARAG consistently improves answer quality, achieving the highest…
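For readers unfamiliar with the BERTScore F1 metric named in the abstract, the sketch below shows the greedy-matching computation on precomputed token embeddings. It is a simplified illustration: real BERTScore takes its vectors from a contextual language model and optionally applies idf weighting and baseline rescaling, all of which are omitted here. The function name and the toy vectors are assumptions, and the input vectors are assumed to be non-zero.

```python
import math


def _unit(v):
    # Normalize to unit length so a dot product equals cosine similarity.
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]


def bertscore_f1(cand_emb, ref_emb):
    """Greedy-matching F1 over token embeddings (no idf, no rescaling)."""
    cand = [_unit(v) for v in cand_emb]
    ref = [_unit(v) for v in ref_emb]
    sim = [[sum(a * b for a, b in zip(c, r)) for r in ref] for c in cand]
    # Precision: each candidate token matched to its most similar reference token.
    precision = sum(max(row) for row in sim) / len(cand)
    # Recall: each reference token matched to its most similar candidate token.
    recall = sum(max(sim[i][j] for i in range(len(cand))) for j in range(len(ref))) / len(ref)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy 2-d "embeddings": the candidate covers one of two reference tokens.
f1_partial = bertscore_f1([[1.0, 0.0]], [[1.0, 0.0], [0.0, 1.0]])
f1_exact = bertscore_f1([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
print(f1_partial, f1_exact)
```

In the partial case precision is 1 and recall is 0.5, so F1 is 2/3. An answer that covers every reference token scores 1.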

WHY NOW

RAG Systems moved forward this cycle; last verified May 2026. Public score 7.0/10. Production flags indicate code availability.


Opportunity summary

Score: 7.0

Pain: A lightweight retrieval strategy for RAG that leverages hyperlink topology in technical documentation to improve answer quality and efficiency.

Evidence: 0 refs | 3 sources | 50% coverage

Blocker: no shell-level blocker reported

Analysis summary

A lightweight retrieval strategy for RAG that leverages hyperlink topology in technical documentation to improve answer quality and efficiency.


Competitive landscape

A lightweight retrieval strategy for RAG that leverages hyperlink topology in technical documentation to improve answer quality and efficiency.

Segment: RAG Systems

Adoption evidence: No public code link in the paper record yet

Commercial read: 7.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified

arXiv: https://arxiv.org/abs/2605.07517 · datePublished: 2026-05-08T09:50:53.000Z
