ARXIV:2604.12223 · INTERPRETABLE TEXT CLASSIFICATION · SUBMITTED 15 APR · 17:02 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

LLM-Guided Semantic Bootstrapping for Interpretable Text Classification with Tsetlin Machines

Jiechao Gao · Rohan Kumar Yadav · Yuangang Li · Yuandong Pan · Jie Wang · Ying Liu · +1 at arXiv

A framework that transfers LLM knowledge into symbolic Tsetlin Machines for interpretable and accurate text classification without runtime LLM calls.

Blocked on Code›Score3.0Evidence unverified

Opportunity summary

Pain A framework that transfers LLM knowledge into symbolic Tsetlin Machines for interpretable and accurate text classification without runtime LLM calls.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A framework that transfers LLM knowledge into symbolic Tsetlin Machines for interpretable and accurate text classification without runtime LLM calls. We propose a semantic bootstrapping framework that transfers LLM knowledge into symbolic form, combining…

METHOD

Full abstract

Pretrained language models (PLMs) like BERT provide strong semantic representations but are costly and opaque, while symbolic models such as the Tsetlin Machine (TM) offer transparency but lack semantic generalization. We propose a semantic bootstrapping framework that transfers LLM knowledge into symbolic form, combining interpretability with semantic capacity. Given a class label, an LLM generates sub-intents that guide synthetic data creation through a three-stage curriculum (seed, core, enriched), expanding semantic diversity. A Non-Negated TM (NTM) learns from these examples to extract high-confidence literals as interpretable semantic cues. Injecting these cues into real data enables a TM to align clause logic with LLM-inferred semantics. Our method requires no embeddings or runtime LLM calls, yet equips symbolic models with pretrained semantic priors. Across multiple text classification tasks, it improves interpretability and accuracy over vanilla TM, achieving performance comparable to BERT while remaining fully symbolic and efficient.

RESULT

ScienceToStartup currently rates this 3.0/10 on the public viability pass. Injecting these cues into real data enables a TM to align clause logic with LLM-inferred semantics.

WHY NOW

Interpretable Text Classification moved forward this cycle; last verified April 2026. Public score 3.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score3.0

PainA framework that transfers LLM knowledge into symbolic Tsetlin Machines for interpretable and accurate text classification without runtime LLM calls.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A framework that transfers LLM knowledge into symbolic Tsetlin Machines for interpretable and accurate text classification without runtime LLM calls.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A framework that transfers LLM knowledge into symbolic Tsetlin Machines for interpretable and accurate text classification without runtime LLM calls.

Segment

Interpretable Text Classification

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "2b9a9c10-7c8c-4e58-b942-3a162d9e8a74", "arxiv_id": "2604.12223", "canonical_route": "/paper/llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines", "endpoints": { "paper_pack": "/api/v1/paper/llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines/paper-pack", "build_passport": "/api/v1/paper/llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "LLM-Guided Semantic Bootstrapping for Interpretable Text Classification with Tsetlin Machines", "normalized_query": "2604.12223", "route": "/paper/llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines", "paper_ref": "llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines#webpage", "url": "https://sciencetostartup.com/paper/llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines", "name": "LLM-Guided Semantic Bootstrapping for Interpretable Text Classification with Tsetlin Machines", "description": "A framework that transfers LLM knowledge into symbolic Tsetlin Machines for interpretable and accurate text classification without runtime LLM calls.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines#scholarlyArticle", "headline": "LLM-Guided Semantic Bootstrapping for Interpretable Text Classification with Tsetlin Machines", "description": "A framework that transfers LLM knowledge into symbolic Tsetlin Machines for interpretable and accurate text classification without runtime LLM calls.", "url": "https://sciencetostartup.com/paper/llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines", "sameAs": "https://arxiv.org/abs/2604.12223", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.12223" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-14T03:02:25.000Z", "author": [ { "@type": "Person", "name": "Jiechao Gao" }, { "@type": "Person", "name": "Rohan Kumar Yadav" }, { "@type": "Person", "name": "Yuangang Li" }, { "@type": "Person", "name": "Yuandong Pan" }, { "@type": "Person", "name": "Jie Wang" }, { "@type": "Person", "name": "Ying Liu" }, { "@type": "Person", "name": "Michael Lepech" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 3 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Interpretable Text Classification" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Interpretable Text Classification", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "LLM-Guided Semantic Bootstrapping for Interpretable Text Cla", "item": "https://sciencetostartup.com/paper/llm-guided-semantic-bootstrapping-for-interpretable-text-classification-with-tsetlin-machines" } ] } ] }

Competitive landscape

A framework that transfers LLM knowledge into symbolic Tsetlin Machines for interpretable and accurate text classification without runtime LLM calls.

Segment

Interpretable Text Classification

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

LLM-Guided Semantic Bootstrapping for Interpretable Text Classification with Tsetlin Machines

LLM-Guided Semantic Bootstrapping for Interpretable Text Classification with Tsetlin Machines

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline