ARXIV:2605.07990 · LLM AGENTS · SUBMITTED 11 MAY · 20:47 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Tool Calling is Linearly Readable and Steerable in Language Models

Zekun Wu · Ze Wang · Seonglae Cho · Yufei Yang · Adriano Koshiyama · Sahan Bulathwela · +1 at arXiv

This research identifies a linear and steerable mechanism within language models for tool selection, enabling error prediction and targeted intervention.

Blocked on Code›Score4.0Evidence unverified

Opportunity summary

Pain This research identifies a linear and steerable mechanism within language models for tool selection, enabling error prediction and targeted intervention.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

This research identifies a linear and steerable mechanism within language models for tool selection, enabling error prediction and targeted intervention. Probing 12 instruction-tuned models across Gemma 3, Qwen 3, Qwen 2.5, and Llama 3.1…

METHOD

Full abstract

When a tool-calling agent picks the wrong tool, the failure is invisible until execution: the email gets sent, the meeting gets missed. Probing 12 instruction-tuned models across Gemma 3, Qwen 3, Qwen 2.5, and Llama 3.1 (270M to 27B), we find the identity of the chosen tool is linearly readable and steerable inside the model. Adding the mean-difference between two tools' average internal activations switches which tool the model selects at 77-100% accuracy on name-only single-turn prompts (93-100% at 4B+), and the JSON arguments that follow autoregressively match the new tool's schema, so flipping the name is enough. The same per-tool means also flag likely errors before they happen: on Gemma 3 12B and 27B, queries where the gap between the top-1 and top-2 tool is smallest produce 14-21x more wrong calls than queries with the largest gap. The causal effect concentrates along one direction, the row of the output layer that produces the target tool's first token: a unit vector along it at matched magnitude already reaches 93-100%, while what is left over leaves the choice almost untouched. Activation patching localises this to a small set of mid- and late-layer attention heads, and a within-topic probe across 14 same-domain $τ$-bench airline tools reaches top-1 61-89% across five 4B-14B models, ruling out the reading that we are just moving the model along a topic axis. Even base models encode the right tool before they can emit it: cosine readout from the internal state recovers 69-82% on BFCL while base generation reaches only 2-10%, suggesting pretraining forms the representation and instruction tuning later wires it to the output. We measure tool identity selection and JSON schema correctness in single-turn fixed-menu settings; multi-turn agentic transfer is more fragile and is discussed in Limitations.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. We measure tool identity selection and JSON schema correctness in single-turn fixed-menu settings; multi-turn agentic transfer is more fragile and is discussed in Limitations.

WHY NOW

LLM Agents moved forward this cycle; last verified May 2026. Public score 4.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score4.0

PainThis research identifies a linear and steerable mechanism within language models for tool selection, enabling error prediction and targeted intervention.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

This research identifies a linear and steerable mechanism within language models for tool selection, enabling error prediction and targeted intervention.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

This research identifies a linear and steerable mechanism within language models for tool selection, enabling error prediction and targeted intervention.

Segment

LLM Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "881d917f-d8d0-48b4-bc71-b024c63038a0", "arxiv_id": "2605.07990", "canonical_route": "/paper/tool-calling-is-linearly-readable-and-steerable-in-language-models", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "tool-calling-is-linearly-readable-and-steerable-in-language-models", "endpoints": { "paper_pack": "/api/v1/paper/tool-calling-is-linearly-readable-and-steerable-in-language-models/paper-pack", "build_passport": "/api/v1/paper/tool-calling-is-linearly-readable-and-steerable-in-language-models/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Tool Calling is Linearly Readable and Steerable in Language Models", "normalized_query": "2605.07990", "route": "/paper/tool-calling-is-linearly-readable-and-steerable-in-language-models", "paper_ref": "tool-calling-is-linearly-readable-and-steerable-in-language-models", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/tool-calling-is-linearly-readable-and-steerable-in-language-models#webpage", "url": "https://sciencetostartup.com/paper/tool-calling-is-linearly-readable-and-steerable-in-language-models", "name": "Tool Calling is Linearly Readable and Steerable in Language Models", "description": "This research identifies a linear and steerable mechanism within language models for tool selection, enabling error prediction and targeted intervention.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/tool-calling-is-linearly-readable-and-steerable-in-language-models#scholarlyArticle", "headline": "Tool Calling is Linearly Readable and Steerable in Language Models", "description": "This research identifies a linear and steerable mechanism within language models for tool selection, enabling error prediction and targeted intervention.", "url": "https://sciencetostartup.com/paper/tool-calling-is-linearly-readable-and-steerable-in-language-models", "sameAs": "https://arxiv.org/abs/2605.07990", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.07990" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-08T16:47:08.000Z", "author": [ { "@type": "Person", "name": "Zekun Wu" }, { "@type": "Person", "name": "Ze Wang" }, { "@type": "Person", "name": "Seonglae Cho" }, { "@type": "Person", "name": "Yufei Yang" }, { "@type": "Person", "name": "Adriano Koshiyama" }, { "@type": "Person", "name": "Sahan Bulathwela" }, { "@type": "Person", "name": "Maria Perez-Ortiz" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 4 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Agents" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Agents", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Tool Calling is Linearly Readable and Steerable in Language ", "item": "https://sciencetostartup.com/paper/tool-calling-is-linearly-readable-and-steerable-in-language-models" } ] } ] }

Competitive landscape

This research identifies a linear and steerable mechanism within language models for tool selection, enabling error prediction and targeted intervention.

Segment

LLM Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Tool Calling is Linearly Readable and Steerable in Language Models

Tool Calling is Linearly Readable and Steerable in Language Models

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline