Question 1

Where does tool-use overhead come from?

Accepted Answer

Three places. First, enabling tools injects a special tool-use system prompt — on Anthropic models that ranges from 290 to 804 tokens depending on the model and tool_choice setting. Second, every tool definition you attach (name, description, JSON schema) is serialized into the prompt on every call. Third, built-in tools carry their own fixed costs: the bash tool adds 245 input tokens and the text editor tool adds about 700. None of this appears in your visible prompt, but all of it is billed.

Question 2

Why does tool_choice change the overhead?

Accepted Answer

When tool_choice is auto or none, the model gets a shorter instruction block because it merely needs to know tools exist and may be used. When tool_choice is any or a specific tool, the system prompt grows — on Opus 4.8 from 290 to 410 tokens, on Opus 4.7 from 675 to 804 — because the model receives additional forcing instructions. If you do not need to force a tool call, leaving tool_choice on auto is a small but free saving on every single request.

Question 3

How accurate are these numbers?

Accepted Answer

The per-model system prompt token counts come directly from Anthropic's published tool-use documentation and were verified on 2026-06-11 — the verification date is shown in the tool itself. Schema token counts are your own estimate since they depend entirely on how verbose your descriptions are; a typical small tool runs 100-200 tokens, a complex one with nested objects can exceed 500. The monthly dollar figure uses our verified pricing dataset at one call-equivalent of overhead per request.

Question 4

How do I reduce tool overhead?

Accepted Answer

Trim tool descriptions to what the model needs for selection, not full documentation — the description is loaded on every call whether the tool is used or not. Remove tools the agent rarely calls; ten attached tools at 200 tokens each is 2,000 tokens per request before you say a word. Use prompt caching: tool definitions sit at the start of the prompt and are ideal cache content, cutting their effective cost by up to 90% on cache hits.

Question 5

Does this apply to OpenAI and Google models too?

Accepted Answer

The structure does — every provider serializes your tool schemas into the context and adds framework instructions — but the exact token counts in this calculator are Anthropic's published figures, and other vendors do not document theirs with the same precision. As a rule of thumb, schema tokens dominate once you attach more than a handful of tools, and that part of the math transfers directly to any provider.

Tool Call Overhead Calculator

How it works

Frequently asked questions

Where does tool-use overhead come from?

Why does tool_choice change the overhead?

How accurate are these numbers?

How do I reduce tool overhead?

Does this apply to OpenAI and Google models too?

Related tools

Tool Schema Builder

Web Search Tool Cost Calculator

Token Cost Calculator

Structured Data Token Overhead