Question 1

What is context compaction?

Accepted Answer

Compaction replaces the accumulated conversation history with a short summary, so subsequent turns send a few thousand tokens of summary instead of tens of thousands of raw transcript. Claude Code's /compact command and most agent frameworks do this automatically when the window fills. The trade is fidelity for cost: the model loses verbatim detail but keeps the gist.

Question 2

Does the calculator include the cost of the summarization call itself?

Accepted Answer

Yes. Every compaction is modeled as one extra API call that reads the entire accumulated context as input and writes the summary as output, at the selected model's real rates. That is why very frequent compaction on small contexts can come out more expensive than doing nothing — the summary calls eat the savings.

Question 3

How do I choose the compaction interval?

Accepted Answer

Watch the two bar charts. Compact too rarely and per-turn cost climbs toward the no-compaction curve before each reset; compact too often and you pay for summary calls that barely shrink anything. For typical agentic coding sessions with 4-6k tokens of growth per turn, intervals of 8-15 turns usually land near the sweet spot — but slide the control and check your own numbers.

Question 4

Is a smaller summary always better?

Accepted Answer

Cheaper, yes — better, not necessarily. A 1k-token summary of a 100k-token session discards a lot, and the agent may re-fetch files or re-ask questions, which costs tokens elsewhere. The calculator only prices the direct token math; budget some slack in the summary size for the context your agent genuinely needs to keep working.

Question 5

Why do per-turn costs saw-tooth in the compacted chart?

Accepted Answer

Each compaction turn pays double: the normal turn plus the summarization call, then the next turn restarts from the cheap summary-sized context. The spikes are the summaries; the drops are the resets. The area under the green curve versus the gray one is your saving, and the hero number is exactly that difference.

Context Compaction Savings

How it works

Frequently asked questions

What is context compaction?

Does the calculator include the cost of the summarization call itself?

How do I choose the compaction interval?

Is a smaller summary always better?

Why do per-turn costs saw-tooth in the compacted chart?

Related tools

Agent Session Cost Estimator

Context Window Visualizer

Conversation Memory Planner

Prompt Compressor