Cost Tracking for Claude Agent SDK Applications

A practical walkthrough of token and spend accounting in the Claude Agent SDK. Read total_cost_usd from the result message, deduplicate parallel tool calls that share an assistant id, break spend down per model with modelUsage, sum cost across query() calls yourself, and read cache_creation/cache_read tokens.

by JPette1783·added 2026-06-05·

Claude Code

HarnessClaude Code

Command center

Source

Review first

Review safety and privacy notes before installing or copying commands.

Safety notes Privacy notes

Install & copy

## Overview

The Claude Agent SDK reports detailed token usage for each interaction. This
guide covers reading cost and usage correctly, especially with parallel tool use
and multi-step conversations.

## Scopes

- **query() call**: one `query()` invocation; produces one `result` message.
- **Step**: one request/response cycle within a call; produces assistant messages
  with usage.
- **Session**: multiple `query()` calls linked by a session id (`resume`); each
  call reports its own cost.

## Get the total for a call

The `result` message includes `total_cost_usd` (estimated) and cumulative usage:

```typescript
for await (const message of query({ prompt: "Summarize this project" })) {
  if (message.type === "result") {
    console.log(`Total cost: $${message.total_cost_usd}`);
  }
}
```

This is a **client-side estimate** from a bundled price table, not authoritative
billing. Use the Usage and Cost API or the Console for real charges.

## Per-step and per-model usage

Each assistant message carries `usage` (input/output tokens) and an `id`. Parallel
tool calls in one turn share an `id`, so deduplicate by id to avoid double
counting:

```typescript
const seen = new Set();
let input = 0, output = 0;
for await (const message of query({ prompt: "..." })) {
  if (message.type === "assistant" && !seen.has(message.message.id)) {
    seen.add(message.message.id);
    input += message.message.usage.input_tokens;
    output += message.message.usage.output_tokens;
  }
}
```

The result message's `modelUsage` (TS) / `model_usage` (Python) breaks cost and
tokens down per model, useful when subagents run a cheaper model.

## Accumulate across calls

The SDK has no session-level total; sum each call's `total_cost_usd` yourself when
running multiple `query()` calls.

## Cache tokens and failures

The SDK uses prompt caching automatically. The usage object includes
`cache_creation_input_tokens` (written, higher rate) and `cache_read_input_tokens`
(read, reduced rate); track them to understand caching savings. Both success and
error results include cost, so always read it regardless of subtype. To extend
cache TTL to one hour on API-key/Bedrock/Vertex/Foundry, set
`ENABLE_PROMPT_CACHING_1H`.

## Source

- Track cost and usage: https://code.claude.com/docs/en/agent-sdk/cost-tracking

Trust & readiness

TrustReview first
Sourcesource-backed
Safety notesPresent
ReviewedNo

Community context

Related entries(4)
Related guides(1)
Community signals

Compare

Integrations & API

Contribute

Suggest a metadata change Claim this listing

Documentation Source repository Browse directory

Review first — review before installing

Open the source and read safety notes before installing.

Citation facts

Source-backed facts for citing this resource, derived directly from the registry — also available as plain text for AI assistants.

Canonical URL: https://heyclau.de/entry/guides/cost-tracking-for-claude-agent-sdk-applications
Source URLs: https://code.claude.com/docs/en/agent-sdk/cost-tracking, https://github.com/JSONbored/awesome-claude/blob/main/content/guides/cost-tracking-for-claude-agent-sdk-applications.mdx
Safety notes: total_cost_usd / costUSD are client-side estimates from a bundled price table, not authoritative billing; do not bill end users or trigger financial decisions from them., Estimates can drift when pricing changes or the SDK version does not recognize a model; use the Usage and Cost API or Console for real billing., Both success and error result messages include usage and cost; read cost regardless of subtype so failed runs are still accounted for.
Privacy notes: Usage data is token counts and cost, not content; it is safe to log, though it can reveal activity volume., Per-model and end-user attribution may be sent to an observability backend if you also enable telemetry; govern that data accordingly., The SDK uses prompt caching automatically; cache token fields reveal reuse patterns but not content.
Author: JPette1783
Submitted by: JPette1783
Claim status: unclaimed
Last verified: 2026-06-05

Decision playbook

Review trust signals before you adopt

Signals are present but mixed. Use the checklist below to confirm the source and operational safety for your environment.

Compare context

Selected

Current score

Baseline

—

Delta

No baseline selected

No major trust-signal divergence detected in the current selection.

Source and provenance checks

Needs review

Confirm ownership and provenance before trusting install instructions.

Source link availableRequired
Open the canonical repository and verify ownership.
Done
Source provenance statusRequired
Marked as source-backed.
Done
Metadata reviewed
No reviewed flag detected in metadata.
Pending

Safety and privacy checks

Complete

Validate risk disclosures before installation or API wiring.

Safety notes presentRequired
Review the listed safety guidance before running commands.
Done
Privacy notes presentRequired
Review data handling notes before connecting accounts or secrets.
Done
Trust level risk gateRequired
Trust level does not block evaluation.
Done

Package and install checks

Needs review

Check package metadata and artifact integrity signals.

Install payload available
Install or copy payload is available for review.
Done
Package verification flag
No package verification flag provided.
Pending
Checksum metadata
No checksum provided for downloaded artifact.
Pending

Compare-driven decision checks

Needs review

Use compare context to validate trade-offs before adoption.

Compare tray has multiple entries
Add at least one more entry to compare trust differences.
Pending
Baseline comparison available
No baseline peer selected yet.
Pending
Diverging trust signals identified
No major trust-signal divergence found.
Pending

Setup at a glance

Copy & paste

Copy-ready — paste the snippet to get started.

Install command

Not provided

Config snippet

Not provided

Copy snippet

Provided

Prerequisites

3 to clear

Platforms

1 listed

Install type

Copy & paste

Adoption plan

Balanced adoption plan

Current risk score 24/100. Use staged verification before broader rollout.

Risk 24

Pre-adoption checks

Validate source and review signals before any execution.

Confirm source provenanceRequired
Source URL/provenance metadata is present.
Done
Confirm metadata review state
No review metadata found; increase manual validation.
Pending
Verify install payload
Install/config payload exists and can be inspected.
Done

Security checks

Confirm safety, privacy, and package integrity signals.

Review safety notesRequired
Safety notes are present.
Done
Review privacy notesRequired
Privacy notes are present.
Done
Verify package integrity metadata
No package verification/checksum metadata.
Pending

Rollout

Adopt in controlled steps based on the selected plan.

Run in isolated sandbox firstRequired
Use a constrained sandbox and observe behavior across multiple tasks.
Pending
Roll out graduallyRequired
Roll out to a small cohort before wider usage.
Pending
Set monitoring and fallback
Define rollback path and monitor errors after adoption.
Pending

Evidence readiness

Evidence readiness matrix · balanced

Missing required evidence: Metadata review. Risk score 31.

Risk 31

Source provenance

Present

Source repository/provenance is listed.

Required in this preset

Metadata review

Missing

Review metadata is missing.

Required in this preset

Safety notes

Present

Safety notes are present.

Required in this preset

Privacy notes

Present

Privacy notes are present.

Optional in this preset

Package integrity

Missing

Package integrity metadata is missing.

Optional in this preset

Install payload

Present

Install payload is available.

Required in this preset

Required gaps: Metadata review

Decision timeline

Decision timeline · balanced

Blocking gaps: Check metadata review status. Risk 28.

Risk 28

triage

Confirm source provenanceRequired

Source/provenance metadata is available.

Done

triage

Check metadata review statusRequired

Review metadata is missing.

Pending

verify

Review safety notesRequired

Safety notes are available.

Done

verify

Review privacy notes

Privacy notes are available.

Done

verify

Validate package integrity metadata

Package integrity metadata is missing.

Pending

rollout

Verify install payload and commandsRequired

Install payload is available.

Done

Blockers: Check metadata review status

Prerequisite readiness

3 prerequisites to line up before setup. Have accounts and credentials ready first.

0/3 ready

Account & credentials1Install & runtime1General1

Safety & privacy surface

3 safety and 3 privacy notes across 4 risk areas. Review closely: credentials & tokens, third-party handling.

4 areas

SafetyGeneraltotal_cost_usd / costUSD are client-side estimates from a bundled price table, not authoritative billing; do not bill end users or trigger financial decisions from them.
SafetyGeneralEstimates can drift when pricing changes or the SDK version does not recognize a model; use the Usage and Cost API or Console for real billing.
SafetyExecution & processesBoth success and error result messages include usage and cost; read cost regardless of subtype so failed runs are still accounted for.
PrivacyCredentials & tokensUsage data is token counts and cost, not content; it is safe to log, though it can reveal activity volume.
PrivacyThird-party handlingPer-model and end-user attribution may be sent to an observability backend if you also enable telemetry; govern that data accordingly.
PrivacyCredentials & tokensThe SDK uses prompt caching automatically; cache token fields reveal reuse patterns but not content.

Safety notes

total_cost_usd / costUSD are client-side estimates from a bundled price table, not authoritative billing; do not bill end users or trigger financial decisions from them.
Estimates can drift when pricing changes or the SDK version does not recognize a model; use the Usage and Cost API or Console for real billing.
Both success and error result messages include usage and cost; read cost regardless of subtype so failed runs are still accounted for.

Privacy notes

Usage data is token counts and cost, not content; it is safe to log, though it can reveal activity volume.
Per-model and end-user attribution may be sent to an observability backend if you also enable telemetry; govern that data accordingly.
The SDK uses prompt caching automatically; cache token fields reveal reuse patterns but not content.

Prerequisites

The Claude Agent SDK installed for Python or TypeScript.
An async loop over query() results so you can read assistant and result messages.
For authoritative billing, access to the Usage and Cost API or the Console.

Schema details

Install type: copy
Troubleshooting: No

Full copyable content

## Overview

The Claude Agent SDK reports detailed token usage for each interaction. This
guide covers reading cost and usage correctly, especially with parallel tool use
and multi-step conversations.

## Scopes

- **query() call**: one `query()` invocation; produces one `result` message.
- **Step**: one request/response cycle within a call; produces assistant messages
  with usage.
- **Session**: multiple `query()` calls linked by a session id (`resume`); each
  call reports its own cost.

## Get the total for a call

The `result` message includes `total_cost_usd` (estimated) and cumulative usage:

```typescript
for await (const message of query({ prompt: "Summarize this project" })) {
  if (message.type === "result") {
    console.log(`Total cost: $${message.total_cost_usd}`);
  }
}
```

This is a **client-side estimate** from a bundled price table, not authoritative
billing. Use the Usage and Cost API or the Console for real charges.

## Per-step and per-model usage

Each assistant message carries `usage` (input/output tokens) and an `id`. Parallel
tool calls in one turn share an `id`, so deduplicate by id to avoid double
counting:

```typescript
const seen = new Set();
let input = 0, output = 0;
for await (const message of query({ prompt: "..." })) {
  if (message.type === "assistant" && !seen.has(message.message.id)) {
    seen.add(message.message.id);
    input += message.message.usage.input_tokens;
    output += message.message.usage.output_tokens;
  }
}
```

The result message's `modelUsage` (TS) / `model_usage` (Python) breaks cost and
tokens down per model, useful when subagents run a cheaper model.

## Accumulate across calls

The SDK has no session-level total; sum each call's `total_cost_usd` yourself when
running multiple `query()` calls.

## Cache tokens and failures

The SDK uses prompt caching automatically. The usage object includes
`cache_creation_input_tokens` (written, higher rate) and `cache_read_input_tokens`
(read, reduced rate); track them to understand caching savings. Both success and
error results include cost, so always read it regardless of subtype. To extend
cache TTL to one hour on API-key/Bedrock/Vertex/Foundry, set
`ENABLE_PROMPT_CACHING_1H`.

## Source

- Track cost and usage: https://code.claude.com/docs/en/agent-sdk/cost-tracking

About this resource

Overview

The Claude Agent SDK reports detailed token usage for each interaction. This guide covers reading cost and usage correctly, especially with parallel tool use and multi-step conversations.

Scopes

query() call: one query() invocation; produces one result message.
Step: one request/response cycle within a call; produces assistant messages with usage.
Session: multiple query() calls linked by a session id (resume); each call reports its own cost.

Get the total for a call

The result message includes total_cost_usd (estimated) and cumulative usage:

for await (const message of query({ prompt: "Summarize this project" })) {
  if (message.type === "result") {
    console.log(`Total cost: $${message.total_cost_usd}`);
  }
}

This is a client-side estimate from a bundled price table, not authoritative billing. Use the Usage and Cost API or the Console for real charges.

Per-step and per-model usage

Each assistant message carries usage (input/output tokens) and an id. Parallel tool calls in one turn share an id, so deduplicate by id to avoid double counting:

const seen = new Set();
let input = 0, output = 0;
for await (const message of query({ prompt: "..." })) {
  if (message.type === "assistant" && !seen.has(message.message.id)) {
    seen.add(message.message.id);
    input += message.message.usage.input_tokens;
    output += message.message.usage.output_tokens;
  }
}

The result message's modelUsage (TS) / model_usage (Python) breaks cost and tokens down per model, useful when subagents run a cheaper model.

Accumulate across calls

The SDK has no session-level total; sum each call's total_cost_usd yourself when running multiple query() calls.

Cache tokens and failures

The SDK uses prompt caching automatically. The usage object includes cache_creation_input_tokens (written, higher rate) and cache_read_input_tokens (read, reduced rate); track them to understand caching savings. Both success and error results include cost, so always read it regardless of subtype. To extend cache TTL to one hour on API-key/Bedrock/Vertex/Foundry, set ENABLE_PROMPT_CACHING_1H.

Source

Track cost and usage: https://code.claude.com/docs/en/agent-sdk/cost-tracking

#claude-agent-sdk #cost-tracking #usage #observability #developer-tools

Source citations

Source methodology →

Add this badge to your README

Show that Cost Tracking for Claude Agent SDK Applications is listed on HeyClaude. Paste this Markdown into your README — it renders the badge and links back to this page.

[![Listed on HeyClaude](https://heyclau.de/badge/guides/cost-tracking-for-claude-agent-sdk-applications.svg)](https://heyclau.de/entry/guides/cost-tracking-for-claude-agent-sdk-applications)

How it compares

Cost Tracking for Claude Agent SDK Applications side by side with 3 alternatives on trust, install, platform support, and disclosed safety notes — all from reviewed registry metadata.

Field	Cost Tracking for Claude Agent SDK Applications A practical walkthrough of token and spend accounting in the Claude Agent SDK. Read total_cost_usd from the result message, deduplicate parallel tool calls that share an assistant id, break spend down per model with modelUsage, sum cost across query() calls yourself, and read cache_creation/cache_read tokens. Open dossier	Agent Skills in Claude Agent SDK Applications A practical walkthrough of using Agent Skills in the Claude Agent SDK: how skills are discovered from the filesystem via settingSources, the skills option to enable or filter them, tool access, and troubleshooting discovery. Open dossier	OpenTelemetry Observability for Claude Agent SDK Agents A practical walkthrough of exporting OpenTelemetry traces, metrics, and events from the Claude Agent SDK: enabling telemetry, configuring OTLP exporters, reading agent spans, linking traces to your app, and controlling sensitive data. Open dossier	Secure Deployment for Claude Agent SDK Applications A practical walkthrough of securely deploying Claude Agent SDK applications: the prompt-injection threat model, isolation options (sandbox runtime, containers, gVisor, VMs), least privilege, the proxy credential pattern, and filesystem controls. Open dossier
Next steps	Open dossier API JSON Open LLM Open source Newsletter Claim listing	Open dossier API JSON Open LLM Open source Newsletter Claim listing	Open dossier API JSON Open LLM Open source Newsletter Claim listing	Open dossier API JSON Open LLM Open source Newsletter Claim listing
Trust
Review status	Not reviewed	Not reviewed	Not reviewed	Not reviewed
Package trust	Package not verified	Package not verified	Package not verified	Package not verified
Source provenance	Source-backed	Source-backed	Source-backed	Source-backed
Submitter	JPette1783	JPette1783	JPette1783	JPette1783
Install risk	Review first	Review first	Review first	Review first
Notes	Safety ✓ Privacy ✓	Safety ✓ Privacy ✓	Safety ✓ Privacy ✓	Safety ✓ Privacy ✓
Brand	—	—	—	—
Category	guides	guides	guides	guides
Source	Source-backed	Source-backed	Source-backed	Source-backed
Author	JPette1783	JPette1783	JPette1783	JPette1783
Added	2026-06-05	2026-06-05	2026-06-05	2026-06-05
Platforms	Claude Code	Claude Code	Claude Code	Claude Code
Harness	Claude Code	Claude Code	Claude Code	Claude Code
Source repo	—	—	—	—
Safety notes	✓total_cost_usd / costUSD are client-side estimates from a bundled price table, not authoritative billing; do not bill end users or trigger financial decisions from them. Estimates can drift when pricing changes or the SDK version does not recognize a model; use the Usage and Cost API or Console for real billing. Both success and error result messages include usage and cost; read cost regardless of subtype so failed runs are still accounted for.	✓The skills option is a context filter, not a sandbox: unlisted skills are hidden from the model but their files remain on disk and are reachable via Read and Bash. Skills are model-invoked; pair them with a tight allowedTools list (and dontAsk where appropriate) so an invoked skill cannot use more tools than intended. The allowed-tools frontmatter in SKILL.md does not apply through the SDK; control tool access with the main allowedTools option.	✓Telemetry is off until you set CLAUDE_CODE_ENABLE_TELEMETRY=1 and choose an exporter; turning it on starts exporting usage data, so confirm the destination is approved. Do not use the console exporter through the SDK; stdout is the SDK's message channel. Point OTLP at a collector or local Jaeger instead. Traces are beta and require CLAUDE_CODE_ENHANCED_TELEMETRY_BETA=1; span names and attributes may change between releases.	✓Agents generate actions dynamically and can be influenced by content they process (prompt injection); apply defense in depth, not a single control. Use least privilege: mount only needed directories (prefer read-only), restrict network to specific endpoints, and drop Linux capabilities in containers. Inject credentials via a proxy outside the agent boundary so the agent never sees secrets; do not mount ~/.ssh, ~/.aws, .env, or similar into the agent.
Privacy notes	✓Usage data is token counts and cost, not content; it is safe to log, though it can reveal activity volume. Per-model and end-user attribution may be sent to an observability backend if you also enable telemetry; govern that data accordingly. The SDK uses prompt caching automatically; cache token fields reveal reuse patterns but not content.	✓Skill descriptions are loaded so the model can decide when to use them; keep sensitive workflow detail and secrets out of descriptions. Skills sourced from outside your project run their instructions in your sessions; review them before enabling. Skill content is sent to the model provider when a skill is invoked; treat it like any other prompt content.	✓By default telemetry is structural (durations, model/tool names, token counts), not content; opt-in vars (OTEL_LOG_USER_PROMPTS, OTEL_LOG_TOOL_DETAILS, OTEL_LOG_TOOL_CONTENT, OTEL_LOG_RAW_API_BODIES) add prompt and tool content. Exporter headers can carry tokens; supply them via the environment, not committed config. End-user attribution via OTEL_RESOURCE_ATTRIBUTES creates a per-user audit trail; handle that data per your policy and percent-encode values.	✓Even read-only code mounts can expose credentials in .env, ~/.git-credentials, ~/.aws, .npmrc, and key files; exclude or sanitize them before mounting. Route egress through a proxy that enforces a domain allowlist and logs requests, so a compromised agent cannot exfiltrate data to arbitrary hosts. The built-in sandbox proxy does not inspect TLS; for stronger guarantees use a TLS-terminating proxy with its CA installed in the agent's trust store.
Prerequisites	The Claude Agent SDK installed for Python or TypeScript. An async loop over query() results so you can read assistant and result messages. For authoritative billing, access to the Usage and Cost API or the Console.	The Claude Agent SDK installed for Python or TypeScript. SKILL.md files in .claude/skills/ (project) or ~/.claude/skills/ (user). A cwd that points at or below the directory containing .claude/skills/, within the same repository.	The Claude Agent SDK installed for Python or TypeScript. An OTLP-compatible backend or collector (Honeycomb, Datadog, Grafana, Jaeger, etc.). Ability to set environment variables for the process or via options.env.	A Claude Agent SDK application you intend to run beyond a trusted local laptop. Knowledge of which files, endpoints, and credentials the agent legitimately needs. Container, sandbox, or VM tooling appropriate to your isolation choice.
Install	—	—	—	—
Config	—	—	—	—
Citations	Source repositorygithub.com 2026-07-20T21:01:12+00:00 Documentationcode.claude.com Submitted by JPette17832026-06-05 Source methodology →	Source repositorygithub.com 2026-07-20T21:01:12+00:00 Documentationcode.claude.com Submitted by JPette17832026-06-05 Source methodology →	Source repositorygithub.com 2026-07-20T21:01:12+00:00 Documentationcode.claude.com Submitted by JPette17832026-06-05 Source methodology →	Source repositorygithub.com 2026-07-20T21:01:12+00:00 Documentationcode.claude.com Submitted by JPette17832026-06-05 Source methodology →
Claim	Unclaimed	Unclaimed	Unclaimed	Unclaimed

Open 4 picks in the interactive comparison tool

Related guides

Source-backed guides for putting this to work.

Building In-Process MCP Tools with the Claude Agent SDK

Define in-process MCP tools for the Claude Agent SDK with createSdkMcpServer and the tool helper, then wire them into query.

Added 1mo ago

guides Review first Source-backed Review first

Safety ✓ Privacy ✓by JPette1783

Signals

Loading live community signals…

Citation facts

Review trust signals before you adopt

Source and provenance checks

Safety and privacy checks

Package and install checks

Compare-driven decision checks

Copy & paste

Balanced adoption plan

Pre-adoption checks

Security checks

Rollout

Evidence readiness matrix · balanced

Source provenance

Metadata review

Safety notes

Privacy notes

Package integrity

Install payload

Decision timeline · balanced

Confirm source provenanceRequired

Check metadata review statusRequired

Review safety notesRequired

Review privacy notes

Validate package integrity metadata

Verify install payload and commandsRequired

Prerequisite readiness

Safety & privacy surface

Safety notes

Privacy notes

Prerequisites

Schema details

About this resource

Overview

Scopes

Get the total for a call

Per-step and per-model usage

Accumulate across calls

Cache tokens and failures

Source

Source citations

Add this badge to your README

How it compares

Related resources

Agent Skills in Claude Agent SDK Applications

OpenTelemetry Observability for Claude Agent SDK Agents

Secure Deployment for Claude Agent SDK Applications

Agent SDK Production Architect Agent

Related guides

Building In-Process MCP Tools with the Claude Agent SDK

Signals