# Caching
llm-usage-metrics uses caching to keep runs fast, stable offline, and predictable.
## Why caching exists

- reduce repeated network calls (pricing + update checks)
- avoid reparsing unchanged session files on every run
- keep deterministic behavior when the network is unavailable
## Cache layers

| Cache | Purpose | Default TTL | Location |
|---|---|---|---|
| update-check cache | avoids querying npm on every startup | 1 hour | `<platform-cache-root>/llm-usage-metrics/update-check.json` |
| pricing cache | stores normalized LiteLLM pricing data | 24 hours | `<platform-cache-root>/llm-usage-metrics/litellm-pricing-cache.json` |
| parse-file cache | stores parsed file diagnostics/events keyed by file fingerprint | 7 days | `<platform-cache-root>/llm-usage-metrics/parse-file-cache.json` |
On Linux with no `XDG_CACHE_HOME` set, `<platform-cache-root>` defaults to `~/.cache`.
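The Linux cache-root rule above can be sketched in a shell one-liner (macOS and Windows use their own platform cache roots, not shown here):

```shell
# Resolve the cache root as on Linux: XDG_CACHE_HOME if set, else ~/.cache.
CACHE_ROOT="${XDG_CACHE_HOME:-$HOME/.cache}"

# The pricing cache file from the table above would then live at:
PRICING_CACHE="$CACHE_ROOT/llm-usage-metrics/litellm-pricing-cache.json"
echo "$PRICING_CACHE"
```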
## How each cache works

### 1) Update-check cache

- read cached npm version if still fresh
- otherwise fetch latest npm version
- on fetch failure, fall back to the previous cached version when available
- optional session-scoped mode creates a per-shell cache file
Relevant env vars:

- `LLM_USAGE_SKIP_UPDATE_CHECK`
- `LLM_USAGE_UPDATE_CACHE_SCOPE` (`global` or `session`)
- `LLM_USAGE_UPDATE_CACHE_SESSION_KEY`
- `LLM_USAGE_UPDATE_CACHE_TTL_MS`
- `LLM_USAGE_UPDATE_FETCH_TIMEOUT_MS`
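For example, a shell profile could tune the update check with these variables; the session-key value below is purely illustrative:

```shell
# Skip the npm update check entirely for this shell:
export LLM_USAGE_SKIP_UPDATE_CHECK=1

# Or keep the check but scope its cache per shell session,
# using the shell PID as an illustrative session key:
export LLM_USAGE_UPDATE_CACHE_SCOPE=session
export LLM_USAGE_UPDATE_CACHE_SESSION_KEY="shell-$$"

# Tighten the fetch timeout to 2 seconds (value is in milliseconds):
export LLM_USAGE_UPDATE_FETCH_TIMEOUT_MS=2000
```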
### 2) Pricing cache

- tries fresh cache first
- if cache is stale and network is enabled, fetches LiteLLM pricing and rewrites cache
- if network fails, falls back to stale cache when possible
- with `--pricing-offline`, uses cache only and fails if no cache exists
Relevant options/env:

- `--pricing-offline`
- `--pricing-url`
- `LLM_USAGE_PRICING_CACHE_TTL_MS`
- `LLM_USAGE_PRICING_FETCH_TIMEOUT_MS`
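Since the TTL is expressed in milliseconds, it can be less error-prone to compute the value than to hard-code it (a sketch):

```shell
# 12-hour pricing cache TTL, computed rather than hard-coded:
PRICING_TTL_MS=$((12 * 60 * 60 * 1000))
export LLM_USAGE_PRICING_CACHE_TTL_MS="$PRICING_TTL_MS"
echo "$PRICING_TTL_MS"   # 43200000
```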
### 3) Parse-file cache

- key is `(source, file path)`
- cache validity requires a matching file fingerprint (`size`, `mtimeMs`) and TTL
- stores parse diagnostics and normalized events
- persisted as best-effort JSON, bounded by max entries and max byte size
Relevant env vars:

- `LLM_USAGE_PARSE_CACHE_ENABLED`
- `LLM_USAGE_PARSE_CACHE_TTL_MS`
- `LLM_USAGE_PARSE_CACHE_MAX_ENTRIES`
- `LLM_USAGE_PARSE_CACHE_MAX_BYTES`
- `LLM_USAGE_PARSE_MAX_PARALLEL`
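The fingerprint rule means any write that changes a file's size or `mtimeMs` invalidates its cache entry. A rough shell illustration of that idea (the temp file and logic here are illustrative, not the tool's implementation):

```shell
# Create a temp file and record the size component of its fingerprint:
f="$(mktemp)"
printf 'event-1\n' > "$f"
size_before=$(wc -c < "$f")

# Appending data changes the size (and mtime), so a cache entry keyed
# on the old fingerprint would no longer validate:
printf 'event-2\n' >> "$f"
size_after=$(wc -c < "$f")

fingerprint_changed=no
if [ "$size_before" -ne "$size_after" ]; then
  fingerprint_changed=yes
fi
echo "fingerprint changed: $fingerprint_changed"
rm -f "$f"
```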
## Tuning examples

Use pricing cache only (offline mode):

```shell
llm-usage monthly --pricing-offline
```

Increase parsing throughput and keep parse cache enabled:

```shell
LLM_USAGE_PARSE_MAX_PARALLEL=16 llm-usage daily
```

Shorten pricing cache TTL to 2 hours:

```shell
LLM_USAGE_PRICING_CACHE_TTL_MS=7200000 llm-usage monthly
```

## Troubleshooting cache behavior
- When pricing fails in offline mode, run once without `--pricing-offline` to warm the cache.
- For stale reports after source file changes, verify that the source files' `mtime` actually updated and check your cache TTL settings.
- To force update checks every run, set `LLM_USAGE_UPDATE_CACHE_TTL_MS=0`.
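Because the caches are persisted as best-effort JSON files, deleting them is a safe reset when an entry is suspected to be stale or corrupt; they are rebuilt on the next run. A sketch using the Linux default paths (adjust `<platform-cache-root>` for other platforms):

```shell
# Remove individual cache files; the tool rebuilds them as needed.
CACHE_DIR="${XDG_CACHE_HOME:-$HOME/.cache}/llm-usage-metrics"
rm -f "$CACHE_DIR/update-check.json"
rm -f "$CACHE_DIR/litellm-pricing-cache.json"
rm -f "$CACHE_DIR/parse-file-cache.json"
```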