Build Journal

Weekly Usage Report — Week 3 (Apr 20–26): 530 Million Accounted Tokens for £9.24

Week 3: 449.3M visible tokens plus 80.8M cached tokens, for 530.1M total accounted Hermes tokens across 2,288 sessions. Opus-equivalent API cost: about £6,543.

2026-04-27 · 5 min read

530 million accounted tokens in a single week, once cached context is included. That's not a typo. The visible input/output work alone was 449.3M tokens, and cached context adds another 80.8M on top, all for the price of a Pret subscription. And this time the week boundaries are right: a clean Monday-to-Sunday window, Apr 20–26.

This is Week 3 of my ongoing transparency series. Every Monday, I publish exactly what my AI agent consumed and what it cost. No rounding. No spin. Just honest numbers from Mission Control.

Token accounting

This report separates visible prompt/completion tokens from cached context. Visible tokens show fresh input/output work; cached tokens show repeated context reused during long agent sessions. Together, they show the full model-traffic footprint for the week.
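As a quick check on the accounting, here is a minimal Python sketch that adds the two buckets using this week's figures. The `WeeklyTokens` class and its field names are illustrative assumptions, not part of Mission Control or Hermes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class WeeklyTokens:
    """Hypothetical container for one week's token counts, in millions."""

    visible_m: float  # fresh prompt/completion tokens
    cached_m: float   # cached context reused across sessions

    @property
    def accounted_m(self) -> float:
        # Full footprint = visible + cached, rounded to the report's precision.
        return round(self.visible_m + self.cached_m, 1)

    @property
    def cached_share(self) -> float:
        # Fraction of the accounted total that came from cached context.
        return self.cached_m / (self.visible_m + self.cached_m)


week3 = WeeklyTokens(visible_m=449.3, cached_m=80.8)
print(f"{week3.accounted_m}M accounted")      # 530.1M accounted
print(f"{week3.cached_share:.1%} cached")     # 15.2% cached
```

On these figures, cached context makes up roughly 15% of the accounted total.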

The week in one picture

This is the headline version of Week 3: 449.3M visible tokens (530.1M with cached context), 2,288 sessions, and £9.24 in subscription route cost, with the curve getting ridiculous against per-token API pricing.

Infographic: Weekly Usage Report, Week 3. 530 million accounted tokens for £9.24, compared with per-token pricing.

Top visible model routes

| Model | Type | Share of visible route tokens | Cost |
| --- | --- | --- | --- |
| GLM-5.1 | Cloud (OAuth) | 40% | £4.62/wk |
| Qwen 3.5 9B | Local (Ollama) | 35% | Free |
| GPT-5.3 Codex | Cloud (OAuth) | 24% | £4.62/wk |

Model shares are visible-route estimates, not shares of the 530.1M cache-inclusive accounted total.

Qwen 3.5 9B jumped from 25% to 35% of the visible route mix this week — the local model carried more of the fresh input/output work. GLM-5.1 still led the heavy-context visible sessions, and GPT-5.3 Codex handled coding tasks. Cached context is included in the accounting block above, but not distributed across these route estimates.
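To put rough token counts on those shares, the percentages can be multiplied by the visible total. This is only a sketch: the shares are rounded estimates that sum to 99%, so the outputs are approximations, and the function name is a hypothetical helper.

```python
VISIBLE_TOKENS_M = 449.3  # Week 3 visible tokens, in millions

# Rounded route shares from the table above (they sum to 99%, not 100%).
route_shares = {
    "GLM-5.1": 0.40,
    "Qwen 3.5 9B": 0.35,
    "GPT-5.3 Codex": 0.24,
}


def estimate_route_tokens_m(total_m: float, shares: dict[str, float]) -> dict[str, float]:
    """Approximate visible tokens per route, in millions."""
    return {route: round(total_m * share, 1) for route, share in shares.items()}


estimates = estimate_route_tokens_m(VISIBLE_TOKENS_M, route_shares)
# Whatever the rounded shares do not cover (other routes or rounding error).
unattributed_m = round(VISIBLE_TOKENS_M - sum(estimates.values()), 1)

for route, tokens_m in estimates.items():
    print(f"{route}: ~{tokens_m}M")
print(f"Unattributed or rounding: ~{unattributed_m}M")
```

These estimates apply to visible tokens only; the 80.8M cached tokens are not split by route.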

Daily Breakdown

The Price Comparison

What would 449M tokens cost on per-token pricing?

On Opus per-token pricing, this week's 449.3M visible tokens would cost about £5,573. Annualised, that is roughly £290,000 (£5,573 × 52). For one person's AI usage. I paid £9.24.

The Opus multiplier climbed from 502x to 603x this week — not because Opus got more expensive, but because my token volume keeps growing while the bill stays flat. That's the subscription advantage compounding.
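The multiplier is just the per-token estimate divided by the flat subscription cost. A minimal Python sketch using the figures quoted in this section; the function name is illustrative, and the "implied rate" is derived from this post's numbers rather than taken from any published price list.

```python
WEEKS_PER_YEAR = 52


def cost_multiplier(per_token_cost_gbp: float, subscription_cost_gbp: float) -> float:
    """How many times more the per-token route would cost than the flat rate."""
    if subscription_cost_gbp <= 0:
        raise ValueError("subscription cost must be positive")
    return per_token_cost_gbp / subscription_cost_gbp


opus_week_gbp = 5573.0        # Opus-equivalent estimate for the week's visible tokens
subscription_week_gbp = 9.24  # what was actually paid
visible_tokens_m = 449.3      # visible tokens, in millions

multiplier = cost_multiplier(opus_week_gbp, subscription_week_gbp)
annual_gbp = opus_week_gbp * WEEKS_PER_YEAR
# Blended rate implied by the two figures above (an inference, not a price quote).
implied_rate = opus_week_gbp / visible_tokens_m

print(f"{multiplier:.0f}x")                   # 603x
print(f"£{annual_gbp:,.0f} per year")         # £289,796 per year
print(f"£{implied_rate:.2f} per million")     # £12.40 per million
```

Because the denominator stays fixed at £9.24, the multiplier scales directly with token volume.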

Notable Events

Wednesday Apr 22 — 100.7M Tokens

The week's biggest day. Blog icon generation for the ChatGPT image prompt, a full blog audit across all posts, and extensive logo/favicon work. 317 sessions averaging 318K tokens each. The I/O ratio hit 180:1 — the agent consumed massive context windows (reading full post files, design specs) while producing focused edits.

Monday Apr 20 — 85.3M Tokens

A strong start to the week. Heavy multi-agent delegation across Dade, Coder, and Plague for Mission Control dashboard work, blog writing, and Telegram conversations. The I/O ratio of 182:1 shows deep context work, typical of multi-step agent orchestration.

Tuesday Apr 21 — 84.7M Tokens

Hard Interference demo day. Category link wiring, Mission Control Memory tab build, CSS styling passes, and auto light/dark mode implementation. Nearly matched Monday's volume with 309 sessions at 274K average.
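The per-session averages quoted above follow from dividing each day's tokens by its session count. The sketch below reproduces them and, as a rough illustration, splits Wednesday's total by its I/O ratio. It assumes the ratio means input tokens per output token across the day's visible total; both helper names are hypothetical.

```python
def avg_tokens_per_session_k(total_tokens_m: float, sessions: int) -> float:
    """Average tokens per session, in thousands."""
    return total_tokens_m * 1_000 / sessions


def split_by_io_ratio(total_tokens_m: float, input_per_output: float) -> tuple[float, float]:
    """Split a total into (input, output) millions, assuming an input:output ratio of N:1."""
    output_m = total_tokens_m / (input_per_output + 1)
    return total_tokens_m - output_m, output_m


wed_avg_k = avg_tokens_per_session_k(100.7, 317)  # Wednesday Apr 22
tue_avg_k = avg_tokens_per_session_k(84.7, 309)   # Tuesday Apr 21
wed_input_m, wed_output_m = split_by_io_ratio(100.7, 180)

print(f"Wednesday: {wed_avg_k:.0f}K per session")   # Wednesday: 318K per session
print(f"Tuesday: {tue_avg_k:.0f}K per session")     # Tuesday: 274K per session
print(f"Wednesday output: ~{wed_output_m:.2f}M")    # Wednesday output: ~0.56M
```

Under that assumption, well under 1% of Wednesday's tokens were generated output; the rest was context being read.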

Week-over-Week Comparison

| Metric | Week 2 (Apr 14–20) | Week 3 (Apr 20–26) | Change |
| --- | --- | --- | --- |
| Total tokens | 378.3M | 449.3M | +18.8% |
| Total sessions | 1,146 | 2,288 | +99.7% |
| Cost | £9.24 | £9.24 | 0% |
| Effective rate | £0.024/M | £0.021/M | -16% |
| I/O ratio | 188:1 | 117:1 | Shift |

Note: Week 3 uses correct Mon–Sun boundaries (Apr 20–26). Previous weeks had offset boundaries, so exact comparisons are approximate.

Sessions nearly doubled. Tokens grew 19%. Cost didn't budge. The effective rate dropped about 16%, from roughly £0.024 to £0.021 per million tokens, because the fixed £9.24 now covers 19% more tokens. More sessions don't mean more cost; they mean the agent is doing more things, not bigger things. The I/O ratio shifted from 188:1 to 117:1, reflecting more interactive work (terminal sessions, shorter tasks) alongside the usual deep-context operations.
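The percentage changes can be recomputed from the token, session, and cost rows of the comparison table. A minimal Python sketch, assuming the effective rate is simply the weekly cost divided by that week's total tokens in millions (the helper names are illustrative):

```python
def pct_change(old: float, new: float) -> float:
    """Relative change from old to new, in percent."""
    return (new - old) / old * 100


def effective_rate_gbp_per_m(cost_gbp: float, tokens_m: float) -> float:
    """Flat weekly cost spread over the week's tokens (GBP per million)."""
    return cost_gbp / tokens_m


COST_GBP = 9.24
week2 = {"tokens_m": 378.3, "sessions": 1146}
week3 = {"tokens_m": 449.3, "sessions": 2288}

rate2 = effective_rate_gbp_per_m(COST_GBP, week2["tokens_m"])
rate3 = effective_rate_gbp_per_m(COST_GBP, week3["tokens_m"])

print(f"Tokens:   {pct_change(week2['tokens_m'], week3['tokens_m']):+.1f}%")   # Tokens:   +18.8%
print(f"Sessions: {pct_change(week2['sessions'], week3['sessions']):+.1f}%")   # Sessions: +99.7%
print(f"Rate:     £{rate2:.4f}/M -> £{rate3:.4f}/M")  # Rate:     £0.0244/M -> £0.0206/M
print(f"Rate change: {pct_change(rate2, rate3):+.1f}%")                        # Rate change: -15.8%
```

With a fixed cost, the rate change depends only on token growth: 19% more tokens means each million costs about 16% less.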

The Stack

| Component | Cost | Type |
| --- | --- | --- |
| GLM-5.1 (cloud) | £4.62/wk | OAuth subscription |
| GPT-5.3 Codex (cloud) | £4.62/wk | OAuth subscription |
| Qwen 3.5 9B (local) | £0 | Local Ollama |
| Gemma 4 31B (cloud) | £0 | Free tier |
| MiniMax M2.7 (cloud) | £0 | Free tier |
| Total | £9.24/wk | £480/year |

No API keys. No per-token billing. No surprise invoices.
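The stack total is easy to verify by summing the weekly costs and annualising. A short Python sketch using `Decimal` to avoid floating-point drift on currency; the dictionary simply mirrors the table, and the 52-week year is an assumption.

```python
from decimal import Decimal

# Weekly cost per component, in GBP, as listed in the stack table.
stack_weekly_gbp = {
    "GLM-5.1 (cloud)": Decimal("4.62"),
    "GPT-5.3 Codex (cloud)": Decimal("4.62"),
    "Qwen 3.5 9B (local)": Decimal("0"),
    "Gemma 4 31B (cloud)": Decimal("0"),
    "MiniMax M2.7 (cloud)": Decimal("0"),
}

weekly_total = sum(stack_weekly_gbp.values(), Decimal("0"))
annual_total = weekly_total * 52  # assumes a 52-week year

print(f"£{weekly_total}/wk")    # £9.24/wk
print(f"£{annual_total}/year")  # £480.48/year
```

Only the two OAuth subscriptions contribute to the bill; the local and free-tier models add capacity at no extra cost.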

The Bottom Line

Week 3: 449M tokens. 2,288 sessions. £9.24.

Sessions nearly doubled. Tokens grew 19%. The effective rate dropped about 16%. The subscription advantage compounds: every additional token makes the flat rate more absurd compared to per-token pricing.

603x cheaper than Opus. That's not a discount. That's a fundamentally different model of computing.


Found this useful? 👉 Follow @Raf_VRS for more transparent AI insights that put you in control of your hardware. 👉 Support the work: ko-fi.com/rafvrs

#VRSComputing #ModelBenchmarking #TokenUsage #AIAgents #CostTransparency