Keep AI spend flat while token usage grows

Coinbase cut its AI bill nearly in half while token usage kept growing. The lever that did the most work was routing: send each request to the cheapest model that can handle it, cache-aware, instead of running the whole agent loop on one frontier model.

Tejas Bhakta
Tejas Bhakta
June 26, 20263 min read
Keep AI spend flat while token usage grows