
Keep AI spend flat while token usage grows
Coinbase cut its AI bill nearly in half while token usage kept growing. The lever that did the most work was routing: send each request to the cheapest model that can handle it, cache-aware, instead of running the whole agent loop on one frontier model.






































