Cutting LLM costs 60%: caching, routing, and smaller models that still work | TechTrio Blog