Anthropic prompt caching lets you cache up to 80k token prefixes at 50% discount after a 5-minute
Tag: API optimization
Gemini 2.5 Flash review
Gemini 2.5 Flash is Google's workhorse model that trades reasoning depth for 3x faster outputs and dramatically lower costs. With thinking budget...