Get the latest tech news
Context caching guide
The Gemini API context caching feature is designed to reduce the cost of requests that contain repeat content with high input token counts. When to use context caching Context caching is particularly well suited to scenarios where a substantial initial context is referenced repeatedly by shorter requests.
Send feedback Stay organized with collections Save and categorize content based on your preferences. The Gemini API context caching feature is designed to reduce the cost of requests that contain repeat content with high input token counts. Storage duration: The amount of time cached tokens are stored, billed hourly.
Or read this on Hacker News