Get the latest tech news

Context caching guide


The Gemini API context caching feature is designed to reduce the cost of requests that contain repeat content with high input token counts. When to use context caching Context caching is particularly well suited to scenarios where a substantial initial context is referenced repeatedly by shorter requests.

Send feedback Stay organized with collections Save and categorize content based on your preferences. The Gemini API context caching feature is designed to reduce the cost of requests that contain repeat content with high input token counts. Storage duration: The amount of time cached tokens are stored, billed hourly.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Google Gemini

Google Gemini

Photo of Context Caching

Context Caching

Related news:

News photo

Google Gemini can power a virtual AI teammate with its own Workspace account

News photo

Google Gemini: Everything you need to know about the new generative AI platform

News photo

Google Gemini for Android might grab a neat trick from the web version soon