Get the latest tech news

AWS now allows prompt caching with 90% cost reduction


AWS added Intelligent Prompt Routing and Prompt Caching to Bedrock in hopes of getting model usage prices down.

To answer customer demand, AWS announced two new capabilities on Bedrock to cut the cost of running AI models and applications, that are already available on competitor platforms. Simple yes-or-no questions like “Do you have a reservation?” are managed by a smaller model, but more complicated ones like “What vegan options are available?” would be routed to a bigger one. Luma CEO and co-founder Amit Jain told VentureBeat that AWS is the first cloud provider partner of the company to host its models.

Get the Android app

Or read this on Venture Beat

Read more on:

Photo of AWS

AWS

Photo of % cost reduction

% cost reduction

Related news:

News photo

AWS pledges $100M in cloud credits to help education organizations build learning tools

News photo

AWS brings prompt routing and caching to its Bedrock LLM service

News photo

AWS brings third-party apps to its SageMaker AI platform