Google Tweaks Gemini Pricing To Cut AI Costs

This article first appeared on GuruFocus. Google (NASDAQ:GOOG) is changing how it prices Gemini, and the idea is pretty straightforward: give developers more control over how much they spend based on how fast they need things to run. Instead of a one-size-fits-all setup, Google is now offering different tiers like Standard, Flex, Priority, Batch and…


Google Tweaks Gemini Pricing To Cut AI Costs

This article first appeared on GuruFocus.

Google (NASDAQ:GOOG) is changing how it prices Gemini, and the idea is pretty straightforward: give developers more control over how much they spend based on how fast they need things to run.

Instead of a one-size-fits-all setup, Google is now offering different tiers like Standard, Flex, Priority, Batch and Caching. Each one is built for a different kind of workload. For example, if speed is not critical, the Flex tier lets you run tasks at about a 50% discount by using off-peak capacity, though results can take anywhere from 1 to 15 minutes. The Batch option is even slower, with jobs taking up to 24 hours, but it also comes with similar cost savings.

On the flip side, if something needs to happen instantly, like a chatbot or fraud detection system, the Priority tier is there, but it costs 75% to 100% more than standard. Then there is the Caching tier, which helps cut costs when you are working with repeated data or large files.

Source link