Google Gemini API Pricing 2026: Plans & Hidden Costs

LLM API Providers · Last verified May 6, 2026 · Medium confidence · 1 source · Claim this profile

Price range $0–$18 /per million tokens · free + paid

Plans 4 incl. free tier

Sources 1 verified data point

Confidence

65%

medium

Compare Google Gemini API to:

Groq Together AI Fireworks AI + Build a 3-way comparison

Price checkPer per million tokens

FreeFree Flash-Lite (Paid)Custom Flash (Paid)Custom

See all 4 plans

All Google Gemini API Plans & Pricing

Plan	Monthly	Annual	Best For
Free rate_limit: Rate-limited for prototyping	Free	Free	Prototyping and evaluation
Verified pricing · last checked May 2026 · 1 source Get this price at Google Gemini API →
What's included at Free Best for: Prototyping and evaluation Free API key via Google AI Studio Gemini 2.5 Flash-Lite: free input & output Gemini 3 Flash Preview: free input & output Gemini 3.1 Flash-Lite Preview: free input & output Rate-limited for prototyping Content used to improve Google products Limits rate_limitRate-limited for prototyping
Flash-Lite (Paid)	Contact Sales	Contact Sales	High-volume, cost-sensitive production workloads
Verified pricing · last checked May 2026 · 1 source Get this price at Google Gemini API →
What's included at Flash-Lite (Paid) Best for: High-volume, cost-sensitive production workloads Gemini 2.5 Flash-Lite: $0.10 input / $0.40 output per M tokens Gemini 3.1 Flash-Lite Preview: $0.25 input / $1.50 output per M tokens Most cost-efficient Gemini models Batch API: 50% cost reduction Great for high-volume, cost-sensitive workloads
Flash (Paid)	Contact Sales	Contact Sales	Production apps balancing cost and capability
Verified pricing · last checked May 2026 · 1 source Get this price at Google Gemini API →
What's included at Flash (Paid) Best for: Production apps balancing cost and capability Gemini 2.5 Flash: $0.30 input / $2.50 output per M tokens Gemini 3 Flash Preview: $0.50 input / $3.00 output per M tokens Balanced speed and capability Multimodal: text, image, video, audio Audio input: $1.00/M tokens
Pro (Paid)	Contact Sales	Contact Sales	Complex reasoning, long-context, and multimodal tasks
Verified pricing · last checked May 2026 · 1 source Get this price at Google Gemini API →
What's included at Pro (Paid) Best for: Complex reasoning, long-context, and multimodal tasks Gemini 2.5 Pro: $1.25 input (≤200K) / $10.00 output per M tokens $2.50 input / $15.00 output for prompts >200K tokens Gemini 3.1 Pro Preview: $2.00 input (≤200K) / $12.00 output per M tokens $4.00 input / $18.00 output for prompts >200K tokens Google Search grounding: $14/1,000 queries (5,000/mo free) Context caching available (up to 90% input cost reduction)

View all features by plan (compare side-by-side)

Free

Free API key via Google AI Studio
Gemini 2.5 Flash-Lite: free input & output
Gemini 3 Flash Preview: free input & output
Gemini 3.1 Flash-Lite Preview: free input & output
Rate-limited for prototyping
Content used to improve Google products

Flash-Lite (Paid)

Gemini 2.5 Flash-Lite: $0.10 input / $0.40 output per M tokens
Gemini 3.1 Flash-Lite Preview: $0.25 input / $1.50 output per M tokens
Most cost-efficient Gemini models
Batch API: 50% cost reduction
Great for high-volume, cost-sensitive workloads

Flash (Paid)

Gemini 2.5 Flash: $0.30 input / $2.50 output per M tokens
Gemini 3 Flash Preview: $0.50 input / $3.00 output per M tokens
Balanced speed and capability
Multimodal: text, image, video, audio
Audio input: $1.00/M tokens

Pro (Paid)

Gemini 2.5 Pro: $1.25 input (≤200K) / $10.00 output per M tokens
$2.50 input / $15.00 output for prompts >200K tokens
Gemini 3.1 Pro Preview: $2.00 input (≤200K) / $12.00 output per M tokens
$4.00 input / $18.00 output for prompts >200K tokens
Google Search grounding: $14/1,000 queries (5,000/mo free)
Context caching available (up to 90% input cost reduction)

Try Google Gemini API Free

Compare Google Gemini API with alternativesAdjust seats, lock a tier, add up to 2 more products side-by-side. Shareable URL.

Quick Answer

Last verified: May 6, 2026

Medium confidence

Google Gemini API costs Free to $18 per per million tokens as of July 2026, with 4 plans available including a free tier. Plan: Free (free). Enterprise pricing is available on request. Pricing depends on your chosen tier, contract length, and negotiated discounts.

Use the interactive pricing calculator to estimate your exact cost based on team size and requirements.

Free tier: Yes

Google Gemini API offers 4 pricing tiers: Free, Flash-Lite (Paid), Flash (Paid), Pro (Paid). The Flash-Lite (Paid) plan is high-volume, cost-sensitive production workloads.

Compared to other llm api providers software, Google Gemini API is positioned at the budget-friendly price point.

4 documented hidden costs beyond list price

How much does Google Gemini API cost?

Google Gemini API offers 4 pricing plans, starting with a free tier and scaling to custom enterprise pricing. Plans include Free (free), Flash-Lite (Paid) (custom pricing), Flash (Paid) (custom pricing), Pro (Paid) (custom pricing).

Google Gemini API Pricing Overview

Google Gemini API has 4 pricing plans, including a free tier. Paid plans range from $0 to $18/per million tokens. The Free plan is free and is best for prototyping and evaluation. The Flash-Lite (Paid) plan requires contacting sales for a custom quote and is designed for high-volume, cost-sensitive production workloads. The Flash (Paid) plan requires contacting sales for a custom quote and is designed for production apps balancing cost and capability. The Pro (Paid) plan requires contacting sales for a custom quote and is designed for complex reasoning, long-context, and multimodal tasks.

There are at least 4 documented hidden costs beyond Google Gemini API's list price, including implementation, training, and add-on fees.

This pricing was last verified in May 6, 2026 from 1 independent source.

Try Google Gemini API Free

Google Gemini API pricing starts at $0 on the Free tier, which provides rate-limited access to Gemini models via AI Studio for prototyping. For production workloads, the Flash-Lite (Paid), Flash (Paid), and Pro (Paid) tiers are all billed on a per-token usage basis with no monthly subscription or minimum commitment. According to Artificial Analysis data from April 2026, the provider median across 51 tracked models sits at $0.56 per 1M input tokens and $2.20 per 1M output tokens, with model-level rates ranging from $0 (open Gemma models) up to $10.00 per 1M output tokens for Gemini 2.5 Pro.

How Google Gemini API Pricing Compares

Compare Google Gemini API pricing against top alternatives in LLM API Providers.

Groq $0-$3.0/per million tokens Compare → Together AI $0.03-$9.95/per million tokens / hour Compare → Fireworks AI $0-$9/per million tokens / hour Compare →

Usage-Based Rates

Per-unit pricing for Google Gemini API API usage.

Flash-Lite (Paid)

Model	Unit	Rate
Gemini 2.5 Flash-Lite	1M input tokens	$0.100
Gemini 2.5 Flash-Lite	1M output tokens	$0.400
Gemini 3.1 Flash-Lite Preview	1M input tokens	$0.250
Gemini 3.1 Flash-Lite Preview	1M output tokens	$1.50
Gemini 3.1 Flash-Lite Preview	1M cached input tokens	$0.025

Same rate regardless of context length
Audio input at $0.50/M tokens for 3.1 Flash-Lite
Context caching storage: $1.00/M tokens per hour

Flash (Paid)

Model	Unit	Rate
Gemini 2.5 Flash	1M input tokens	$0.300
Gemini 2.5 Flash	1M output tokens	$2.50
Gemini 2.5 Flash (thinking)	1M output tokens	$3.50
Gemini 3 Flash Preview	1M input tokens	$0.500
Gemini 3 Flash Preview	1M output tokens	$3.00
Gemini 3 Flash Preview	1M cached input tokens	$0.050

Thinking/reasoning output billed at higher rate for 2.5 Flash
3 Flash Preview output price includes thinking tokens
Audio input at $1.00/M tokens
Context caching storage: $1.00/M tokens per hour

Pro (Paid)

Model	Unit	Rate
Gemini 2.5 Pro (≤200K ctx)	1M input tokens	$1.25
Gemini 2.5 Pro (>200K ctx)	1M input tokens	$2.50
Gemini 2.5 Pro	1M output tokens	$10.00
Gemini 2.5 Pro (thinking)	1M output tokens	$15.00
Gemini 3.1 Pro Preview (≤200K ctx)	1M input tokens	$2.00
Gemini 3.1 Pro Preview (>200K ctx)	1M input tokens	$4.00
Gemini 3.1 Pro Preview (≤200K ctx)	1M output tokens	$12.00
Gemini 3.1 Pro Preview (>200K ctx)	1M output tokens	$18.00
Gemini 3.1 Pro Preview (≤200K ctx)	1M cached input tokens	$0.200
Gemini 3.1 Pro Preview (>200K ctx)	1M cached input tokens	$0.400

Input price doubles above 200K context window for both models
2.5 Pro has separate thinking output rate; 3.1 Pro output includes thinking
Context caching storage: $4.50/M tokens per hour for 3.1 Pro

Compare Google Gemini API vs Alternatives

Before committing to Google Gemini API, compare pricing with these 3 alternatives in the same category.

VSGroq

Free

Prototyping and evaluation

Full comparison

VSTogether AI

From $0.03/per million tokens / hour

Variable-volume API usage

Full comparison

VSFireworks AI

Free

Variable-volume API usage

Full comparison

All Google Gemini API alternatives & migration guides

What Companies Actually Pay for Google Gemini API

Median per-1M-token pricing across 24 models

Input $0.400/1M

Output $2.50/1M

Flagship models in this provider's catalog

Model	Input /1M	Output /1M	Blended /1M
google/gemini-3.1-pro-preview	$2.00	$12.00	—
google/gemini-3-pro-image	$2.00	$12.00	—
google/gemini-3.5-flash	$1.50	$9.00	—
google/gemini-2.5-pro	$1.25	$10.00	—
google/gemini-3.1-flash-image	$0.500	$3.00	—

Review scores

Trustpilot 1.2out of 5 (15)

Third-party review aggregates, as of Jul 2026

Top pricing complaints

Frequent factual errors and inconsistent outputs requiring manual verificationOpaque content policy violations with no explanation or recourseSlow performance and reliability issues with video and generative featuresMandatory platform migrations causing lost work and blocked applications

Source: OpenRouter API — medians aggregated from 24 models routed. Reflects router-surface pricing (may include modest markup vs direct provider rates).

Google Gemini API Year 1 Total Cost by Company Size

Real deployment costs including licenses, implementation, training, and admin — not just the sticker price.

Individual Developer (Free Tier) $0 Year 1 total

Solo developer prototyping or building a low-traffic application using the Gemini API Free tier via AI Studio, staying within free-tier rate limits.

Production App — Flash Model ~$16.50/month ($0.30 × 10M input + $2.50 × 5M output, per OpenRouter median rates as of July 2026) Year 1 total

$0.30 × 10M input

$2.50 × 5M output, per OpenRouter median rates as of July 2026

Total ~$16.50/month ($0.30 × 10M input + $2.50 × 5M output, per OpenRouter median rates as of July 2026)

A production application processing approximately 10 million input tokens and 5 million output tokens per month using the Flash (Paid) tier via OpenRouter at median provider rates.

Production App — Pro Model ~$62.50/month ($1.25 × 10M input + $10.00 × 5M output) Year 1 total

$1.25 × 10M input

$10.00 × 5M output

Total ~$62.50/month ($1.25 × 10M input + $10.00 × 5M output)

A production application processing approximately 10 million input tokens and 5 million output tokens per month using the Pro (Paid) tier at current OpenRouter rates for Gemini 2.5 Pro.

CURRENT TIER DATA

How Google Gemini API Pricing Compares

Software	Starting Price	Top Price
Google Gemini API	Free	$18/per million tokens
Amazon Bedrock	$0.07/per million tokens	$75/per million tokens
Anyscale	$0.15/per million tokens	$5/per million tokens
Baidu ERNIE API	$0.1/per million tokens	$10/per million tokens
Cerebras Inference API	$0.1/per million tokens	$6/per million tokens
Claude API	$0.03/per million tokens	$75/per million tokens

Detailed pricing comparisons:

Browse all LLM API Providers pricing →

4 Google Gemini API Hidden Costs Beyond the List Price

Beyond the listed price, Google Gemini API has at least 4 documented hidden costs that can significantly increase total cost of ownership.

Watch for 4 hidden costs

Content Policy Restrictions 5-15% of license costs
medium 3 sources

Trustpilot "FLOW is, quite possibly, the worst AI video creator I have ever encountered. I created images with Gemini (same company) and when I loaded them into FLOW to create video, it changed the subject to something random OR I got a violate policies message."
Trustpilot "Their Tyrannically BS restriction really makes me wanna go Super-Saiyan RN"
Trustpilot "very strict censorship with google, my videos are dark like my music so most times it just comes up saying i am breaking their policies etc...or google gets the VIDEO text wrong 95% of time!!"
Output Reliability and Accuracy Costs 10-25% of license costs
high 3 sources

Trustpilot "Its so full of errors and even lies. Has cost me alot to follow its advice. Should be taken of marked until its works and dont ruin projects/economy."
Trustpilot "never get a sraight answer, If the very same question is asked again you get a different answer."
Trustpilot "You must check the google answer for correctness yourself and critically assess whether or not it could be correct."
Platform Migration Costs 5-20% of license costs
high 2 sources

Trustpilot "I spent months creating an app in Firebase Studio and when I did the mandatory migration to AI Studio... IT IS BLOCKED. Google IS USELESS."
Trustpilot "So time-consuming with zero results, it's full of bugs, crashes way too often, you can spend days making an app and there is no way you're ever getting that app working outside of googles UI environment."
Free Tier Data Usage Policy 5-10% of license costs

Tip

Ask your Google Gemini API sales rep about these costs upfront. Getting them in writing before signing can save you from surprise charges later.

Full hidden costs breakdown →

Intelligence sourced from 1 independent sources

Trustpilot Consumer reviews

Key claims include inline source attribution. Data verified against multiple independent sources. 13 source citations total.

Google Gemini API Contract Terms

Google Gemini API contracts do not auto-renew. Changes require advance notice. These terms are sourced from verified buyer experiences.

Contract Terms

Auto-Renewal No

Mid-Term Downgrade Not allowed

Payment Terms Pay-as-you-go per token consumed; no upfront commitment required for standard API access

How to Negotiate Google Gemini API Pricing

Google Gemini API contracts are negotiable. These 4 tactics are sourced from real buyer experiences and procurement specialists.

Negotiation Playbook 4 tactics

Start on Free Tier high success

The Gemini API Free tier requires no credit card and provides access to all major models via AI Studio. Use this to prototype and measure actual token consumption before committing to any paid volume, giving you precise data to negotiate committed-use discounts.

CURRENT TIER DATA

Model Tier Selection high success

Flash-Lite (via OpenRouter at ~$0.25/1M input) costs approximately 8x less than Pro models (~$2.00/1M input). Benchmark your specific workload across model tiers before defaulting to Pro — many tasks perform acceptably on Flash or Flash-Lite at a fraction of the cost.

OpenRouter pricing data

Negotiate Google Cloud Committed Use medium success

For high-volume production workloads, negotiate Committed Use Discounts (CUDs) through Google Cloud's enterprise sales team. CUDs typically require 1-year or 3-year commitments but can yield significant per-token savings over pay-as-you-go rates.

General Google Cloud pricing doctrine

Leverage Competing Provider Pricing medium success

OpenRouter routes Gemini models at provider-median blended rates of ~$0.85/1M tokens. Use competing provider pricing (OpenRouter, Vertex AI pricing) as leverage in any enterprise negotiation with Google to justify custom rate discussions.

OpenRouter pricing data

Full negotiation guide →

Google Gemini API Pricing FAQ

01 How much does the Google Gemini API cost?

Gemini API pricing varies by model. The cheapest option is Gemini 2.5 Flash-Lite at $0.10 per million input tokens and $0.40 per million output tokens. Gemini 2.5 Pro costs $1.25/$10.00 per million tokens (≤200K context). A free tier is available with up to 1,500 requests/day on Flash models via Google AI Studio.

02 Is the Gemini API free?

Yes, Google offers a free tier for the Gemini API through Google AI Studio. The free tier provides access to Flash models with up to 1,500 requests/day and free input/output tokens. Pro models also have a free tier but are rate-limited. For production use, you pay per token on the paid tier with no monthly minimum.

03 Gemini API vs OpenAI API: which is cheaper?

Gemini is generally cheaper than OpenAI for comparable models. Gemini 2.5 Flash at $0.30/$2.50 per million tokens is significantly cheaper than GPT-4o. Gemini 2.5 Pro at $1.25/$10.00 per million tokens undercuts GPT-4o pricing. For budget workloads, Gemini Flash-Lite at $0.10/$0.40 per million tokens has no OpenAI equivalent at that price.

04 What is context caching in the Gemini API?

Context caching lets you cache repeated prompt content (like system instructions or documents) and reuse it across multiple requests. Cached tokens are billed at roughly 90% discount compared to fresh input tokens. This is highly cost-effective for applications that repeatedly process the same large documents or instructions.

05 What is the Batch API discount on Gemini?

The Gemini API Batch API offers a 50% cost reduction on token pricing for asynchronous workloads. Batch requests are processed within 24 hours. This is ideal for offline data processing, bulk classification, or any task that doesn't require real-time responses.

06 Does the Google Gemini API have a free tier?

Yes. The Free tier provides access to Gemini models via AI Studio at no cost, subject to rate limits on requests per minute and per day. It is designed for prototyping and low-volume experimentation, not production-scale workloads.

07 How is the Google Gemini API billed on paid plans?

The Flash-Lite (Paid), Flash (Paid), and Pro (Paid) tiers are all billed on a per-token usage basis with no monthly subscription fee. According to Artificial Analysis data as of April 2026, the provider median across 51 tracked models is $0.56 per 1M input tokens and $2.20 per 1M output tokens, with individual models ranging from near-free (Gemma open models at $0) to premium (Gemini 2.5 Pro at $1.25/$10.00 per 1M input/output tokens).

08 What is the difference between Flash-Lite, Flash, and Pro tiers?

Flash-Lite (Paid) targets the lowest-cost, highest-throughput use cases. Flash (Paid) balances speed and capability for most production workloads. Pro (Paid) is the highest-capability tier suited for complex reasoning tasks. All three are strictly usage-based — there is no monthly minimum or subscription commitment.

09 Can I use Google Gemini API models for free indefinitely?

Yes, through the Free tier. Google provides free access to Gemini models via AI Studio with rate limits, and several Gemma open-weight models are available at $0 per token even on paid infrastructure, according to Artificial Analysis data (April 2026).

10 What models are available through the Gemini API?

The Gemini API offers multiple model tiers: Flash-Lite (the most cost-efficient), Flash (balanced performance and cost), and Pro (highest capability). As of July 2026, specific versions available via OpenRouter include Gemini 2.5 Flash Lite, Gemini 2.5 Flash, Gemini 2.5 Pro, Gemini 3.1 Flash Lite, Gemini 3.1 Flash, Gemini 3.1 Pro Preview, and Gemini 3.5 Flash, among others.

11 When should I upgrade from the free tier to a paid plan?

Upgrade to a paid tier when: (1) you consistently hit free-tier rate limits, (2) you are deploying to production and need higher throughput or SLA guarantees, or (3) you need access to the full Pro model lineup at full context lengths. The free tier is designed for prototyping, not production traffic.

12 Can my free-tier usage data be used to train Google models?

Yes — requests made under the free tier may be used by Google to improve its models. If your use case involves sensitive or proprietary data, you should use the paid API tier, which typically includes data-processing terms that prevent use of your data for model training.

Is this pricing incorrect? — we'll verify and update it.

1 AI answer about this product’s pricing was flagged for review against CostBench’s record. Claim this record free to see them

Try Google Gemini API Free