GPT-4o API Pricing 2026:
Cost Per Token, Calculator & Comparison

Complete breakdown of GPT-4o API costs, monthly estimates, and how it stacks up against Claude, Gemini, and other leading models.

GPT-4o API Pricing at a Glance

As of 2026, GPT-4o is priced at $2.50 per million input tokens and $10.00 per million output tokens on OpenAI's standard pay-as-you-go API.

Token TypePrice per 1M TokensPrice per 1K Tokens
Input (prompt)$2.50$0.0025
Cached Input$1.25$0.00125
Output (completion)$10.00$0.0100
Quick estimate: A typical chatbot exchange (500 input + 300 output tokens) costs approximately $0.00425 — meaning you can run ~235 conversations per dollar.
Heads up — GPT-4o is now a legacy model. OpenAI's current line is GPT-5.5 / GPT-5.4. GPT-4o is still callable at the same $2.50/$10 rate, but if you're choosing a model today, compare it against the current generation in the 2026 AI API pricing guide or the cost calculator.

GPT-4o vs. Other Top Models (2026)

How does GPT-4o compare to its closest competitors on price?

ModelProviderInput /1MOutput /1MContext
GPT-4o (legacy)OpenAI$2.50$10.00128K
GPT-5.4 (current)OpenAI$2.50$15.00400K
GPT-5.4 miniOpenAI$0.75$4.50400K
Claude Sonnet 4.6Anthropic$3.00$15.001M
Claude Haiku 4.5Anthropic$1.00$5.00200K
Gemini 3 FlashGoogle$0.50$3.001M
Gemini 2.5 ProGoogle$1.25$10.001M

Monthly Cost Estimates for GPT-4o

Real-world costs depend heavily on your input/output ratio and request volume. Here are estimates for common use cases:

Use CaseMonthly RequestsAvg Tokens/RequestEst. Monthly Cost
Small chatbot10,000500 in / 300 out~$42.50
Customer support50,000800 in / 400 out~$300
Document summarizer5,0002,000 in / 500 out~$75
Code assistant20,0001,000 in / 800 out~$210
High-volume API500,000300 in / 200 out~$1,375

When to Use GPT-4o vs. Cheaper Alternatives

GPT-4o sits in the mid-tier of 2026 pricing — not the cheapest, but not the most expensive. Here's when each tier makes sense:

Use GPT-4o when:

Consider GPT-4o mini instead when:

Consider Claude Sonnet or Gemini 2.5 Pro when:

How to Calculate Your GPT-4o Costs

The formula is straightforward:

Monthly cost = (Input tokens × $0.0000025) + (Output tokens × $0.00001)

Example: 10M input tokens + 3M output tokens = $25 + $30 = $55/month

For cached prompts (using OpenAI's prompt caching feature), input tokens are billed at 50% of the standard rate, dropping to $1.25/M — useful for applications that repeat the same system prompt or context across many requests.

GPT-4o Pricing FAQs

Does OpenAI charge for failed API calls?

No. You are only billed for tokens actually processed. If a request fails before processing, there is no charge. If it fails mid-response, you are charged for the tokens generated up to that point.

Are there volume discounts for GPT-4o?

OpenAI's standard API uses pay-as-you-go pricing with no automatic volume discounts. Enterprise customers can negotiate custom pricing. Batch API processing (asynchronous requests) is available at a 50% discount for non-time-sensitive workloads.

How does GPT-4o compare to GPT-4 Turbo?

GPT-4o replaced GPT-4 Turbo as OpenAI's flagship model. It is faster, cheaper, and matches or exceeds GPT-4 Turbo quality across most benchmarks, making GPT-4 Turbo essentially obsolete for new applications.

Calculate exactly what GPT-4o will cost for your specific usage — enter your token counts and get an instant estimate.

Open Free Cost Calculator →