Cost per model
Understanding Model Costs
Different AI models have varying pricing structures based on their capabilities and computational requirements. Here's a detailed breakdown of the costs for each available model. You'll see that the cost for doing any kind of chat is very, very, low. It is quite difficult to spend more than a few cents per chat, even with the most capable models.
What is Blended Cost?
Blended cost is a simplified metric that represents the average cost per 1 million tokens for a model, taking into account typical usage patterns where conversations include both input (prompt) and output (completion) tokens. This gives you a single number to compare models at a glance, making it easier to choose the most cost-effective option for your needs.
The blended cost is calculated based on realistic conversation patterns where output tokens are typically 3x input tokens, meaning for every word you send to the AI you get three words back on average. This metric helps you understand the practical cost of using each model in real-world scenarios.
Real cost examples
A single, short question for Perplexity Reasoning:
- 363 total tokens
- $0.0005 per message
- $0.0068 total cost
A longer chat with Claude 4.5 Sonnet to learn about SEO best practices:
- 3,020 total tokens
- $0.05 total cost
Complete Model Pricing Table
The table below shows all available models with their detailed pricing information. The Blended Cost column provides a quick comparison metric for typical usage patterns.
Model | Input (per 1M tokens) | Output (per 1M tokens) | Blended Cost | Additional Fees | Description |
---|---|---|---|---|---|
Gemini 2.5 Flash | $0.10 | $0.40 | $0.18 | - | Fast and efficient with vision, reasoning, and tool support |
GPT-5 Mini | $0.25 | $2.00 | $0.69 | - | Small and powerful OpenAI model |
Qwen 3 Thinking | $0.65 | $3.00 | $1.24 | - | Chinese reasoning model |
GPT-5 | $1.25 | $10.00 | $3.44 | - | The new flagship from OpenAI |
Gemini 2.5 Pro | $1.25 | $10.00 | $3.44 | - | The latest and greatest from Google with full tool support |
Perplexity Reasoning | $2.00 | $8.00 | $3.50 | $5.00 per 1K requests | Industry leading search |
GPT-4o | $2.50 | $10.00 | $4.38 | - | The one you know and love |
Claude 4.5 Sonnet | $3.00 | $15.00 | $6.00 | - | Often smarter than GPT-4o (Our default) |
Grok 4 | $3.00 | $15.00 | $6.00 | - | The new hyped model |
Deep Research (pplx) | $3.00 | $15.00 | $6.00 | $5.00 per 1K requests | Will do deep research on the web |
Claude Opus 4.1 | $15.00 | $75.00 | $30.00 | - | Anthropic's best model |
Understanding the Table
- Input/Output costs are per 1 million tokens
- Blended Cost represents typical usage patterns combining input and output
- Additional Fees apply to some models (like Perplexity) on top of token costs
- Models are sorted by blended cost from most affordable to most expensive
Cost Optimization Tips
- Choose the Right Model: For help selecting the most cost-effective model for your needs, check our model selection guide.
- Optimize Prompt Length: Since prompt tokens are charged, keep your inputs concise while maintaining clarity.
- Keep conversations short: This is the most important hack. To create the sense of conversation that ChatGPT pioneered, with every message you sent, the ENTIRE conversation is sent to the model. This means chat cost increases exponentially with the length of the conversation.
Related Resources
GPT-5 Pro Guide - When to Use OpenAI's Premium Reasoning Model
Complete guide to GPT-5 Pro, when it's worth the premium pricing, and how to maximize its advanced reasoning capabilities
Gemini vs Claude vs GPT (2025): Cost, Quality, and Best Use Cases
Expert comparison of Google Gemini 2.5, Anthropic Claude 4, and OpenAI GPT models. Includes real blended costs, strengths, and practical recommendations for 2025.
GPT-5 Mini Guide - Efficient Reasoning for Everyday Tasks
Complete guide to GPT-5 Mini, OpenAI's efficient reasoning model that balances cost and capability
Cost Optimization with New Models: Maximizing Value Across the AI Lineup
Smart strategies for minimizing AI costs while maximizing output quality. Learn which models to use when, how to combine them efficiently, and real cost-saving techniques.