Cost per model

Understanding Model Costs

Different AI models have varying pricing structures based on their capabilities and computational requirements. Here's a detailed breakdown of the costs for each available model. You'll see that the cost for doing any kind of chat is very, very, low. It is quite difficult to spend more than a few cents per chat, even with the most capable models.

What is Blended Cost?

Blended cost is a simplified metric that represents the average cost per 1 million tokens for a model, taking into account typical usage patterns where conversations include both input (prompt) and output (completion) tokens. This gives you a single number to compare models at a glance, making it easier to choose the most cost-effective option for your needs.

The blended cost is calculated based on realistic conversation patterns where output tokens are typically 3x input tokens, meaning for every word you send to the AI you get three words back on average. This metric helps you understand the practical cost of using each model in real-world scenarios.

Real cost examples

A single, short question for Perplexity Reasoning:

  • 363 total tokens
  • $0.0005 per message
  • $0.0068 total cost

A longer chat with Claude 4 Sonnet to learn about SEO best practices:

  • 3,020 total tokens
  • $0.05 total cost

Complete Model Pricing Table

The table below shows all available models with their detailed pricing information. The Blended Cost column provides a quick comparison metric for typical usage patterns.

ModelInput (per 1M tokens)Output (per 1M tokens)Blended CostAdditional FeesDescription
Gemini 2.5 Flash$0.10$0.40$0.18-Fast and efficient with vision, reasoning, and tool support
GPT-5 Mini$0.25$2.00$0.69-Small and powerful OpenAI model
Qwen 3 Thinking$0.65$3.00$1.24-Chinese reasoning model
GPT-5$1.25$10.00$3.44-The new flagship from OpenAI
Gemini 2.5 Pro$1.25$10.00$3.44-The latest and greatest from Google with full tool support
Perplexity Reasoning$2.00$8.00$3.50$5.00 per 1K requestsIndustry leading search
GPT-4o$2.50$10.00$4.38-The one you know and love
Claude 4 Sonnet$3.00$15.00$6.00-Often smarter than GPT-4o (Our default)
Grok 4$3.00$15.00$6.00-The new hyped model
Claude 3.5 Sonnet$3.00$15.00$6.00-Often smarter than GPT-4o
Deep Research (pplx)$3.00$15.00$6.00$5.00 per 1K requestsWill do deep research on the web
Claude Opus 4.1$15.00$75.00$30.00-Anthropic's best model

Understanding the Table

  • Input/Output costs are per 1 million tokens
  • Blended Cost represents typical usage patterns combining input and output
  • Additional Fees apply to some models (like Perplexity) on top of token costs
  • Models are sorted by blended cost from most affordable to most expensive

Cost Optimization Tips

  1. Choose the Right Model: For help selecting the most cost-effective model for your needs, check our model selection guide.
  2. Optimize Prompt Length: Since prompt tokens are charged, keep your inputs concise while maintaining clarity.
  3. Keep conversations short: This is the most important hack. To create the sense of conversation that ChatGPT pioneered, with every message you sent, the ENTIRE conversation is sent to the model. This means chat cost increases exponentially with the length of the conversation.

Copyright © 2025 magicdoor.ai