1. Use case

Input tokensText you send to the AI
Output tokensResponse the AI generates
Reasoning tokens
AI's thinking process (some models only)
API callsTotal number of requests
Prompt cacheReuses 80% of repeated input to reduce cost
Show speed

Paste your prompt (optional)

2. Cost simulation

ModelTotal Min~Total Max
OpenAI text-embedding-3-small
$0.060~0.060
GPT OSS 120B
$1.30~2.17
Llama 4 Scout
$1.32~2.76
Nemotron 3 Nano 30B A3B
$1.47~2.43
DeepSeek V4 Flash
$2.27~3.61
Grok 4.1 Fast
$2.40~4.80
Llama 4 Maverick
$2.61~5.49
GPT-5 Nano
$2.79~4.71
Gemma 4 31B
$2.90~4.72
Dola Seed 2.0 mini
$2.94~4.86
Gemini 2.5 Flash Lite
$2.94~4.86
Nemotron 3 Super
$3.24~5.40
DeepSeek V3.2
$3.25~5.07
Longcat Flash Chat
$3.48~7.32
Grok 4.1 Fast (Reasoning)
$3.90~6.30
Mistral Small 4
$4.41~7.29
K-EXAONE
$5.88~9.72
Trinity Large Thinking
$6.27~10.35
DeepSeek V4 Pro
$7.05~11.22
MiniMax M2.5
$8.04~13.56
ERNIE 4.5 300B A47B
$8.10~13.38
MiniMax M2.7
$8.82~14.58
GPT-5.4 Nano
$8.85~14.85
Gemini 3.1 Flash Lite
$10.65~17.85
Qwen3.6 Flash
$10.65~17.85
Grok 4.20
$12.75~24.75
Qwen3.6 Plus
$13.84~23.20
Dola Seed 2.0 Lite
$13.95~23.55
GPT-5 Mini
$13.95~23.55
MiMo V2.5
$14.40~24.00
GLM-5
$14.47~23.69
Kimi K2.5
$14.52~24.12
Qwen3.5 397B A17B
$16.61~27.85
Gemini 2.5 Flash
$17.40~29.40
Nova 2 Lite
$17.40~29.40
Grok 4.20 (Reasoning)
$20.25~32.25
Grok 4.3
$20.25~32.25
Dola Seed 2.0 Pro
$21.30~35.70
Gemini 3 Flash
$21.30~35.70
MiMo V2 Pro
$22.80~37.20
MiMo V2.5 Pro
$22.80~37.20
Kimi K2.6
$25.35~42.15
GLM-5.1
$26.25~43.05
GLM 5V Turbo
$30.00~49.20
GPT-5.4 Mini
$31.95~53.55
GPT-4.1
$34.80~73.20
Claude Haiku 4.5
$36.00~60.00
Qwen3.6 Max
$44.30~74.26
Mistral Medium 3.5
$54.00~90.00
Gemini 2.5 Pro
$69.75~117.75
GPT-5
$69.75~117.75
Gemini 3.1 Pro
$85.20~142.80
GPT-5.4
$106.50~178.50
Claude Sonnet 4
$108.00~180.00
Claude Sonnet 4.5
$108.00~180.00
Claude Sonnet 4.6
$108.00~180.00
Claude Opus 4.5
$180.00~300.00
Claude Opus 4.6
$180.00~300.00
Claude Opus 4.7
$180.00~300.00
GPT-5.5
$213.00~357.00
Claude Opus 4
$540.00~900.00
Claude Opus 4.1
$540.00~900.00
GPT-5.4 Pro
$1278.00~2142.00
GPT-5.5 Pro
$1278.00~2142.00

4. Simulation summary

Cheapest model

OpenAI text-embedding-3-small

$0.060 /3,000calls

Best performance model

GPT-5.5

$213.00 /3,000calls

Calculation basis

Input tokens: 1,000

Output tokens: 1,200 ~ 2,800 (±40%)

Reasoning tokens: 1,000

Usage: 3,000 calls

Token presets are statistical averages for each scenario. Actual token counts vary depending on prompt content. Reasoning tokens only apply to models that support Extended Thinking.

Pricing last updated: 2026년 5월 7일