Google

Gemini 3.1 Pro

Name: Google Gemini 3.1 Pro
Author: Google

Compare

Model ID:gemini-3.1-pro-preview

2026-02-19

Compare

Gemini 3.1 Pro is Google's most advanced reasoning model, significantly outperforming Gemini 3 Pro across software engineering, agentic reliability, and token efficiency. It supports a 1M-token context window with multimodal inputs including text, images, video, audio, code, and PDFs, and introduces a new medium thinking level for better cost-speed-performance balance. The model excels at agentic coding, structured planning, financial modeling, spreadsheet automation, and high-context enterprise tasks requiring long-horizon stability and autonomous tool orchestration.

Google AI PlusGoogle AI ProGoogle AI UltraAPI|VisionReasoningWeb SearchFile|Proprietary Model

Knowledge Cutoff

2025-01-31

The date this AI finished learning. It may not know about things that happened after this date.

Input → Output Format

The types of content this AI can receive, and what it can produce in return.

Context Memory

1.0MIN66KOUT

The maximum amount of text the AI can read and process in a single request. A larger number means it can handle longer documents or conversations.

Cost/1M Words

$2IN$12OUT

The cost of using this AI directly in your own application. Shown in USD per 1 million units of text (tokens).

Calculate Cost

Source:Official Docs Google DeepMind MMLU-Pro Leaderboard OpenRouter

AI Performance Evaluation

Arena Overall Score

1493

±5

As of 2026-05-01

Overall Rank

No.4

28,096 Votes

Arena by Ability

Hard Prompts

1513±6No.5

Expert Knowledge

1519±13No.7

Instruction Following

1488±7No.5

Conversation Memory

1501±9No.5

Creative

1490±10🥉 No.3

Coding

1529±8No.6

Math

1507±14🥉 No.3

Arena by Occupation

Creative Writing

1487±8🥉 No.3

Social Sciences

1510±9No.4

Media

1478±9No.4

Business

1484±8No.9

Healthcare

1508±14No.10

Legal

1504±13No.5

Software

1518±7No.6

Mathematics

1497±16No.12

Source:Arena Intelligence

Overall

AA Intelligence Index

57%↑18%

LiveBench

81%↑20%

ForecastBench

60%↑1%

Reasoning & Math

GPQA Diamond

94%↑12%

HLE

45%↑27%

LB Reasoning

84%↑15%

LB Math

91%↑17%

LB Data

79%↑25%

Coding

AA Coding Index

56%↑19%

LB Coding

76%↑4%

LB Agentic

65%↑20%

TAU2

96%↑15%

TerminalBench

54%↑20%

SciCode

59%↑17%

Language & Instructions

IFBench

77%↑14%

AA-LCR

73%↑11%

Hallucination (HHEM)

10%↑0%

Factual (HHEM)

90%↑0%

LB Language

85%↑13%

LB IF

79%↑28%

Output Speed

Standard Mode

120tok/s↑43

First Output 24.16s

Source:Artificial Analysis LiveBench ForecastBench Vectara HHEM

Multilingual Capabilities

MGSM 🇰🇷

94%

MGSM 🇯🇵

94%

KMMLU 🇰🇷

82%

JMMLU 🇯🇵

82%

Google