Gemini 3.1 Pro is Google's most advanced reasoning model, significantly outperforming Gemini 3 Pro across software engineering, agentic reliability, and token efficiency. It supports a 1M-token context window with multimodal inputs including text, images, video, audio, code, and PDFs, and introduces a new medium thinking level for better cost-speed-performance balance. The model excels at agentic coding, structured planning, financial modeling, spreadsheet automation, and high-context enterprise tasks requiring long-horizon stability and autonomous tool orchestration.
Google AI PlusGoogle AI ProGoogle AI UltraAPI|VisionReasoningWeb SearchFile|Proprietary Model
Knowledge Cutoff
2025-01-31
Input → Output Format
Context Memory
1.0MIN66KOUT
AI Performance Evaluation
Arena Overall Score
1493
±5As of 2026-05-01
Overall Rank
No.4
28,096 Votes
Arena by Ability
Hard Prompts
1513±6No.5
Expert Knowledge
1519±13No.7
Instruction Following
1488±7No.5
Conversation Memory
1501±9No.5
Creative
1490±10🥉 No.3
Coding
1529±8No.6
Math
1507±14🥉 No.3
Arena by Occupation
Creative Writing
1487±8🥉 No.3
Social Sciences
1510±9No.4
Media
1478±9No.4
Business
1484±8No.9
Healthcare
1508±14No.10
Legal
1504±13No.5
Software
1518±7No.6
Mathematics
1497±16No.12
Source:Arena Intelligence
Overall
AA Intelligence Index
57%↑18%
LiveBench
81%↑20%
ForecastBench
60%↑1%
Reasoning & Math
GPQA Diamond
94%↑12%
HLE
45%↑27%
LB Reasoning
84%↑15%
LB Math
91%↑17%
LB Data
79%↑25%
Coding
AA Coding Index
56%↑19%
LB Coding
76%↑4%
LB Agentic
65%↑20%
TAU2
96%↑15%
TerminalBench
54%↑20%
SciCode
59%↑17%
Language & Instructions
IFBench
77%↑14%
AA-LCR
73%↑11%
Hallucination (HHEM)
10%↑0%
Factual (HHEM)
90%↑0%
LB Language
85%↑13%
LB IF
79%↑28%
Output Speed
Standard Mode
120tok/s↑43
First Output 24.16s
Multilingual Capabilities
MGSM 🇰🇷
94%
MGSM 🇯🇵
94%
KMMLU 🇰🇷
82%
JMMLU 🇯🇵
82%
