Gemini 3.1 Flash Lite is Google's high-efficiency model optimized for cost-sensitive, high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities including audio input, RAG snippet ranking, translation, data extraction, and code completion. It supports full thinking levels (minimal/low/medium/high) for fine-grained cost-performance trade-offs, and is priced at half the cost of Gemini 3 Flash.
API|VisionReasoningWeb SearchFile|Proprietary Model
Knowledge Cutoff
2025-01-31
Input → Output Format
Context Memory
1.0MIN66KOUT
AI Performance Evaluation
Arena Overall Score
1439
±5As of 2026-05-01
Overall Rank
No.53
22,387 Votes
Arena by Ability
Hard Prompts
1448±6No.65
Expert Knowledge
1448±14No.67
Instruction Following
1412±8No.78
Conversation Memory
1447±9No.53
Creative
1420±11No.46
Coding
1461±8No.86
Math
1437±15No.49
Arena by Occupation
Creative Writing
1426±9No.46
Social Sciences
1461±10No.46
Media
1413±10No.49
Business
1433±9No.60
Healthcare
1465±15No.55
Legal
1444±14No.63
Software
1460±7No.68
Mathematics
1432±17No.69
Source:Arena Intelligence
Overall
AA Intelligence Index
34%↓6%
LiveBench
62%↑1%
Reasoning & Math
GPQA Diamond
82%↑0%
HLE
16%↓1%
LB Reasoning
60%↓9%
LB Math
74%↓1%
LB Data
55%↑2%
Coding
AA Coding Index
30%↓6%
LB Coding
69%↓4%
LB Agentic
33%↓12%
TAU2
31%↓49%
TerminalBench
24%↓10%
SciCode
42%↑0%
Language & Instructions
IFBench
77%↑14%
AA-LCR
65%↑3%
Hallucination (HHEM)
8.2%↓2%
Factual (HHEM)
92%↑2%
LB Language
73%↑1%
LB IF
69%↑18%
Output Speed
Standard Mode
315tok/s↑238
First Output 4.91s
