MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding tasks. Its 1M context window supports complete documents, extended conversations, and complex task contexts in a single pass, making it ideal for integration with agent frameworks where strong reasoning, rich perception, and cost efficiency all matter.
Reasoning | Proprietary Model
Knowledge Cutoff
Unknown
Input → Output Format
Context Memory
1.0M in / 131K out
AI Performance Evaluation
Arena Overall Score
1424 ±8
As of 2026-05-01
Overall Rank
No. 70
5,131 Votes
Arena by Ability
Hard Prompts
1454 ±10, No. 61
Expert Knowledge
1459 ±25, No. 53
Instruction Following
1429 ±14, No. 51
Conversation Memory
1444 ±20, No. 56
Creative
1388 ±21, No. 85
Coding
1489 ±16, No. 51
Math
1402 ±29, No. 104
Arena by Occupation
Creative Writing
1409 ±17, No. 63
Social Sciences
1437 ±20, No. 78
Media
1404 ±20, No. 57
Business
1433 ±19, No. 61
Healthcare
1436 ±32, No. 89
Legal
1390 ±32, No. 131
Software
1472 ±13, No. 57
Mathematics
1433 ±31, No. 67
Source: Arena Intelligence
Overall
AA Intelligence Index
54% ↑15%
Reasoning & Math
GPQA Diamond
87% ↑4%
HLE
34% ↑16%
Coding
AA Coding Index
46% ↑9%
TAU2
94% ↑14%
TerminalBench
43% ↑9%
SciCode
50% ↑8%
Language & Instructions
IFBench
80% ↑17%
AA-LCR
73% ↑11%
Output Speed
Standard Mode
60 tok/s ↓18
First Output: 1.98 s
Source: Artificial Analysis
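The two speed figures above (60 tok/s output speed and 1.98 s to first output) can be combined into a rough estimate of how long a response might take. The sketch below is a back-of-the-envelope illustration, not a measurement: it assumes a constant generation rate after the first output, and the function name `estimate_response_seconds` is hypothetical rather than part of any Xiaomi or Artificial Analysis API.

```python
def estimate_response_seconds(
    output_tokens: int,
    tokens_per_second: float = 60.0,     # Standard Mode output speed listed above
    first_output_seconds: float = 1.98,  # time to first output listed above
) -> float:
    """Rough wall-clock estimate: first-output delay plus streaming time.

    Assumes throughput stays constant for the whole response, which real
    deployments may not guarantee.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_output_seconds + output_tokens / tokens_per_second


if __name__ == "__main__":
    # A 600-token answer: 1.98 s + 600 / 60 s
    print(f"{estimate_response_seconds(600):.2f} s")  # prints "11.98 s"
```

Under the same assumption, an output near the 131K limit would take roughly 36 minutes of streaming, which is worth keeping in mind when budgeting timeouts for long agent runs.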