1 / 3
Swipe to compare

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding tasks. Its 1M context window supports complete documents, extended conversations, and complex task contexts in a single pass, making it ideal for integration with agent frameworks where strong reasoning, rich perception, and cost efficiency all matter.

Author
XiaomiXiaomi
Release Date
2026-04-22
Knowledge Cutoff
License
Proprietary
I/O Format
Context Length
1.0M / 131K
API I/O (1M)
$0.4 / $2
How to Use
Output Speed
60 tok/s
Arena Overall
1424
Intelligence Index
53.8
Coding Index
45.5
Math Index
LiveBench
ForecastBench
GPQA Diamond
86.6%
HLE
33.8%
MMLU-Pro
AIME 2025
MATH-500
LB Reasoning
LB Math
LB Data Analysis
LiveCodeBench
LB Coding
LB Agentic
TAU2
94.2%
TerminalBench
43.2%
SciCode
50.2%
IFBench
79.9%
AA-LCR
0.7
Hallucination (HHEM)
Factual Consistency (HHEM)
LB Language
LB Instruction Following