LongCat Flash Chat is a large-scale Mixture-of-Experts model from Meituan with 560 billion total parameters, dynamically activating 18.6B to 31.3B (averaging ~27B) based on contextual demands. Its shortcut-connected MoE design achieves over 100 tokens per second during inference while supporting a 128K-token context window. The model delivers highly competitive performance in reasoning, coding, and instruction following, with exceptional strengths in agentic tasks and complex multi-step tool-use interactions.
Open ModelMIT
Knowledge Cutoff
2025-03-31
Input → Output Format
Context Memory
131KIN131KOUT
AI Performance Evaluation
Arena Overall Score
1434
±6As of 2026-05-01
Overall Rank
No.59
12,004 Votes
Arena by Ability
Hard Prompts
1458±7No.56
Expert Knowledge
1465±18No.48
Instruction Following
1412±10No.77
Conversation Memory
1419±13No.85
Creative
1390±14No.81
Coding
1497±11No.42
Math
1433±20No.55
Arena by Occupation
Creative Writing
1391±12No.92
Social Sciences
1454±13No.56
Media
1399±13No.64
Business
1433±12No.59
Healthcare
1467±21No.49
Legal
1429±20No.78
Software
1488±9No.41
Mathematics
1447±22No.47
Source:Arena Intelligence
Overall
AA Intelligence Index
24%↓15%
Reasoning & Math
GPQA Diamond
64%↓19%
HLE
6.0%↓12%
Coding
AA Coding Index
17%↓20%
TAU2
80%↓1%
TerminalBench
11%↓23%
SciCode
28%↓13%
Language & Instructions
IFBench
43%↓20%
AA-LCR
26%↓36%
Output Speed
Standard Mode
144tok/s↑66
First Output 4.24s
Source:Artificial Analysis