Arcee AI

Trinity Large Thinking

Name: Arcee AI Trinity Large Thinking
Author: Arcee AI


2026-04-01


Trinity Large Thinking is an open-source reasoning model from Arcee AI, built on a 398B-parameter sparse Mixture-of-Experts architecture that activates approximately 13B parameters per token. Post-trained with extended chain-of-thought reasoning and agentic reinforcement learning, it reports state-of-the-art results on agentic benchmarks, including τ²-Bench (94.7%) and PinchBench (91.9%). It is released under the Apache 2.0 license, offers frontier-level tool use and multi-turn conversation, and can be run fully locally or through a hosted API.
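To put the 398B total and roughly 13B active parameter figures in perspective, here is a minimal Python sketch of the arithmetic. The 2-bytes-per-parameter figure is an assumption (16-bit weights) and is not stated in the listing; the function names are illustrative only.

```python
# Parameter counts taken from the model description above.
TOTAL_PARAMS = 398e9
ACTIVE_PARAMS = 13e9

# Assumption: weights stored at 16-bit precision (2 bytes each).
# The listing does not say which precision the released weights use.
BYTES_PER_PARAM = 2


def active_fraction(total_params: float, active_params: float) -> float:
    """Share of the parameters used for any single token."""
    return active_params / total_params


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Approximate size of the weights alone, in decimal gigabytes."""
    return params * bytes_per_param / 1e9


print(f"Active per token: {active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS):.1%}")
print(f"Weights at 16-bit: {weight_memory_gb(TOTAL_PARAMS, BYTES_PER_PARAM):.0f} GB")
```

Under these assumptions, about 3.3% of the parameters are active per token, while the full set of weights comes to roughly 796 GB. A sparse Mixture-of-Experts model typically still needs all experts available at inference time, so the total size, not the active size, is what matters when sizing hardware for local use.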

Reasoning | Open Model | Apache 2.0

Knowledge Cutoff

2024

The date this AI finished learning. It may not know about things that happened after this date.

Input → Output Format

The types of content this AI can receive, and what it can produce in return.

Context Memory

262K in / 262K out

The maximum amount of text the AI can read and process in a single request. A larger number means it can handle longer documents or conversations.

Cost/1M Tokens

$0.22 in / $0.85 out

The cost of using this AI directly in your own application. Shown in USD per 1 million units of text (tokens).
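As a worked example of the pricing above, the following Python sketch estimates the cost of one request from its token counts. The rates are the listed $0.22 (input) and $0.85 (output) per 1 million tokens; the function name and the sample token counts are hypothetical.

```python
# Listed prices in USD per 1 million tokens.
INPUT_USD_PER_MILLION = 0.22
OUTPUT_USD_PER_MILLION = 0.85


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


# Hypothetical request: a 200,000-token prompt and a 20,000-token reply.
cost = estimate_cost_usd(200_000, 20_000)
print(f"${cost:.4f}")  # prints $0.0610
```

For a reasoning model, any chain-of-thought tokens that a provider bills as output would count toward `output_tokens`, so real costs can be higher than the visible reply suggests.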


AI Performance Evaluation

Arena Overall Score

1375 ±6 (as of 2026-04-07)

Overall Rank

No. 119 (12,625 votes)

Arena by Ability

Hard Prompts: 1400 ±7, No. 115
Expert Knowledge: 1414 ±20, No. 92
Instruction Following: 1372 ±10, No. 112
Conversation Memory: 1372 ±13, No. 121
Creative: 1357 ±14, No. 104
Coding: 1443 ±11, No. 92
Math: 1362 ±20, No. 136

Arena by Occupation

Creative Writing: 1358 ±11, No. 115
Social Sciences: 1402 ±14, No. 110
Media: 1355 ±13, No. 100
Business: 1385 ±13, No. 107
Healthcare: 1416 ±21, No. 99
Legal: 1401 ±21, No. 98
Software: 1425 ±9, No. 104
Mathematics: 1380 ±24, No. 120

Source: Arena Intelligence

Overall

AA Intelligence Index: 32% (↓7%)
LiveBench: 30% (↓30%)

Reasoning & Math

GPQA Diamond: 75% (↓7%)
HLE: 15% (↓3%)
LB Reasoning: 21% (↓48%)
LB Math: 45% (↓29%)
LB Data: 40% (↓13%)

Coding

AA Coding Index: 27% (↓9%)
LB Coding: 66% (↓7%)
LB Agentic: 3.3% (↓42%)
TAU2: 90% (↑10%)
TerminalBench: 23% (↓11%)
SciCode: 36% (↓6%)

Language & Instructions

IFBench: 56% (↓7%)
AA-LCR: 33% (↓29%)
Hallucination (HHEM): 6.9% (↓3%)
Factual (HHEM): 93% (↑3%)
LB Language: 42% (↓30%)
LB IF: 12% (↓39%)

Output Speed

Standard Mode: 118 tok/s (↑41)
First Output: 17.54 s

Source: Artificial Analysis, LiveBench, Vectara HHEM
