NVIDIA

Nemotron 3 Super

Name: NVIDIA Nemotron 3 Super
Author: NVIDIA


2026-03-11


Nemotron 3 Super is NVIDIA's open hybrid Mamba-Transformer mixture-of-experts (MoE) model with 120 billion total parameters, activating just 12 billion of them for compute efficiency. Its hybrid architecture combines Mamba layers for sequence efficiency with Transformer layers for precision reasoning, delivering over 5× the throughput of its predecessor. With a native 1M-token context window and NVFP4 precision optimized for Blackwell GPUs, it scores 85.6% on PinchBench, the best among open models, making it well suited to complex multi-agent applications, software development, and agentic reasoning.

Reasoning | Open Model

Knowledge Cutoff

2026-02-01

The date this AI finished learning. It may not know about things that happened after this date.

Input → Output Format

The types of content this AI can receive, and what it can produce in return.

Context Memory

262K IN / 1M OUT

The maximum amount of text the AI can read and process in a single request. A larger number means it can handle longer documents or conversations.

Cost/1M Tokens

$0.09 IN / $0.45 OUT

The cost of using this AI directly in your own application. Shown in USD per 1 million units of text (tokens).
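As a rough illustration of how these prices translate into a per-request cost, the sketch below multiplies token counts by the listed rates. The function name and the example token counts are hypothetical, and it assumes the listed prices apply uniformly per 1 million tokens with no tiering, caching discounts, or minimum charges.

```python
# Prices from the listing above, in USD per 1 million tokens.
INPUT_PRICE_PER_M = 0.09
OUTPUT_PRICE_PER_M = 0.45


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes flat per-token pricing; real billing may differ.
    """
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 200,000 tokens in, 10,000 tokens out.
cost = estimate_cost(200_000, 10_000)
print(f"${cost:.4f}")  # prints $0.0225
```

Because output tokens cost five times as much as input tokens here, long generated answers can dominate the bill even when the prompt is much larger.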


Source: Official Docs

AI Performance Evaluation

Arena Overall Score

1361

±7

As of 2026-05-01

Overall Rank

No.151

7,409 Votes

Arena by Ability

Hard Prompts

1380 ±9 (No.149)

Expert Knowledge

1398 ±24 (No.127)

Instruction Following

1347 ±13 (No.154)

Conversation Memory

1349 ±17 (No.156)

Creative

1302 ±18 (No.182)

Coding

1409 ±14 (No.149)

Math

1379 ±25 (No.137)

Arena by Occupation

Creative Writing

1324 ±15 (No.168)

Social Sciences

1366 ±17 (No.163)

Media

1317 ±17 (No.160)

Business

1350 ±16 (No.164)

Healthcare

1350 ±26 (No.175)

Legal

1368 ±26 (No.158)

Software

1404 ±11 (No.146)

Mathematics

1398 ±27 (No.116)

Source: Arena Intelligence

Overall

AA Intelligence Index

36% (↓3%)

LiveBench

32% (↓29%)

Reasoning & Math

GPQA Diamond

80% (↓2%)

HLE

19% (↑2%)

LB Reasoning

34% (↓35%)

LB Math

36% (↓38%)

LB Data

21% (↓32%)

Coding

AA Coding Index

31% (↓5%)

LB Coding

54% (↓19%)

LB Agentic

23% (↓22%)

TAU2

68% (↓13%)

TerminalBench

29% (↓5%)

SciCode

36% (↓6%)

Language & Instructions

IFBench

72% (↑8%)

AA-LCR

60% (↓2%)

LB Language

30% (↓42%)

LB IF

28% (↓23%)

Output Speed

Standard Mode

80 tok/s (↑2)

First Output 1.88s

Reasoning Mode

189 tok/s (↑102)

First Output 11.59s
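To give a feel for what these figures mean in practice, the sketch below estimates the total time for a response as the first-output delay plus generation time at the listed speed. This is a simplification: it assumes "First Output" is the delay before the first token, that the listed speed holds steady for the whole response, and it ignores any additional reasoning tokens the model may emit in Reasoning Mode. The function name and the 1,000-token response length are hypothetical.

```python
def estimate_response_seconds(output_tokens: int,
                              first_output_s: float,
                              tokens_per_s: float) -> float:
    """Rough total response time (hypothetical helper).

    Assumes a fixed delay before the first token, then a constant
    generation speed for the remaining output.
    """
    return first_output_s + output_tokens / tokens_per_s


# Figures from the listing above; 1,000 output tokens is an assumed example.
standard = estimate_response_seconds(1000, first_output_s=1.88, tokens_per_s=80)
reasoning = estimate_response_seconds(1000, first_output_s=11.59, tokens_per_s=189)

print(f"Standard Mode:  {standard:.2f} s")   # 1.88 + 12.50 = 14.38 s
print(f"Reasoning Mode: {reasoning:.2f} s")  # 11.59 + 5.29 = 16.88 s
```

Under these assumptions, the higher tok/s figure in Reasoning Mode only partly offsets its longer wait for the first output on a response of this length.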

Source: Artificial Analysis, LiveBench, OpenRouter
