Who developed Nemotron 3 Super?

It was developed by NVIDIA as a large language model.

How can I access Nemotron 3 Super?

Access is provided through NVIDIA's AI platforms and enterprise offerings.

Is pricing information available for Nemotron 3 Super?

Pricing details are not publicly listed and require direct inquiry with NVIDIA.

What tasks is Nemotron 3 Super best suited for?

It is designed for workloads that benefit from very long context such as document understanding and large-scale code analysis.

Nemotron 3 Super

Verified

NVIDIA's closed LLM for million-token text processing.

NVIDIALanguage ModelsClosed

Function callingJSON modeStructured outputsReasoning

Model page

Updated 2026-06-14

About Nemotron 3 Super

NVIDIA engineered Nemotron 3 Super as a proprietary LLM with an expansive one-million-token context limit. This design enables the model to ingest and reason over entire books, codebases, or multi-hour transcripts in a single pass while preserving factual consistency.

Because the weights remain closed, deployment occurs through NVIDIA-controlled channels that emphasize security and compliance. The text-only modality focuses computational resources on language understanding and generation without multimodal overhead.

Organizations typically apply the model to legal discovery, technical research synthesis, and enterprise knowledge retrieval. Its scale suits scenarios where retaining full context across hundreds of thousands of tokens improves answer accuracy and reduces fragmentation.

Capabilities

Long-context reasoning

Text generation

Instruction following

Code synthesis

Document summarization

Technical question answering

How Nemotron 3 Super compares

Nemotron 3 Super (striped bar) vs other language models on intelligence, speed and price.

Price

USD per 1M output tokens · Lower is better · Nemotron 3 Super ranks #25 of 72

$0.35

Phi 4 Mini Instruct

$0.40

GLM 4.7 Flash

$0.40

Llama 3.3 Nemotron Super 49B V1.5

$0.40

Hermes 4 70B

$0.40

Qwen3 30B A3B Thinking 2507

$0.41

DeepSeek V3.2 Exp

$0.45

Nemotron 3 Super

$0.50

Cydonia 24B V4.1

$0.50

Olmo 3 32B Think

$0.60

Solar Pro 3

$0.63

Ring-2.6-1T

$0.63

Ling-2.6-1T

$0.75

Mercury 2

Sources: Artificial Analysis (intelligence, speed) · OpenRouter (price).

Best for

Long-Form Document Analysis

Processes and reasons over entire books, legal contracts, or research papers in a single pass thanks to its 1M-token context window.

Enterprise Codebase Understanding

Navigates and explains large multi-file code repositories while retaining full project context for refactoring or security reviews.

Extended Multi-Turn Research

Maintains coherent dialogue across dozens of iterative queries when exploring complex technical or scientific topics.

Strengths & limitations

Strengths

+Handles up to 1M token contexts
+NVIDIA-optimized inference efficiency
+Strong performance on technical domains
+Suitable for enterprise-scale text tasks

Limitations

–Text-only modality
–No native multimodal support
–Large context increases compute cost

Pricing by provider

Live per-provider pricing & uptime, routed via OpenRouter. Prices are USD per 1M tokens.

Provider	Input /1M	Output /1M	Context	Uptime
DekaLLM(fp8)	$0.09	$0.45	262K	98.2%
DeepInfra(bf16)	$0.10	$0.50	262K	96.7%
DigitalOcean	$0.30	$0.65	1000K	—
Nebius(fp4)	$0.30	$0.90	262K	—

Cost calculator

Estimate what Nemotron 3 Super would cost for your usage.

Input tokens / requestOutput tokens / requestRequests / month

$0.00032

per request

$3.15

estimated / month

Based on Nemotron 3 Super's $0.09/1M input · $0.45/1M output. Estimate only — actual cost varies by provider and caching.

Quick start

OpenRouter's API is OpenAI-compatible — most SDKs work by just swapping the base URL. Only the model slug changes between models.

JavaScript · openai

import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://openrouter.ai/api/v1",
  apiKey: process.env.OPENROUTER_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "nvidia/nemotron-3-super-120b-a12b",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(completion.choices[0].message.content);

Model slug: nvidia/nemotron-3-super-120b-a12b

Editor's verdict

Our take on Nemotron 3 Super

Nemotron 3 Super is NVIDIA's proprietary language models with a 1000K-token context window.

At $0.45 per 1M output tokens, it is very cost-efficient for its class, served by 4 providers.

It is available through NVIDIA's API and aggregators like OpenRouter.

Best suited to handles up to 1m token contexts and nvidia-optimized inference efficiency.

Did you find this helpful?

Frequently asked questions

The model supports a context window of 1,000,000 tokens.

User reviews

Real, verified reviews from the community shape this model's rating.

Loading reviews…

Other Nemotron models

Sibling versions in the Nemotron family from NVIDIA.

Nemotron 3 Ultra

NVIDIA · Language Models

Verified

NVIDIA's Nemotron 3 Ultra handles million-token text contexts with ease.

Closed1000K ctx$2.50/1M out

Nemotron 3 Nano 30B A3B

NVIDIA · Language Models

Verified

NVIDIA LLM built for long-context text understanding at scale.

Closed262K ctx$0.20/1M out

Promote Nemotron 3 Super

Add this badge to your website, or share the tool.

DFeatured on DhanasviNemotron 3 Super 2

Nemotron 3 Super

About Nemotron 3 Super

Capabilities