# New NVIDIA Nemotron 3 Super Delivers 5x Higher Throughput for Agentic AI

_Friday, June 26, 2026 at 6:39 PM EDT · AI · Latest · Tier 2 — Notable_

![New NVIDIA Nemotron 3 Super Delivers 5x Higher Throughput for Agentic AI — Primary](https://blogs.nvidia.com/wp-content/uploads/2026/03/nemotron-3-super-1920x1080-1.jpg)

NVIDIA launched Nemotron 3 Super, a 120 billion parameter open model with 12 billion active parameters. The model is designed to run complex agentic AI systems at scale and combines advanced reasoning capabilities to efficiently complete tasks with high accuracy for autonomous agents.

NVIDIA said the hybrid mixture of experts architecture delivers up to 5x higher throughput and up to 2x higher accuracy than the previous Nemotron Super model. Mamba layers provide 4x higher memory and compute efficiency. Multi token prediction allows the model to predict multiple future words simultaneously for 3x faster inference.

The model has a 1 million token context window that enables agents to retain full workflow state in memory. This helps prevent goal drift during long tasks. On the NVIDIA Blackwell platform the model runs in NVFP4 precision and achieves up to 4x faster inference than FP8 on NVIDIA Hopper with no loss in accuracy.

NVIDIA is releasing the model with open weights under a permissive license. The company is publishing training datasets totaling over 10 trillion tokens along with 15 reinforcement learning environments and evaluation recipes. Developers can access Nemotron 3 Super at build.nvidia.com, Perplexity, OpenRouter and Hugging Face.

## Sources

- [NVIDIA Blog](https://blogs.nvidia.com/blog/nemotron-3-super-agentic-ai/)

---
Canonical: https://techandbusiness.org/newswire/WMYow9Ig064KslncDO0c9Q
Retrieved: 2026-06-27T03:03:19.225Z
Publisher: Tech & Business (techandbusiness.org)
