# NVIDIA AI-Q deep research agent reaches #1 on DeepResearch Bench I and II using Nemotron models

_Friday, June 26, 2026 at 8:04 PM EDT · AI · Latest · Tier 2 — Notable_

![NVIDIA AI-Q deep research agent reaches #1 on DeepResearch Bench I and II using Nemotron models — Primary](https://cdn-uploads.huggingface.co/production/uploads/6972e0ac2ef5ed3b506c731f/0Pj1joFkWDX6nrQrZ-Yt8.png)

NVIDIA's AI-Q deep research agent took first place on DeepResearch Bench with a score of 55.95 and also led DeepResearch Bench II with a score of 54.50. Both benchmarks assess research agents on report quality, information recall, analysis and presentation.

The agent uses a multi-agent architecture with planner, researcher and orchestrator components. It is built on the NVIDIA NeMo Agent Toolkit and runs on fine-tuned NVIDIA Nemotron 3 Super models. Specialist subagents work in parallel on evidence gathering, causal exploration, benchmarking, critique and trend scanning.
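The parallel fan-out to specialist subagents can be illustrated with a minimal Python sketch. This is not NVIDIA's code and does not use the NeMo Agent Toolkit API. The names `SPECIALISTS`, `run_specialist`, `research` and `run` are hypothetical, and each subagent is a stub that returns a placeholder brief instead of calling a model or a search tool.

```python
import asyncio

# Hypothetical role names, taken from the tasks the article lists.
SPECIALISTS = [
    "evidence gathering",
    "causal exploration",
    "benchmarking",
    "critique",
    "trend scanning",
]


async def run_specialist(role: str, question: str) -> dict:
    """Stub subagent; a real one would call a model and search tools here."""
    await asyncio.sleep(0)  # placeholder for I/O-bound work
    return {"role": role, "brief": f"[{role}] notes on: {question}"}


async def research(question: str) -> list:
    """Run every specialist concurrently and collect the briefs in role order."""
    tasks = [run_specialist(role, question) for role in SPECIALISTS]
    return await asyncio.gather(*tasks)


def run(question: str) -> list:
    """Synchronous entry point for the sketch."""
    return asyncio.run(research(question))


if __name__ == "__main__":
    for item in run("How do deep research agents cite sources?"):
        print(item["brief"])
```

`asyncio.gather` returns results in the order the tasks were passed in, so an orchestrator built this way could match each brief to its specialist without extra bookkeeping.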

Training used about 67,000 trajectories drawn from open datasets such as OpenScholar, ResearchQA and Fathom-DeepResearch-SFT. An NVIDIA Nemotron-3-Super-120B-A12B model received supervised fine-tuning for one epoch across 16 × 8 NVIDIA H100 GPUs (128 in total). To generate citation-backed reports, the pipeline relies on Tavily for web search and Serper for academic papers.

An optional ensemble merges outputs from parallel agents. A report refiner step can draw on raw researcher briefs to improve the final document.

## Sources

- [NVIDIA](https://huggingface.co/blog/nvidia/how-nvidia-won-deepresearch-bench)

---
Canonical: https://techandbusiness.org/newswire/X0O85GNlLhBSz1ObTpxI1V
Retrieved: 2026-06-29T09:43:14.407Z
Publisher: Tech & Business (techandbusiness.org)
