Nemotron Nano 9B V2
Released by NVIDIA in 2025, Nemotron Nano 9B V2 is an chat model. NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...
by NVIDIA · 131K token context window
Best for
- low-latency chat and routing
- request routing and triage
- text classification
Ways to use Nemotron Nano 9B V2 in osFoundry
Connect with your own key (BYOK)
Open the key dialog and paste your NVIDIA API key. osFoundry discovers Nemotron Nano 9B V2 automatically — assign it to a Maestro role (router, direct, orchestrator, or fallback) in the Pipeline tab and it is live in every chat. Your key, your provider account — no token markup.
Use it in a Room App
Room Apps declare AI features in their manifest, then call them with invokeAI:
import { invokeAI } from '@osfoundry/app-sdk'
// 'summarize' is an AI feature declared in your app manifest.
const result = await invokeAI('summarize', userText)
Call it from your own apps
Once a model is wired into your workspace you can host it as an API and reach it from your own services, scripts, or CI — outside osFoundry.
Run Nemotron Nano 9B V2 yourself
Nemotron Nano 9B V2 is also available as open weights — self-host it for full data control and no per-token cost. See that page for GPU requirements and a cost comparison against API pricing.
Nemotron Nano 9B V2 vs similar models
| Model | Org | Params | Context | Input price | Self-host |
|---|
| Nemotron Nano 9B V2 | NVIDIA | — | 131K | $ 0.040 /1M | API only |
| Kimi K2 0905 | MoonshotAI | — | 262K | $ 0.600 /1M | API only |
| Qwen Plus 0728 | Qwen | — | 1000K | $ 0.260 /1M | API only |
| Grok Code Fast 1 | xAI | — | 256K | $ 0.200 /1M | API only |
Licence
Hosted — usage subject to provider terms — Hosted-only model — usage governed by the provider's API terms. Bring your own provider key.
No weights distributed; usage subject to provider terms.
Frequently asked about Nemotron Nano 9B V2
How much does Nemotron Nano 9B V2 cost?
Nemotron Nano 9B V2 is metered at $ 0.040 /1M for input, and $ 0.160 /1M for output. Bring your own NVIDIA API key — osFoundry passes through provider pricing without markup.
Can I use Nemotron Nano 9B V2 commercially?
Commercial use is allowed with conditions. Hosted-only model — usage governed by the provider's API terms. Bring your own provider key. No weights distributed; usage subject to provider terms.
What is the context window of Nemotron Nano 9B V2?
Nemotron Nano 9B V2 supports a 131K token context window.
Can I run Nemotron Nano 9B V2 locally?
No — Nemotron Nano 9B V2 is hosted only and accessed via the NVIDIA API. An open-weights equivalent is available to self-host — see the cross-link above.
What is Nemotron Nano 9B V2 best at?
Nemotron Nano 9B V2 is well-suited to low-latency chat and routing, request routing and triage, text classification.
How do I use Nemotron Nano 9B V2 in osFoundry?
Paste your NVIDIA API key in the key dialog (or deploy the open weights for self-hostable models), assign Nemotron Nano 9B V2 to a Maestro role in the Pipeline tab, then use it in chat, Room Apps via invokeAI, or your own apps.
Published by NVIDIA on September 5, 2025. Source: https://openrouter.ai/nvidia/nemotron-nano-9b-v2