You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
NVIDIA Nemotron™ is a family of open models with open weights, training data, and recipes, delivering leading efficiency and accuracy for building specialized AI agents. All models are released under permissive licenses and weights are available on Hugging Face.
Key properties across the family:
Open weights + open training data — weights, datasets, and recipes are all publicly available on Hugging Face
Agentic-first design — reasoning, tool-calling, multi-turn, RAG, and safety models designed to work together
Transparent training — technical reports and reproducibility scripts are released alongside every model
Deployable anywhere — via vLLM, SGLang, Ollama, llama.cpp, or as NVIDIA NIM™ microservices
"Open innovation is the foundation of AI progress." — Jensen Huang, NVIDIA CEO
📊 Model Generations at a Glance
Generation
Key Models
Architecture
Context
Released
Nemotron 3
Nano, Super, Ultra
Hybrid Mamba-Transformer MoE
1M tokens
Dec 2025
Nemotron Nano V2
9B-v2
Hybrid Transformer-Mamba
128K tokens
2025
Llama Nemotron
Nano 4B/8B, Super 49B, Ultra 253B
Dense Transformer (Llama-based)
128K tokens
2025
Nemotron 4
340B Base/Instruct/Reward
Dense Transformer
4K tokens
Jun 2024
Nemotron 3 (Enterprise)
8B Base/Chat/QA
Dense Transformer
4K tokens
Feb 2024
🤖 Models
Nemotron 3 (Gen 3 — Latest)
Announced December 15, 2025. Trained from scratch by NVIDIA with a hybrid Mamba-Transformer Mixture-of-Experts (MoE) architecture, 1M-token native context, and multi-environment reinforcement learning via NeMo Gym. Pre-training data cutoff: June 25, 2025. 10.6 trillion tokens total (3.5T synthetic).
Model
Params (Total / Active)
Precision
Description
Links
NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
30B / 3.2B
FP8
Final quantized post-trained Nano — fastest inference, lowest cost
Hybrid Transformer-Mamba architecture for edge and single-GPU deployments. Features a configurable thinking budget — dial accuracy, throughput, and cost at inference time.
Model
Params
Description
Links
NVIDIA-Nemotron-Nano-9B-v2
9B
Up to 6× faster throughput vs leading 8B open models; up to 60% lower token generation with thinking budget control
Built on Meta Llama base models and post-trained with NVIDIA's alignment techniques (RPO, REINFORCE, multi-phase SFT + RL). All models support reasoning ON/OFF via system prompt. Recommended: temperature=0.6, top_p=0.95 for reasoning ON; greedy decoding for reasoning OFF.
Model
Base
Params
Context
Description
Links
Llama-3.1-Nemotron-Nano-4B-v1.1
Llama 3.1 Minitron 4B
4B
128K
Fits on a single RTX GPU; multi-language; SFT + RPO post-trained
Minitron models are obtained by pruning NVIDIA's larger Nemotron-4 models and distilling with knowledge distillation. Offer large model quality at SLM inference cost.
Released June 2024. Designed primarily for synthetic data generation (SDG). Trained on 9 trillion tokens. 98% of alignment data is synthetically generated.
Model
Description
Links
Nemotron-4-340B-Base
Base model; competitive with Llama-3 70B, Mixtral 8x22B on commonsense reasoning
Released February 2024. Optimized for building production-ready enterprise AI apps via NeMo Framework. Trained on 3.8 trillion tokens across 53 languages and 37 programming languages.
NVIDIA has released one of the largest open collections of synthetic data for agentic AI — over 10 trillion tokens spanning pre-training, post-training, personas, safety, RL, and RAG.
Dataset
Size
Description
Links
Nemotron-CC
6T+ tokens
Curated Common Crawl; 15 languages; deduplication + quality filtering pipeline
NVIDIA and HuggingFace have a close collaboration — all Nemotron model weights, datasets, and demos live on the HuggingFace Hub under the nvidia organization.
Autonomous web research, synthesis, and report generation
Llama Nemotron Super 49B v1.5
Code Generation & Review
Automated code completion, security review, and documentation
Nemotron 3 Nano / Super
Customer Support Automation
Multi-turn conversational agents with RAG over enterprise knowledge bases
Llama Nemotron Nano 8B
Document Intelligence
Structured extraction from PDFs, tables, and complex documents
Nemotron Nano VL 12B + Nemotron Parse
Medical Transcription
Clinical note generation from doctor-patient conversations
Parakeet ASR + Nemotron LLM
Legal Document Analysis
Contract review, clause extraction, and compliance checking
Nemotron Ultra 253B
Financial Report Generation
Automated synthesis of quarterly earnings and market analysis
Nemotron Super 49B
Software Testing
Automated test case generation, bug triage, and issue summarization
Nemotron 3 Nano
Data Synthesis & AI Training
Use Case
Description
Model Used
Synthetic Data Generation
Generate high-quality instruction data for fine-tuning downstream models
Nemotron-4-340B-Instruct
RLHF Pipeline
Human preference alignment at scale using synthetic comparisons
Nemotron-4-340B-Reward
Math Reasoning Data
Large-scale synthetic math problems and solutions
OpenMathInstruct-2 + Nemotron
Persona-based Data
Culturally-grounded synthetic user data for diverse training
Nemotron-Personas datasets
Edge & On-Device AI
Use Case
Description
Model Used
RTX PC Assistant
On-device personal assistant on NVIDIA RTX GPUs
Llama Nemotron Nano 4B/8B
Embedded Industrial AI
Real-time anomaly detection in manufacturing
Nemotron Nano 9B V2
Offline Voice Assistant
Privacy-preserving voice AI on local hardware
Parakeet + Nemotron Nano
🌍 Companies Using Nemotron Worldwide
NVIDIA Nemotron models are deployed across industries — from healthcare and finance to retail and manufacturing.
Technology & Cloud Providers
Company
Region
Use Case
Notes
Microsoft Azure
Global
Enterprise AI deployment on Azure
Azure AI Model Catalog includes Nemotron NIM
Oracle Cloud
Global
GPU Cloud + AI workloads
OCI Supercluster runs Nemotron training at scale
Dell Technologies
Global
On-premises enterprise AI
Dell AI Factory with NVIDIA NIM on PowerEdge servers
Lenovo
Global
Edge + hybrid AI
Lenovo AI solutions featuring Nemotron NIM
VMware / Broadcom
Global
Private cloud AI
VMware Private AI Foundation with NVIDIA
Enterprise Software
Company
Region
Use Case
Notes
SAP
Germany / Global
Business AI, ERP automation
NVIDIA AI integrated into SAP Business AI
ServiceNow
USA / Global
IT workflows, enterprise agents
Now Platform AI powered by Nemotron via NIM
Salesforce
USA / Global
CRM AI, Einstein AI features
NVIDIA AI partner ecosystem
Adobe
USA / Global
Creative AI, document intelligence
Firefly AI and document workflows
Siemens
Germany / Global
Industrial automation AI
Siemens Industrial Copilot on NVIDIA stack
Healthcare & Life Sciences
Company
Region
Use Case
Notes
Johnson & Johnson
USA
Clinical research automation
Drug discovery and clinical data analysis
Illumina
USA
Genomics AI
Genomic data analysis with NVIDIA BioNeMo
Mayo Clinic
USA
Medical AI
Clinical decision support with NVIDIA AI
Astrazeneca
UK
Drug discovery
NVIDIA Clara + Nemotron for protein analysis
PathAI
USA
Pathology AI
AI-powered pathology report generation
Finance & Insurance
Company
Region
Use Case
Notes
JPMorgan Chase
USA
Document AI, compliance
Financial document analysis and risk assessment
Deutsche Bank
Germany
Enterprise AI assistants
German-language financial AI using multilingual Nemotron
FinanceAI Partners
Global
Trading AI
Real-time market sentiment and report generation
Retail & E-Commerce
Company
Region
Use Case
Notes
Walmart
USA
Customer experience AI
Retail AI and inventory management
Accenture
Global
Enterprise AI solutions
Accenture AI Refinery with NVIDIA NIM
Startups & AI Companies
Company
Region
Description
Notes
Perplexity AI
USA
AI-powered search and research
Uses large Nemotron-family models via API
Cohere
Canada
Enterprise NLP platform
Partnered with NVIDIA for GPU + model distribution
Mistral AI
France
Open AI models
Collaboration on open model ecosystem with NVIDIA
DeepInfra
USA
Inference API
Hosts Nemotron Super 49B v1.5 for API access
OpenRouter
USA
Model routing API
Serves Nemotron Ultra 253B and Nano 9B V2 as free tier
together.ai
USA
AI cloud platform
Nemotron models via Together inference API
Replicate
USA
Model deployment platform
Community-maintained Nemotron model deployments
Anyscale
USA
Distributed AI serving
Nemotron models on Ray Serve
Government & Public Sector
Company / Agency
Region
Use Case
Notes
US Department of Defense
USA
Secure AI workloads
NVIDIA sovereign AI infrastructure
Saudi Aramco (KACST)
Saudi Arabia
Sovereign AI
Arabic-language Nemotron for national AI strategy
Tata Consultancy Services
India
Government AI modernization
AI services for Indian public sector using NVIDIA AI
Fujitsu
Japan
Government AI
Japanese-language Nemotron for public sector use
💡 Note: This list is compiled from publicly announced NVIDIA partnerships and integrations. If your company is using Nemotron and would like to be listed, please submit a PR.