NVIDIA
All Models (34)
Nemotron 3 Ultra 550B
Nemotron
Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more.
Nemotron 3 Super 120B
Nemotron
Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more.
Nemotron 3 Nano 30B
Nemotron
Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more.
Nemotron 3 Nano Omni 30B
Nemotron
Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, and text.
Nemotron 3 Content Safety
Nemotron
Multilingual, multimodal model for detecting unsafe and toxic content.
Nemotron 3.5 Content Safety
Nemotron
Multilingual, multimodal model for detecting unsafe and toxic content.
Nemotron Content Safety Reasoning 4B
Nemotron
A context-aware safety model that applies reasoning to enforce domain-specific policies.
Llama 3.1 Nemotron Safety Guard 8B v3
Nemotron
Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs.
Nemotron VoiceChat
Nemotron
Nemotron 3 VoiceChat is a model for real-time voice conversation.
Nemotron Mini 4B
Nemotron
SLM optimized for on-device inference and fine-tuned for roleplay, RAG, and function calling.
Nemotron Nano 9B v2
Nemotron
High-efficiency LLM with hybrid Transformer-Mamba design, excelling in reasoning and agentic tasks.
Nemotron Nano 12B v2 VL
Nemotron
Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.
Llama 3.3 Nemotron Super 49B v1
Nemotron
High-efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
Llama 3.3 Nemotron Super 49B v1.5
Nemotron
High-efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
Llama 3.1 Nemotron Nano 8B v1
Nemotron
Model with leading reasoning and agentic AI accuracy for PC and edge.
Llama 3.1 Nemotron Nano VL 8B v1
Nemotron
Multimodal vision-language model that understands text and images and generates informative responses.
Cosmos 3 Nano
Cosmos
Generates physics-aware videos from text prompts or an image prompt for physical AI development.
Cosmos 3 Nano Reasoner
Cosmos
Vision language model that excels in understanding the physical world using structured reasoning on videos or images.
Cosmos Transfer 2.5 2B
Cosmos
Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
Cosmos Transfer 1 7B
Cosmos
Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.
Synthetic Video Detector
NVIDIA
NVIDIA Synthetic Video Detector is an AI-powered microservice for detecting AI-generated (synthetic) videos.
Active Speaker Detection
NVIDIA
Detects and tracks speaker identities across video frames.
Ising Calibration 1 35B
NVIDIA
Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.
GLiNER PII
NVIDIA
GLiNER PII detects Personally Identifiable Information in text.
StreamPETR
NVIDIA
StreamPETR offers efficient 3D object detection for autonomous driving by propagating sparse object queries temporally.
SparseDrive
NVIDIA
End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.
BEVFormer
NVIDIA
Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.
Riva Translate 4B
Riva
Translation model covering 12 languages, with support for few-shot example prompts.
Magpie TTS Zero-shot
NVIDIA
Expressive and engaging text-to-speech, generated from a short audio sample.
Background Noise Removal
NVIDIA
Removes unwanted noise from audio, improving speech intelligibility.
Studio Voice
NVIDIA
Enhances input speech recorded with low-quality microphones in noisy or reverberant environments, producing studio-quality speech.
NV-Embed v1
NVIDIA
Generates high-quality numerical embeddings from text inputs.
NV-EmbedCode 7B
NVIDIA
The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.
Rerank QA Mistral 4B
NVIDIA
GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.