N

NVIDIA

CosmosNVIDIANemotronRiva

Models

34

Max Context

1M

Free Endpoints

34

Downloadable

22

All Models (34)

Nemotron 3 Ultra 550B

Nemotron

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more.

Nemotron 3 Super 120B

Nemotron

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more.

Nemotron 3 Nano 30B

Nemotron

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more.

Nemotron 3 Nano Omni 30B

Nemotron

Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, text.

Nemotron 3 Content Safety

Nemotron

Multilingual, multimodal model for detecting unsafe and toxic content.

Nemotron 3.5 Content Safety

Nemotron

Multilingual, multimodal model for detecting unsafe and toxic content.

Nemotron Content Safety Reasoning 4B

Nemotron

A context-aware safety model that applies reasoning to enforce domain-specific policies.

Llama 3.1 Nemotron Safety Guard 8B v3

Nemotron

Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs.

Nemotron VoiceChat

Nemotron

Nemotron 3 Voicechat — real-time voice conversation AI.

Nemotron Mini 4B

Nemotron

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling.

Nemotron Nano 9B v2

Nemotron

High-efficiency LLM with hybrid Transformer-Mamba design, excelling in reasoning and agentic tasks.

Nemotron Nano 12B v2 VL

Nemotron

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

Llama 3.3 Nemotron Super 49B v1

Nemotron

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Llama 3.3 Nemotron Super 49B v1.5

Nemotron

High efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

Llama 3.1 Nemotron Nano 8B v1

Nemotron

Leading reasoning and agentic AI accuracy model for PC and edge.

Llama 3.1 Nemotron Nano VL 8B v1

Nemotron

Multi-modal vision-language model that understands text/img and creates informative responses.

Cosmos 3 Nano

Cosmos

Generates physics-aware videos from text prompts or an image prompt for physical AI development.

Cosmos 3 Nano Reasoner

Cosmos

Vision language model that excels in understanding the physical world using structured reasoning on videos or images.

Cosmos Transfer 2.5 2B

Cosmos

Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.

Cosmos Transfer 1 7B

Cosmos

Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.

Synthetic Video Detector

NVIDIA

NVIDIA Synthetic Video Detector is an AI-powered micro-service for detecting AI-generated (synthetic) videos.

Active Speaker Detection

NVIDIA

Detect and track speaker identities across video frames.

Ising Calibration 1 35B

NVIDIA

Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.

GLiNER PII

NVIDIA

GLiNER PII detects Personally Identifiable Information in text.

StreamPETR

NVIDIA

StreamPETR offers efficient 3D object detection for autonomous driving by propagating sparse object queries temporally.

SparseDrive

NVIDIA

End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.

BEVFormer

NVIDIA

Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.

Riva Translate 4B

Riva

Translation model in 12 languages with few-shots example prompts capability.

Magpie TTS Zero-shot

NVIDIA

Expressive and engaging text-to-speech, generated from a short audio sample.

Background Noise Removal

NVIDIA

Removes unwanted noises from audio improving speech intelligibility.

Studio Voice

NVIDIA

Enhance input speech recorded with low-quality microphones in noisy or reverberant environments, producing studio-quality speech.

NV-Embed v1

NVIDIA

Generates high-quality numerical embeddings from text inputs.

NV-EmbedCode 7B

NVIDIA

The NV-EmbedCode model is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

Rerank QA Mistral 4B

NVIDIA

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.