NVIDIA

Cosmos | NVIDIA | Nemotron | Riva
Models: 34
Max Context: 1M
Free Endpoints: 34
Downloadable: 22

All Models (34)

Nemotron 3 Ultra 550B

Nemotron

Free

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more.

1M context | Free in/M | Free out/M

Nemotron 3 Super 120B

Nemotron

Free

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more.

1M context | Free in/M | Free out/M

Nemotron 3 Nano 30B

Nemotron

Free

Open, efficient MoE model with 1M context, excelling in coding, reasoning, instruction following, tool calling, and more.

1M context | Free in/M | Free out/M

Nemotron 3 Nano Omni 30B

Nemotron

Free

Nemotron 3 Nano Omni is an omni-modal reasoning model that understands images, video, speech, and text.

131K context | Free in/M | Free out/M

Nemotron 3 Content Safety

Nemotron

Free

Multilingual, multimodal model for detecting unsafe and toxic content.

8K context | Free in/M | Free out/M

Nemotron 3.5 Content Safety

Nemotron

Free

Multilingual, multimodal model for detecting unsafe and toxic content.

8K context | Free in/M | Free out/M

Nemotron Content Safety Reasoning 4B

Nemotron

Free

A context-aware safety model that applies reasoning to enforce domain-specific policies.

8K context | Free in/M | Free out/M

Llama 3.1 Nemotron Safety Guard 8B v3

Nemotron

Free

Leading multilingual content safety model for enhancing the safety and moderation capabilities of LLMs.

8K context | Free in/M | Free out/M

Nemotron VoiceChat

Nemotron

Free

Nemotron 3 VoiceChat: real-time voice conversation AI.

8K context | Free in/M | Free out/M

Nemotron Mini 4B

Nemotron

Free

Small language model (SLM) optimized for on-device inference and fine-tuned for roleplay, RAG, and function calling.

8K context | Free in/M | Free out/M

Nemotron Nano 9B v2

Nemotron

Free

High-efficiency LLM with hybrid Transformer-Mamba design, excelling in reasoning and agentic tasks.

131K context | Free in/M | Free out/M

Nemotron Nano 12B v2 VL

Nemotron

Free

Nemotron Nano 12B v2 VL enables multi-image and video understanding, along with visual Q&A and summarization capabilities.

131K context | Free in/M | Free out/M

Llama 3.3 Nemotron Super 49B v1

Nemotron

Free

High-efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

131K context | Free in/M | Free out/M

Llama 3.3 Nemotron Super 49B v1.5

Nemotron

Free

High-efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.

131K context | Free in/M | Free out/M

Llama 3.1 Nemotron Nano 8B v1

Nemotron

Free

Model with leading reasoning and agentic AI accuracy for PC and edge.

131K context | Free in/M | Free out/M

Llama 3.1 Nemotron Nano VL 8B v1

Nemotron

Free

Multimodal vision-language model that understands text and images and generates informative responses.

131K context | Free in/M | Free out/M

Cosmos 3 Nano

Cosmos

Free

Generates physics-aware videos from text prompts or an image prompt for physical AI development.

8K context | Free in/M | Free out/M

Cosmos 3 Nano Reasoner

Cosmos

Free

Vision language model that excels in understanding the physical world using structured reasoning on videos or images.

131K context | Free in/M | Free out/M

Cosmos Transfer 2.5 2B

Cosmos

Free

Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.

8K context | Free in/M | Free out/M

Cosmos Transfer 1 7B

Cosmos

Free

Generates physics-aware video world states for physical AI development using text prompts and multiple spatial control inputs derived from real-world data or simulation.

8K context | Free in/M | Free out/M

Synthetic Video Detector

NVIDIA

Free

NVIDIA Synthetic Video Detector is an AI-powered microservice for detecting AI-generated (synthetic) videos.

8K context | Free in/M | Free out/M

Active Speaker Detection

NVIDIA

Free

Detect and track speaker identities across video frames.

8K context | Free in/M | Free out/M

Ising Calibration 1 35B

NVIDIA

Free

Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.

8K context | Free in/M | Free out/M

GLiNER PII

NVIDIA

Free

GLiNER PII detects Personally Identifiable Information in text.

8K context | Free in/M | Free out/M

StreamPETR

NVIDIA

Free

StreamPETR offers efficient 3D object detection for autonomous driving by propagating sparse object queries temporally.

8K context | Free in/M | Free out/M

SparseDrive

NVIDIA

Free

End-to-end autonomous driving stack integrating perception, prediction, and planning with sparse scene representations for efficiency and safety.

8K context | Free in/M | Free out/M

BEVFormer

NVIDIA

Free

Advanced transformer for multi-frame bird's-eye-view 3D perception in autonomous driving.

8K context | Free in/M | Free out/M

Riva Translate 4B

Riva

Free

Translation model covering 12 languages, with support for few-shot example prompts.

8K context | Free in/M | Free out/M

Magpie TTS Zero-shot

NVIDIA

Free

Expressive and engaging text-to-speech, generated from a short audio sample.

8K context | Free in/M | Free out/M

Background Noise Removal

NVIDIA

Free

Removes unwanted noise from audio, improving speech intelligibility.

8K context | Free in/M | Free out/M

Studio Voice

NVIDIA

Free

Enhances speech recorded with low-quality microphones in noisy or reverberant environments, producing studio-quality output.

8K context | Free in/M | Free out/M

NV-Embed v1

NVIDIA

Free

Generates high-quality numerical embeddings from text inputs.

8K context | Free in/M | Free out/M

NV-EmbedCode 7B

NVIDIA

Free

NV-EmbedCode is a 7B Mistral-based embedding model optimized for code retrieval, supporting text, code, and hybrid queries.

8K context | Free in/M | Free out/M

Rerank QA Mistral 4B

NVIDIA

Free

GPU-accelerated model that scores the probability that a given passage contains the information needed to answer a question.

8K context | Free in/M | Free out/M