Nemotron Mini 4B

Nemotron

by NVIDIA

A small language model (SLM) optimized for on-device inference and fine-tuned for roleplay, RAG, and function calling.

Free Endpoint · Aug 26, 2024 · open license · 2M API calls / 30d

Specifications

Context Length: 8,192 tokens

Input Price: Free (per 1M tokens)

Output Price: Free (per 1M tokens)

Modalities: text→text

Capabilities

text generation, chat, function calling, roleplay

API Usage

1Router is fully OpenAI-compatible. Set the base URL to https://api.1router.com/v1 and use this model ID:

nvidia/nemotron-mini-4b-instruct

cURL

curl https://api.1router.com/v1/chat/completions \
  -H "Authorization: Bearer $ROUTER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nvidia/nemotron-mini-4b-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
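
Python

The same request can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the base URL, model ID, and headers are copied from the cURL command above, while the helper names `build_chat_request` and `send_chat` are hypothetical, and the response parsing assumes the usual OpenAI chat completions shape (`choices[0].message.content`), which this page does not show.

```python
import json
import urllib.request

# Both values are taken from the cURL example above.
BASE_URL = "https://api.1router.com/v1"
MODEL_ID = "nvidia/nemotron-mini-4b-instruct"


def build_chat_request(prompt, api_key, base_url=BASE_URL, model=MODEL_ID):
    """Build (but do not send) a POST request for the chat completions endpoint."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat(prompt, api_key):
    """Send the request and return the assistant's reply text.

    Assumption: the response body follows the OpenAI chat completions
    shape, so the reply is read from choices[0].message.content.
    """
    request = build_chat_request(prompt, api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

Calling `send_chat("Hello!", api_key)` with the key stored in `ROUTER_API_KEY` would issue the same request as the cURL command. Keeping request construction separate from sending makes the payload easy to inspect without touching the network.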
