Back to Models
N

Nemotron Mini 4B

Nemotron
by NVIDIA

Optimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling.

Free EndpointAug 26, 2024open license2M API calls / 30d

Specifications

Context Length
8K
8,192 tokens
Input Price
Free
per 1M tokens
Output Price
Free
per 1M tokens
Modalities
1
text→text

Capabilities

text generationchatfunction callingroleplay

Tags

chatroleplayragsmall-model

API Usage

1Router is fully OpenAI-compatible. Just set the base URL and use this model ID:

nvidia/nemotron-mini-4b-instruct
cURL
curl https://api.1router.com/v1/chat/completions \
  -H "Authorization: Bearer $ROUTER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "nvidia/nemotron-mini-4b-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'