Back to Modelsby NVIDIA
N
Nemotron Mini 4B
NemotronOptimized SLM for on-device inference and fine-tuned for roleplay, RAG and function calling.
Free EndpointAug 26, 2024open license2M API calls / 30d
Specifications
Context Length
8K
8,192 tokens
Input Price
Free
per 1M tokens
Output Price
Free
per 1M tokens
Modalities
1
text→text
Capabilities
text generationchatfunction callingroleplay
Tags
chatroleplayragsmall-model
API Usage
1Router is fully OpenAI-compatible. Just set the base URL and use this model ID:
nvidia/nemotron-mini-4b-instructcURL
curl https://api.1router.com/v1/chat/completions \
-H "Authorization: Bearer $ROUTER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "nvidia/nemotron-mini-4b-instruct",
"messages": [{"role": "user", "content": "Hello!"}]
}'