Initial model card — Nemotron family
Browse files
README.md
ADDED
|
@@ -0,0 +1,54 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: other
|
| 3 |
+
license_name: nvidia-open-model-license
|
| 4 |
+
tags:
|
| 5 |
+
- ava
|
| 6 |
+
- voiceos
|
| 7 |
+
- nemotron
|
| 8 |
+
- nvidia
|
| 9 |
+
- on-device
|
| 10 |
+
- mixture-of-experts
|
| 11 |
+
base_model:
|
| 12 |
+
- nvidia/NVIDIA-Nemotron-3-Nano-4B
|
| 13 |
+
- nvidia/NVIDIA-Nemotron-Nano-9B-v2
|
| 14 |
+
- nvidia/Nemotron-3-Nano-30B-A3B
|
| 15 |
+
- nvidia/NVIDIA-Nemotron-3-Super-120B-A12B
|
| 16 |
+
- nvidia/Nemotron-Cascade-2-30B-A3B
|
| 17 |
+
- nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
|
| 18 |
+
---
|
| 19 |
+
|
| 20 |
+
# AVA Nautilus Models (Nemotron)
|
| 21 |
+
|
| 22 |
+
On-device LLM models for VoiceOS, based on NVIDIA Nemotron.
|
| 23 |
+
|
| 24 |
+
## Models
|
| 25 |
+
|
| 26 |
+
| AVA ID | Source | Active Params | Format | Target |
|
| 27 |
+
|--------|--------|--------------|--------|--------|
|
| 28 |
+
| AVA-NAUTILUS-4B | Nemotron-3-Nano-4B | 4B | safetensors + GGUF | Phone |
|
| 29 |
+
| AVA-NAUTILUS-9B | Nemotron-Nano-9B-v2 | 9B | safetensors | Desktop / tablet |
|
| 30 |
+
| AVA-NAUTILUS-30B-A3B | Nemotron-3-Nano-30B | 3B active (MoE) | safetensors + GGUF | Phone (flagship) |
|
| 31 |
+
| AVA-NAUTILUS-120B-A12B | Nemotron-3-Super-120B | 12B active (MoE) | safetensors | Desktop |
|
| 32 |
+
| AVA-NAUTILUS-CASCADE | Nemotron-Cascade-2-30B-A3B | 3B active | safetensors + GGUF | Phone (reasoning) |
|
| 33 |
+
| AVA-NAUTILUS-VL-8B | Nemotron-Nano-VL-8B | 8B | safetensors | Vision tasks |
|
| 34 |
+
|
| 35 |
+
## Directory Structure
|
| 36 |
+
|
| 37 |
+
```
|
| 38 |
+
raw/ # Base model weights (safetensors)
|
| 39 |
+
nemotron-3-nano-4b/
|
| 40 |
+
nemotron-nano-9b-v2/
|
| 41 |
+
nemotron-3-nano-30b-a3b/
|
| 42 |
+
nemotron-3-super-120b-a12b/
|
| 43 |
+
nemotron-cascade-2-30b-a3b/
|
| 44 |
+
nemotron-nano-vl-8b/
|
| 45 |
+
gguf/ # Quantized for on-device inference
|
| 46 |
+
nemotron-3-nano-4b-Q4_K_M.gguf
|
| 47 |
+
nemotron-3-nano-30b-A3B-Q4_K_M.gguf
|
| 48 |
+
nemotron-cascade-2-30b-a3b-Q4_K_M.gguf
|
| 49 |
+
production/ # AON-encrypted (deployed to devices)
|
| 50 |
+
```
|
| 51 |
+
|
| 52 |
+
## License
|
| 53 |
+
|
| 54 |
+
Raw models: NVIDIA Open Model License. AON-encrypted production variants: Proprietary (Intelligent Devices LLC).
|