Committed by IDEAHQ
Commit bbf3d9b · verified · 1 Parent(s): 34f47f4

Initial model card — Nemotron family

Files changed (1): README.md +54 -0
README.md ADDED
@@ -0,0 +1,54 @@
+ ---
+ license: other
+ license_name: nvidia-open-model-license
+ tags:
+ - ava
+ - voiceos
+ - nemotron
+ - nvidia
+ - on-device
+ - mixture-of-experts
+ base_model:
+ - nvidia/NVIDIA-Nemotron-3-Nano-4B
+ - nvidia/NVIDIA-Nemotron-Nano-9B-v2
+ - nvidia/Nemotron-3-Nano-30B-A3B
+ - nvidia/NVIDIA-Nemotron-3-Super-120B-A12B
+ - nvidia/Nemotron-Cascade-2-30B-A3B
+ - nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
+ ---
+
+ # AVA Nautilus Models (Nemotron)
+
+ On-device LLMs for VoiceOS, based on NVIDIA Nemotron.
+
+ ## Models
+
+ | AVA ID | Source | Active Params | Format | Target |
+ |--------|--------|---------------|--------|--------|
+ | AVA-NAUTILUS-4B | Nemotron-3-Nano-4B | 4B | safetensors + GGUF | Phone |
+ | AVA-NAUTILUS-9B | Nemotron-Nano-9B-v2 | 9B | safetensors | Desktop / tablet |
+ | AVA-NAUTILUS-30B-A3B | Nemotron-3-Nano-30B-A3B | 3B active (MoE) | safetensors + GGUF | Phone (flagship) |
+ | AVA-NAUTILUS-120B-A12B | Nemotron-3-Super-120B-A12B | 12B active (MoE) | safetensors | Desktop |
+ | AVA-NAUTILUS-CASCADE | Nemotron-Cascade-2-30B-A3B | 3B active (MoE) | safetensors + GGUF | Phone (reasoning) |
+ | AVA-NAUTILUS-VL-8B | Nemotron-Nano-VL-8B | 8B | safetensors | Vision tasks |
+
+ ## Directory Structure
+
+ ```
+ raw/                      # Base model weights (safetensors)
+   nemotron-3-nano-4b/
+   nemotron-nano-9b-v2/
+   nemotron-3-nano-30b-a3b/
+   nemotron-3-super-120b-a12b/
+   nemotron-cascade-2-30b-a3b/
+   nemotron-nano-vl-8b/
+ gguf/                     # Quantized for on-device inference
+   nemotron-3-nano-4b-Q4_K_M.gguf
+   nemotron-3-nano-30b-A3B-Q4_K_M.gguf
+   nemotron-cascade-2-30b-a3b-Q4_K_M.gguf
+ production/               # AON-encrypted (deployed to devices)
+ ```
+
+ ## License
+
+ Raw models: NVIDIA Open Model License. AON-encrypted production variants: Proprietary (Intelligent Devices LLC).
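
The model table and directory listing in this README imply a mapping from each AVA ID to a folder under `raw/` and, for the three models listed with GGUF, a file under `gguf/`. A minimal Python sketch of that lookup follows. The pairing of IDs to folders is inferred from the matching source names, and `RAW_DIRS`, `GGUF_FILES`, and `artifact_paths` are hypothetical names used only for illustration; none of them is part of the repository.

```python
from pathlib import PurePosixPath

# Assumption: each AVA ID maps to the raw/ folder whose name matches its
# source model in the README table. These tables are illustrative only.
RAW_DIRS = {
    "AVA-NAUTILUS-4B": "nemotron-3-nano-4b",
    "AVA-NAUTILUS-9B": "nemotron-nano-9b-v2",
    "AVA-NAUTILUS-30B-A3B": "nemotron-3-nano-30b-a3b",
    "AVA-NAUTILUS-120B-A12B": "nemotron-3-super-120b-a12b",
    "AVA-NAUTILUS-CASCADE": "nemotron-cascade-2-30b-a3b",
    "AVA-NAUTILUS-VL-8B": "nemotron-nano-vl-8b",
}

# File names copied verbatim from the gguf/ listing, including the
# upper-case "A3B" in the 30B file name.
GGUF_FILES = {
    "AVA-NAUTILUS-4B": "nemotron-3-nano-4b-Q4_K_M.gguf",
    "AVA-NAUTILUS-30B-A3B": "nemotron-3-nano-30b-A3B-Q4_K_M.gguf",
    "AVA-NAUTILUS-CASCADE": "nemotron-cascade-2-30b-a3b-Q4_K_M.gguf",
}


def artifact_paths(ava_id: str) -> dict:
    """Return repo-relative paths for a model's raw and GGUF artifacts.

    The "gguf" entry is None for models the README lists as
    safetensors-only.
    """
    if ava_id not in RAW_DIRS:
        raise KeyError(f"unknown AVA ID: {ava_id}")
    raw = PurePosixPath("raw") / RAW_DIRS[ava_id]
    gguf_name = GGUF_FILES.get(ava_id)
    gguf = str(PurePosixPath("gguf") / gguf_name) if gguf_name else None
    return {"raw": str(raw), "gguf": gguf}


if __name__ == "__main__":
    for model_id in RAW_DIRS:
        print(model_id, artifact_paths(model_id))
```

Keeping the two tables separate mirrors the README's split between base weights and quantized files, so a safetensors-only model such as AVA-NAUTILUS-9B simply has no GGUF entry.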