--- license: other license_name: nvidia-open-model-license tags: - ava - voiceos - nemotron - nvidia - on-device - mixture-of-experts base_model: - nvidia/NVIDIA-Nemotron-3-Nano-4B - nvidia/NVIDIA-Nemotron-Nano-9B-v2 - nvidia/Nemotron-3-Nano-30B-A3B - nvidia/NVIDIA-Nemotron-3-Super-120B-A12B - nvidia/Nemotron-Cascade-2-30B-A3B - nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1 --- # AVA Nautilus Models (Nemotron) On-device LLM models for VoiceOS, based on NVIDIA Nemotron. ## Models | AVA ID | Source | Active Params | Format | Target | |--------|--------|--------------|--------|--------| | AVA-NAUTILUS-4B | Nemotron-3-Nano-4B | 4B | safetensors + GGUF | Phone | | AVA-NAUTILUS-9B | Nemotron-Nano-9B-v2 | 9B | safetensors | Desktop / tablet | | AVA-NAUTILUS-30B-A3B | Nemotron-3-Nano-30B | 3B active (MoE) | safetensors + GGUF | Phone (flagship) | | AVA-NAUTILUS-120B-A12B | Nemotron-3-Super-120B | 12B active (MoE) | safetensors | Desktop | | AVA-NAUTILUS-CASCADE | Nemotron-Cascade-2-30B-A3B | 3B active | safetensors + GGUF | Phone (reasoning) | | AVA-NAUTILUS-VL-8B | Nemotron-Nano-VL-8B | 8B | safetensors | Vision tasks | ## Directory Structure ``` raw/ # Base model weights (safetensors) nemotron-3-nano-4b/ nemotron-nano-9b-v2/ nemotron-3-nano-30b-a3b/ nemotron-3-super-120b-a12b/ nemotron-cascade-2-30b-a3b/ nemotron-nano-vl-8b/ gguf/ # Quantized for on-device inference nemotron-3-nano-4b-Q4_K_M.gguf nemotron-3-nano-30b-A3B-Q4_K_M.gguf nemotron-cascade-2-30b-a3b-Q4_K_M.gguf production/ # AON-encrypted (deployed to devices) ``` ## License Raw models: NVIDIA Open Model License. AON-encrypted production variants: Proprietary (Intelligent Devices LLC).