ava-nautilus / README.md

IDEAHQ

Initial model card — Nemotron family

bbf3d9b verified 6 days ago

preview code

raw

history blame contribute delete

1.78 kB

metadata

license: other
license_name: nvidia-open-model-license
tags:
  - ava
  - voiceos
  - nemotron
  - nvidia
  - on-device
  - mixture-of-experts
base_model:
  - nvidia/NVIDIA-Nemotron-3-Nano-4B
  - nvidia/NVIDIA-Nemotron-Nano-9B-v2
  - nvidia/Nemotron-3-Nano-30B-A3B
  - nvidia/NVIDIA-Nemotron-3-Super-120B-A12B
  - nvidia/Nemotron-Cascade-2-30B-A3B
  - nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1

AVA Nautilus Models (Nemotron)

On-device LLM models for VoiceOS, based on NVIDIA Nemotron.

Models

AVA ID	Source	Active Params	Format	Target
AVA-NAUTILUS-4B	Nemotron-3-Nano-4B	4B	safetensors + GGUF	Phone
AVA-NAUTILUS-9B	Nemotron-Nano-9B-v2	9B	safetensors	Desktop / tablet
AVA-NAUTILUS-30B-A3B	Nemotron-3-Nano-30B	3B active (MoE)	safetensors + GGUF	Phone (flagship)
AVA-NAUTILUS-120B-A12B	Nemotron-3-Super-120B	12B active (MoE)	safetensors	Desktop
AVA-NAUTILUS-CASCADE	Nemotron-Cascade-2-30B-A3B	3B active	safetensors + GGUF	Phone (reasoning)
AVA-NAUTILUS-VL-8B	Nemotron-Nano-VL-8B	8B	safetensors	Vision tasks

Directory Structure

raw/                           # Base model weights (safetensors)
  nemotron-3-nano-4b/
  nemotron-nano-9b-v2/
  nemotron-3-nano-30b-a3b/
  nemotron-3-super-120b-a12b/
  nemotron-cascade-2-30b-a3b/
  nemotron-nano-vl-8b/
gguf/                          # Quantized for on-device inference
  nemotron-3-nano-4b-Q4_K_M.gguf
  nemotron-3-nano-30b-A3B-Q4_K_M.gguf
  nemotron-cascade-2-30b-a3b-Q4_K_M.gguf
production/                    # AON-encrypted (deployed to devices)

License

Raw models: NVIDIA Open Model License. AON-encrypted production variants: Proprietary (Intelligent Devices LLC).