IDEAHQ/ava-nautilus

Initial model card — Nemotron family

bbf3d9b verified 7 days ago

	---
	license: other
	license_name: nvidia-open-model-license
	tags:
	- ava
	- voiceos
	- nemotron
	- nvidia
	- on-device
	- mixture-of-experts
	base_model:
	- nvidia/NVIDIA-Nemotron-3-Nano-4B
	- nvidia/NVIDIA-Nemotron-Nano-9B-v2
	- nvidia/Nemotron-3-Nano-30B-A3B
	- nvidia/NVIDIA-Nemotron-3-Super-120B-A12B
	- nvidia/Nemotron-Cascade-2-30B-A3B
	- nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1
	---

	# AVA Nautilus Models (Nemotron)

	On-device LLMs for VoiceOS, built on the NVIDIA Nemotron model family.

	## Models

	| AVA ID | Source | Active Params | Format | Target |
	|--------|--------|---------------|--------|--------|
	| AVA-NAUTILUS-4B | NVIDIA-Nemotron-3-Nano-4B | 4B | safetensors + GGUF | Phone |
	| AVA-NAUTILUS-9B | NVIDIA-Nemotron-Nano-9B-v2 | 9B | safetensors | Desktop / tablet |
	| AVA-NAUTILUS-30B-A3B | Nemotron-3-Nano-30B-A3B | 3B active (MoE) | safetensors + GGUF | Phone (flagship) |
	| AVA-NAUTILUS-120B-A12B | NVIDIA-Nemotron-3-Super-120B-A12B | 12B active (MoE) | safetensors | Desktop |
	| AVA-NAUTILUS-CASCADE | Nemotron-Cascade-2-30B-A3B | 3B active | safetensors + GGUF | Phone (reasoning) |
	| AVA-NAUTILUS-VL-8B | Llama-3.1-Nemotron-Nano-VL-8B-V1 | 8B | safetensors | Vision tasks |
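The "Active Params" column counts only the parameters used per token. For the mixture-of-experts rows, the full parameter count (the 30B or 120B in the model name) still has to be stored on the device. The sketch below illustrates that distinction with a hypothetical `ModelEntry` record. The `bits_per_weight` default is an assumed rough average for a 4-bit quantization, not a figure from this card, so the resulting sizes are ballpark estimates only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """Hypothetical record for one row of the Models table."""
    ava_id: str
    total_params_b: float   # total parameters in billions (read off the model name)
    active_params_b: float  # parameters used per token in billions (from the table)


# Values transcribed from the table; MoE totals come from the model names.
MODELS = [
    ModelEntry("AVA-NAUTILUS-4B", 4, 4),
    ModelEntry("AVA-NAUTILUS-9B", 9, 9),
    ModelEntry("AVA-NAUTILUS-30B-A3B", 30, 3),
    ModelEntry("AVA-NAUTILUS-120B-A12B", 120, 12),
    ModelEntry("AVA-NAUTILUS-CASCADE", 30, 3),
    ModelEntry("AVA-NAUTILUS-VL-8B", 8, 8),
]


def is_sparse(entry: ModelEntry) -> bool:
    """True when fewer parameters are active per token than are stored."""
    return entry.active_params_b < entry.total_params_b


def approx_weight_gb(entry: ModelEntry, bits_per_weight: float = 4.5) -> float:
    """Rough weight storage in GB: total params * bits per weight / 8.

    bits_per_weight is an assumption; real quantized files also carry
    metadata and may keep some tensors at higher precision.
    """
    return entry.total_params_b * bits_per_weight / 8


if __name__ == "__main__":
    for m in MODELS:
        kind = "MoE" if is_sparse(m) else "dense"
        print(f"{m.ava_id}: {kind}, ~{approx_weight_gb(m):.1f} GB at 4.5 bits/weight")
```

Storage scales with the total parameter count, while per-token compute scales with the active count, which is why both numbers matter when choosing a target device.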

	## Directory Structure

	```
	raw/                                        # Base model weights (safetensors)
	  nemotron-3-nano-4b/
	  nemotron-nano-9b-v2/
	  nemotron-3-nano-30b-a3b/
	  nemotron-3-super-120b-a12b/
	  nemotron-cascade-2-30b-a3b/
	  nemotron-nano-vl-8b/
	gguf/                                       # Quantized for on-device inference
	  nemotron-3-nano-4b-Q4_K_M.gguf
	  nemotron-3-nano-30b-A3B-Q4_K_M.gguf
	  nemotron-cascade-2-30b-a3b-Q4_K_M.gguf
	production/                                 # AON-encrypted (deployed to devices)
	```
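As an illustration of how this layout might be consumed, the sketch below maps an AVA ID to its GGUF path inside the repository. The `GGUF_FILES` mapping and the `gguf_repo_path` and `download_gguf` helpers are hypothetical names introduced here. The repo id `IDEAHQ/ava-nautilus` is inferred from the repository name, and the download step assumes that the `huggingface_hub` package is installed and that the files are actually published at these paths.

```python
from pathlib import PurePosixPath

REPO_ID = "IDEAHQ/ava-nautilus"  # assumption: inferred from the repository name

# Hypothetical mapping from AVA ID to the filenames listed under gguf/.
# Only the models whose Format column includes GGUF appear here.
GGUF_FILES = {
    "AVA-NAUTILUS-4B": "nemotron-3-nano-4b-Q4_K_M.gguf",
    "AVA-NAUTILUS-30B-A3B": "nemotron-3-nano-30b-A3B-Q4_K_M.gguf",
    "AVA-NAUTILUS-CASCADE": "nemotron-cascade-2-30b-a3b-Q4_K_M.gguf",
}


def gguf_repo_path(ava_id: str) -> str:
    """Return the repo-relative path of the GGUF file for an AVA ID."""
    try:
        filename = GGUF_FILES[ava_id]
    except KeyError:
        raise ValueError(f"{ava_id} has no GGUF build listed in this card") from None
    return str(PurePosixPath("gguf") / filename)


def download_gguf(ava_id: str) -> str:
    """Download the GGUF file and return its local cache path.

    Requires network access and the optional huggingface_hub dependency,
    so the import is deferred until the function is called.
    """
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=REPO_ID, filename=gguf_repo_path(ava_id))


if __name__ == "__main__":
    print(gguf_repo_path("AVA-NAUTILUS-4B"))
```

The local path returned by `download_gguf` can then be handed to any GGUF-compatible runtime. The `raw/` safetensors directories would need a different loader, and the `production/` variants are encrypted, so neither is covered here.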

	## License

	Raw models: NVIDIA Open Model License. AON-encrypted production variants: Proprietary (Intelligent Devices LLC).