Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K at main

20.7 GB

1 contributor

History: 4 commits

Update README.md

a87dab3 verified 7 months ago

File | Size | Storage | Last commit | Updated
.gitattributes | 1.57 kB | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
README.md | 5.43 kB | | Update README.md | 7 months ago
added_tokens.json | 707 Bytes | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
chat_template.jinja | 4.17 kB | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
config.json | 3.49 kB | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
generation_config.json | 214 Bytes | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
hf_quant_config.json | 267 Bytes | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
merges.txt | 1.67 MB | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
model-00001-of-00005.safetensors | 4.97 GB | xet | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
model-00002-of-00005.safetensors | 4.94 GB | xet | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
model-00003-of-00005.safetensors | 4.94 GB | xet | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
model-00004-of-00005.safetensors | 4.26 GB | xet | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
model-00005-of-00005.safetensors | 1.56 GB | xet | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
model.safetensors.index.json | 176 kB | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
quantization_metadata.json | 1.06 kB | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
special_tokens_map.json | 613 Bytes | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
tokenizer.json | 11.4 MB | xet | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
tokenizer_config.json | 5.4 kB | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
vocab.json | 2.78 MB | | Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling) | 7 months ago
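
The 20.7 GB figure near the top of the listing appears to be the combined size of the files. A minimal Python sketch of that arithmetic follows, assuming the displayed sizes use decimal units (1 GB = 1000 MB) and ignoring the files under 1 MB, which do not change the rounded result. The variable and function names are illustrative only.

```python
# Sizes copied from the listing; decimal units are an assumption.
shard_sizes_gb = [4.97, 4.94, 4.94, 4.26, 1.56]  # model-0000{1..5}-of-00005.safetensors
other_sizes_mb = [11.4, 2.78, 1.67]              # tokenizer.json, vocab.json, merges.txt


def total_size_gb(shards_gb, others_mb):
    """Sum shard sizes (GB) and the larger auxiliary files (MB), returning GB."""
    return sum(shards_gb) + sum(others_mb) / 1000


total = total_size_gb(shard_sizes_gb, other_sizes_mb)
print(f"{total:.1f} GB")
```

The five safetensors shards account for nearly all of the total; the tokenizer and vocabulary files add less than 0.02 GB.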