3.73 GB

Ctrl+K

1 contributor

History: 3 commits

majentik

docs: Tier 2 polish — variant matrix + quant trade-off

769e764 verified 26 days ago

.gitattributes

1.73 kB
Add MLX quantized model with KV cache compression about 2 months ago
README.md

5.27 kB
docs: Tier 2 polish — variant matrix + quant trade-off 26 days ago
adaptor.safetensors

233 MB
xet

Add MLX quantized model with KV cache compression about 2 months ago
adaptor_config.json

82 Bytes
Add MLX quantized model with KV cache compression about 2 months ago
config.json

2.85 kB
Add MLX quantized model with KV cache compression about 2 months ago
decoder-00000.safetensors

2.28 GB
xet

Add MLX quantized model with KV cache compression about 2 months ago
decoder-00001.safetensors

499 MB
xet

Add MLX quantized model with KV cache compression about 2 months ago
decoder.safetensors.index.json

22.2 kB
Add MLX quantized model with KV cache compression about 2 months ago
decoder_config.json

802 Bytes
Add MLX quantized model with KV cache compression about 2 months ago
encoder.safetensors

677 MB
xet

Add MLX quantized model with KV cache compression about 2 months ago
encoder_config.json

1.21 kB
Add MLX quantized model with KV cache compression about 2 months ago
generation_config.json

197 Bytes
Add MLX quantized model with KV cache compression about 2 months ago
preprocessor_config.json

443 Bytes
Add MLX quantized model with KV cache compression about 2 months ago
processor_config.json

281 Bytes
Add MLX quantized model with KV cache compression about 2 months ago
radar_asr.png

1.43 MB
xet

Add MLX quantized model with KV cache compression about 2 months ago
radar_task.png

1.22 MB
xet

Add MLX quantized model with KV cache compression about 2 months ago
special_tokens_map.json

636 Bytes
Add MLX quantized model with KV cache compression about 2 months ago
tokenizer.json

34.4 MB
xet

Add MLX quantized model with KV cache compression about 2 months ago
tokenizer.model

4.24 MB
xet

Add MLX quantized model with KV cache compression about 2 months ago
tokenizer_config.json

46.9 kB
Add MLX quantized model with KV cache compression about 2 months ago