How to use from the
Use from the
MLX library
# Download the model from the Hub
pip install huggingface_hub[hf_xet]

huggingface-cli download --local-dir SenseVoiceSmall-4bit vanch007/SenseVoiceSmall-4bit

SenseVoiceSmall 4bit MLX

This repository is a local 4-bit MLX quantization of mlx-community/SenseVoiceSmall for VTranslator.

Quantization:

  • bits: 4
  • group_size: 64
  • target repo: vanch007/SenseVoiceSmall-4bit

The app expects the standard config.json, model*.safetensors, am.mvn, and tokenizer model files.

Downloads last month
33
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for vanch007/SenseVoiceSmall-4bit

Finetuned
(1)
this model