Generate and convert voice using text and audio inputs
Create a GGUF-quantized model from a Hugging Face repo