Orpheus-3b-FT-AWQ

This is an AWQ-quantised version of canopylabs/orpheus-3b-0.1-ft.

Orpheus is a high-performance Text-to-Speech model fine-tuned for natural, emotional speech synthesis. This repository hosts the 8-bit quantised version of the 3B-parameter model, which reduces memory use while maintaining high-quality output.

Model Description

Orpheus-3b-FT-AWQ is a 3-billion-parameter Text-to-Speech model that converts text into natural-sounding speech, with support for multiple voices and emotional expressions. The model has been quantised to 8-bit (Q8_0) format for efficient inference, which makes it usable on consumer hardware.
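
As a rough illustration of why quantisation matters on consumer hardware, the sketch below estimates the memory taken by the weights alone at different bit widths. It assumes exactly 3 billion parameters and ignores quantisation scales, activations, the KV cache, and the audio decoder, so real usage will be higher. `weight_memory_gb` is a hypothetical helper, not part of any library.

```python
def weight_memory_gb(n_params: int, bits_per_weight: int) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: "3B" is taken as exactly 3e9 parameters.
N_PARAMS = 3_000_000_000

for bits in (32, 16, 8):
    print(f"{bits:>2}-bit weights: ~{weight_memory_gb(N_PARAMS, bits):.1f} GB")
```

Under these assumptions, 8-bit weights need about half the memory of 16-bit weights and a quarter of 32-bit weights.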

Key features:

  • 8 distinct voices, each with its own characteristics
  • Support for emotion tags such as laughter and sighs
  • Optimised for CUDA acceleration on RTX GPUs
  • Produces high-quality 24 kHz mono audio
  • Fine-tuned for conversational naturalness
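
The features above can be sketched in a few lines: building a prompt with a voice and an emotion tag, then saving 24 kHz mono audio. This is a minimal sketch under assumptions, not the model's documented interface. The voice name `tara`, the `<laugh>` tag syntax, and the `voice: text` prompt layout are not stated in this card and should be checked against the upstream Orpheus documentation. `build_prompt` and `save_wav` are hypothetical helpers, the 16-bit sample width is an assumption, and a sine tone stands in for real model output.

```python
import math
import struct
import wave

SAMPLE_RATE = 24_000  # from the feature list: 24 kHz mono output


def build_prompt(voice: str, text: str) -> str:
    # Assumption: the prompt is "<voice>: <text>" with emotion tags inline.
    return f"{voice}: {text}"


def save_wav(path: str, samples: list[float], sample_rate: int = SAMPLE_RATE) -> None:
    """Write float samples in [-1, 1] as 16-bit mono PCM (assumed width)."""
    frames = b"".join(
        struct.pack("<h", int(max(-1.0, min(1.0, s)) * 32767)) for s in samples
    )
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 2 bytes per sample = 16-bit
        wav.setframerate(sample_rate)
        wav.writeframes(frames)


prompt = build_prompt("tara", "Well, that went better than expected <laugh>.")
print(prompt)

# Placeholder audio: one second of a 440 Hz tone standing in for model output.
tone = [0.3 * math.sin(2 * math.pi * 440 * n / SAMPLE_RATE) for n in range(SAMPLE_RATE)]
save_wav("example.wav", tone)
```

In real use, the placeholder tone would be replaced by the decoded audio samples produced by the model.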

Format: Safetensors. Model size: 3B params. Tensor types: F32, I32.