---
license: cc-by-4.0
language:
- en
library_name: pocket-tts-onnx
base_model:
- kyutai/pocket-tts
pipeline_tag: text-to-speech
tags:
- tts
- voice-cloning
- onnx
- onnxruntime
---

# Voice Clone Pro ONNX

## Files

```
pocket-tts-onnx/
├── onnx/
│   ├── flow_lm_main.onnx       # 303 MB - Flow LM transformer (FP32)
│   ├── flow_lm_main_int8.onnx  # 76 MB - Flow LM transformer (INT8)
│   ├── flow_lm_flow.onnx       # 39 MB - Flow network (FP32)
│   ├── flow_lm_flow_int8.onnx  # 10 MB - Flow network (INT8)
│   ├── mimi_decoder.onnx       # 42 MB - Audio decoder (FP32)
│   ├── mimi_decoder_int8.onnx  # 23 MB - Audio decoder (INT8)
│   ├── mimi_encoder.onnx       # 73 MB - Voice encoder
│   └── text_conditioner.onnx   # 16 MB - Text embeddings
├── reference_sample.wav        # Example voice reference
├── tokenizer.model             # SentencePiece tokenizer
├── pocket_tts_onnx.py          # Inference wrapper
├── generate.py                 # CLI script
├── requirements.txt            # Python dependencies
└── README.md
```
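
The listing pairs three of the models with an INT8 variant, while the voice encoder and text conditioner appear only once. As a minimal sketch of how a script might pick the right files for a chosen precision, the snippet below builds the paths from the names in the listing. The helper names `resolve_model_paths` and `missing_files` are hypothetical and are not taken from `pocket_tts_onnx.py`.

```python
from pathlib import Path

# Component names are copied from the file listing; the helpers are illustrative only.
_QUANTIZABLE = ("flow_lm_main", "flow_lm_flow", "mimi_decoder")
_SINGLE_VARIANT = ("mimi_encoder", "text_conditioner")


def resolve_model_paths(root, precision="fp32"):
    """Map each model component to its .onnx file under <root>/onnx/."""
    if precision not in ("fp32", "int8"):
        raise ValueError(f"unsupported precision: {precision!r}")
    suffix = "_int8" if precision == "int8" else ""
    onnx_dir = Path(root) / "onnx"
    paths = {name: onnx_dir / f"{name}{suffix}.onnx" for name in _QUANTIZABLE}
    # The listing shows no INT8 file for these two, so the base file is always used.
    paths.update({name: onnx_dir / f"{name}.onnx" for name in _SINGLE_VARIANT})
    return paths


def missing_files(paths):
    """Return the sorted component names whose files are not present on disk."""
    return sorted(name for name, path in paths.items() if not path.is_file())
```

Calling `missing_files(resolve_model_paths("pocket-tts-onnx", "int8"))` before inference gives a quick check that the download is complete. Each resolved path could then be passed to `onnxruntime.InferenceSession`; how the bundled wrapper actually loads the models is not shown in this section.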