walydevelopers's picture
Update README.md
a9c25da verified
---
license: cc-by-4.0
language:
- en
library_name: pocket-tts-onnx
base_model:
- kyutai/pocket-tts
pipeline_tag: text-to-speech
tags:
- tts
- voice-cloning
- onnx
- onnxruntime
---
# Voice Clone Pro ONNX
## Files
```
pocket-tts-onnx/
β”œβ”€β”€ onnx/
β”‚ β”œβ”€β”€ flow_lm_main.onnx # 303 MB - Flow LM transformer (FP32)
β”‚ β”œβ”€β”€ flow_lm_main_int8.onnx # 76 MB - Flow LM transformer (INT8)
β”‚ β”œβ”€β”€ flow_lm_flow.onnx # 39 MB - Flow network (FP32)
β”‚ β”œβ”€β”€ flow_lm_flow_int8.onnx # 10 MB - Flow network (INT8)
β”‚ β”œβ”€β”€ mimi_decoder.onnx # 42 MB - Audio decoder (FP32)
β”‚ β”œβ”€β”€ mimi_decoder_int8.onnx # 23 MB - Audio decoder (INT8)
β”‚ β”œβ”€β”€ mimi_encoder.onnx # 73 MB - Voice encoder
β”‚ └── text_conditioner.onnx # 16 MB - Text embeddings
β”œβ”€β”€ reference_sample.wav # Example voice reference
β”œβ”€β”€ tokenizer.model # SentencePiece tokenizer
β”œβ”€β”€ pocket_tts_onnx.py # Inference wrapper
β”œβ”€β”€ generate.py # CLI script
β”œβ”€β”€ requirements.txt # Python dependencies
└── README.md
```