How to use hf-audio/xcodec-hubert-librispeech with Transformers:

```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("feature-extraction", model="hf-audio/xcodec-hubert-librispeech")
```

```python
# Load model directly
from transformers import AutoFeatureExtractor, AutoModel

extractor = AutoFeatureExtractor.from_pretrained("hf-audio/xcodec-hubert-librispeech")
model = AutoModel.from_pretrained("hf-audio/xcodec-hubert-librispeech")
```
Update README.md
README.md
CHANGED
```diff
@@ -7,7 +7,15 @@ datasets:
 
 # X-Codec (speech, HuBERT)
 
-This codec is
+This codec is part of the X-Codec family of codecs as shown below:
+
+| Model checkpoint | Semantic Model | Domain | Training Data |
+|--------------------------------------------|-----------------------------------------------------------------------|---------------|-------------------------------|
+| [xcodec-hubert-librispeech](https://huggingface.co/hf-audio/xcodec-hubert-librispeech) (this model) | [facebook/hubert-base-ls960](https://huggingface.co/facebook/hubert-base-ls960) | Speech | Librispeech |
+| [xcodec-wavlm-mls](https://huggingface.co/hf-audio/xcodec-wavlm-mls) | [microsoft/wavlm-base-plus](https://huggingface.co/microsoft/wavlm-base-plus)| Speech | MLS English |
+| [xcodec-wavlm-more-data](https://huggingface.co/hf-audio/xcodec-wavlm-more-data) | [microsoft/wavlm-base-plus](https://huggingface.co/microsoft/wavlm-base-plus)| Speech | MLS English + Internal data |
+| [xcodec-hubert-general](https://huggingface.co/hf-audio/xcodec-hubert-general) | [ZhenYe234/hubert_base_general_audio](https://huggingface.co/ZhenYe234/hubert_base_general_audio) | General audio | 200k hours internal data |
+| [xcodec-hubert-general-balanced](https://huggingface.co/hf-audio/xcodec-hubert-general-balanced) | [ZhenYe234/hubert_base_general_audio](https://huggingface.co/ZhenYe234/hubert_base_general_audio) | General audio | More balanced data |
 
 Original model is `xcodec_hubert_librispeech` from [this table](https://github.com/zhenye234/xcodec?tab=readme-ov-file#available-models).
 
```
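For readers who want to pick a checkpoint programmatically, the table added in this commit could be mirrored as a small lookup. This is only a sketch: the names `XCODEC_FAMILY` and `checkpoints_for_domain` are hypothetical and not part of the README or of any library, and the repository ids are assumed from the `hf-audio/...` links in the table.

```python
from typing import Dict, List, Tuple

# Hypothetical lookup mirroring the README table:
# checkpoint id -> (semantic model, domain, training data).
# Repository ids are assumed from the table's huggingface.co links.
XCODEC_FAMILY: Dict[str, Tuple[str, str, str]] = {
    "hf-audio/xcodec-hubert-librispeech": (
        "facebook/hubert-base-ls960", "Speech", "Librispeech"),
    "hf-audio/xcodec-wavlm-mls": (
        "microsoft/wavlm-base-plus", "Speech", "MLS English"),
    "hf-audio/xcodec-wavlm-more-data": (
        "microsoft/wavlm-base-plus", "Speech", "MLS English + Internal data"),
    "hf-audio/xcodec-hubert-general": (
        "ZhenYe234/hubert_base_general_audio", "General audio",
        "200k hours internal data"),
    "hf-audio/xcodec-hubert-general-balanced": (
        "ZhenYe234/hubert_base_general_audio", "General audio",
        "More balanced data"),
}


def checkpoints_for_domain(domain: str) -> List[str]:
    """Return checkpoint ids whose Domain column matches `domain` (case-insensitive)."""
    wanted = domain.strip().lower()
    return [
        checkpoint
        for checkpoint, (_semantic_model, row_domain, _data) in XCODEC_FAMILY.items()
        if row_domain.lower() == wanted
    ]


if __name__ == "__main__":
    for checkpoint in checkpoints_for_domain("speech"):
        print(checkpoint, "->", XCODEC_FAMILY[checkpoint][0])
```

A returned id can then be passed as the `model` argument or to `from_pretrained` in the Transformers snippets shown on the model page.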