Audio-to-Audio
Moshi
Safetensors
Hindi
speech-to-speech
hindi
conversational-ai
audio
full-duplex
duplex-dialogue
indian-languages
Instructions to use JoshTalksAI/Human-1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Moshi
How to use JoshTalksAI/Human-1 with Moshi:
# pip install moshi # Run the interactive web server python -m moshi.server --hf-repo "JoshTalksAI/Human-1" # Then open https://localhost:8998 in your browser
# pip install moshi import torch from moshi.models import loaders # Load checkpoint info from HuggingFace checkpoint = loaders.CheckpointInfo.from_hf_repo("JoshTalksAI/Human-1") # Load the Mimi audio codec mimi = checkpoint.get_mimi(device="cuda") mimi.set_num_codebooks(8) # Encode audio (24kHz, mono) wav = torch.randn(1, 1, 24000 * 10) # [batch, channels, samples] with torch.no_grad(): codes = mimi.encode(wav.cuda()) decoded = mimi.decode(codes) - Notebooks
- Google Colab
- Kaggle
Add files using upload-large-folder tool
Browse files- model.safetensors +3 -0
- tokenizer-e351c8d8-checkpoint125.safetensors +3 -0
- tokenizer_hindi.model +3 -0
- tokenizer_hindi.vocab +0 -0
model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:66e73545d7e54dc8ed39bcab1b62ae336c82d97b141dbc0622a07acbf2a5ea2d
|
| 3 |
+
size 30750958336
|
tokenizer-e351c8d8-checkpoint125.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:09b782f0629851a271227fb9d36db65c041790365f11bbe5d3d59369cf863f50
|
| 3 |
+
size 384644900
|
tokenizer_hindi.model
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a05524805f181f74520be13b407cb00bcea3872398bcd0058d75e40c6bfc13c2
|
| 3 |
+
size 1080022
|
tokenizer_hindi.vocab
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|