How to use jspaulsen/unmute-encoder with Transformers:
# Load model directly
from transformers import AutoModel

model = AutoModel.from_pretrained("jspaulsen/unmute-encoder", dtype="auto")
How to use jspaulsen/unmute-encoder with Moshi:
# pip install moshi

# Run the interactive web server
python -m moshi.server --hf-repo "jspaulsen/unmute-encoder"

# Then open https://localhost:8998 in your browser

# pip install moshi
import torch
from moshi.models import loaders

# Load checkpoint info from HuggingFace
checkpoint = loaders.CheckpointInfo.from_hf_repo("jspaulsen/unmute-encoder")

# Load the Mimi audio codec
mimi = checkpoint.get_mimi(device="cuda")
mimi.set_num_codebooks(8)

# Encode audio (24kHz, mono)
wav = torch.randn(1, 1, 24000 * 10)  # [batch, channels, samples]
with torch.no_grad():
    codes = mimi.encode(wav.cuda())
    decoded = mimi.decode(codes)
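As a sanity check on the encode example above, the sketch below estimates the shape of `codes` for a given input length. It is not part of the model card: the 12.5 Hz frame rate is an assumption taken from Mimi's published configuration and may not hold for this checkpoint, rounding the frame count up is also an assumption, and `expected_code_shape` is a hypothetical helper name.

```python
import math

SAMPLE_RATE = 24_000  # Hz, from the "24kHz, mono" comment in the snippet above
FRAME_RATE = 12.5     # Hz, ASSUMED: Mimi's published frame rate, unverified for this repo
NUM_CODEBOOKS = 8     # matches mimi.set_num_codebooks(8) in the snippet above


def expected_code_shape(batch, num_samples,
                        sample_rate=SAMPLE_RATE,
                        frame_rate=FRAME_RATE,
                        num_codebooks=NUM_CODEBOOKS):
    """Estimate the [batch, codebooks, frames] shape of the encoded tokens.

    Hypothetical helper: assumes one code per codebook per frame and that a
    trailing partial frame is rounded up to a full frame.
    """
    samples_per_frame = int(sample_rate / frame_rate)  # 1920 under the assumptions above
    num_frames = math.ceil(num_samples / samples_per_frame)
    return (batch, num_codebooks, num_frames)


# Ten seconds of audio, as in the snippet above
print(expected_code_shape(1, 24000 * 10))
```

Comparing this estimate with `codes.shape` after running the encode example is a quick way to confirm the frame rate and codebook count the checkpoint actually uses.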