Instructions to use Saugat20021/sahayak-voice-models with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Chatterbox
How to use Saugat20021/sahayak-voice-models with Chatterbox:
# pip install chatterbox-tts import torchaudio as ta from chatterbox.tts import ChatterboxTTS model = ChatterboxTTS.from_pretrained(device="cuda") text = "Ezreal and Jinx teamed up with Ahri, Yasuo, and Teemo to take down the enemy's Nexus in an epic late-game pentakill." wav = model.generate(text) ta.save("test-1.wav", wav, model.sr) # If you want to synthesize with a different voice, specify the audio prompt AUDIO_PROMPT_PATH="YOUR_FILE.wav" wav = model.generate(text, audio_prompt_path=AUDIO_PROMPT_PATH) ta.save("test-2.wav", wav, model.sr) - llama-cpp-python
How to use Saugat20021/sahayak-voice-models with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="Saugat20021/sahayak-voice-models", filename="llm/himalaya-gemma-toolcall.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use Saugat20021/sahayak-voice-models with llama.cpp:
Install (macOS, Linux)
curl -LsSf https://llama.app/install.sh | sh # Start a local OpenAI-compatible server with a web UI: llama serve -hf Saugat20021/sahayak-voice-models # Run inference directly in the terminal: llama cli -hf Saugat20021/sahayak-voice-models
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama serve -hf Saugat20021/sahayak-voice-models # Run inference directly in the terminal: llama cli -hf Saugat20021/sahayak-voice-models
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf Saugat20021/sahayak-voice-models # Run inference directly in the terminal: ./llama-cli -hf Saugat20021/sahayak-voice-models
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf Saugat20021/sahayak-voice-models # Run inference directly in the terminal: ./build/bin/llama-cli -hf Saugat20021/sahayak-voice-models
Use Docker
docker model run hf.co/Saugat20021/sahayak-voice-models
- LM Studio
- Jan
- Ollama
How to use Saugat20021/sahayak-voice-models with Ollama:
ollama run hf.co/Saugat20021/sahayak-voice-models
- Unsloth Studio
How to use Saugat20021/sahayak-voice-models with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Saugat20021/sahayak-voice-models to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for Saugat20021/sahayak-voice-models to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for Saugat20021/sahayak-voice-models to start chatting
- Pi
How to use Saugat20021/sahayak-voice-models with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama serve -hf Saugat20021/sahayak-voice-models
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "Saugat20021/sahayak-voice-models" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use Saugat20021/sahayak-voice-models with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama serve -hf Saugat20021/sahayak-voice-models
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default Saugat20021/sahayak-voice-models
Run Hermes
hermes
- Atomic Chat new
- OpenClaw new
How to use Saugat20021/sahayak-voice-models with OpenClaw:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama serve -hf Saugat20021/sahayak-voice-models
Configure OpenClaw
# Install OpenClaw: npm install -g openclaw@latest # Register the local server and set it as the default model: openclaw onboard --non-interactive --mode local \ --auth-choice custom-api-key \ --custom-base-url http://127.0.0.1:8080/v1 \ --custom-model-id "Saugat20021/sahayak-voice-models" \ --custom-provider-id llama-cpp \ --custom-compatibility openai \ --custom-text-input \ --accept-risk \ --skip-health
Run OpenClaw
openclaw agent --local --agent main --message "Hello from Hugging Face"
- Docker Model Runner
How to use Saugat20021/sahayak-voice-models with Docker Model Runner:
docker model run hf.co/Saugat20021/sahayak-voice-models
- Lemonade
How to use Saugat20021/sahayak-voice-models with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull Saugat20021/sahayak-voice-models
Run and chat with the model
lemonade run user.sahayak-voice-models-{{QUANT_TAG}}List all available models
lemonade list
llm.create_chat_completion(
messages = "No input example has been defined for this model task."
)Sahayak Voice Models
The full model bundle behind the Sahayak voice banking assistant: TTS, STT, and
LLM, one folder each. tts/ and stt/ are original fine-tunes; llm/'s base
weights are an unmodified third-party public model (only its Modelfile is original).
All are included here so this repo is a complete, self-contained, reproducible bundle
of everything the assistant runs. Attribution for each is below.
tts/ — original fine-tune
| File | What it is |
|---|---|
t3_nepali_checkpoint.pt |
Fine-tuned T3 (text→speech-token) model, trained on Nepali |
tokenizer_np.json |
Nepali BPE+grapheme tokenizer for the fine-tuned T3 model |
This is a Nepali fine-tune of the T3 stage of
ResembleAI/chatterbox. You still need
the base model's other components to use it — ve.safetensors (voice encoder),
s3gen.safetensors (mel generator), conds.pt (default voice conditioning) — download
those from the original ResembleAI repo; they are unmodified upstream weights and
aren't re-hosted here.
stt/ — fine-tuned on the Hindi version
Based on ai4bharat/indic-conformer-600m-multilingual (ONNX conformer encoder + per-language CTC/RNNT decoder heads, 22 Indic languages including Nepali) — further fine-tuned here on the Hindi version of the model to adapt it for this assistant. All credit for the base architecture/weights belongs to the original AI4Bharat repo; the fine-tuning on top of it was done for this project.
llm/ — base weights are a mirror; the Modelfile is original
| File | What it is |
|---|---|
himalaya-gemma-toolcall.gguf |
Same weights as himalaya-ai/himalaya-gemma-4-e2b-it-gguf — not modified/fine-tuned further here |
Modelfile |
An Ollama Modelfile adding a tool-calling prompt template (JSON function-call format) on top of the base model — this is the only original artifact in this folder |
The underlying weights are unchanged from Himalaya AI Labs' public Gemma-based Nepali model. All credit and license terms belong to that original repo.
Summary of what's actually original here
tts/t3_nepali_checkpoint.pt + tts/tokenizer_np.json (Nepali TTS fine-tune), the
stt/ model (fine-tuned from the Hindi version of IndicConformer), and llm/Modelfile
were produced for this project. The llm/himalaya-gemma-toolcall.gguf weights are an
unmodified copy of a third-party public model, included for a complete, reproducible
bundle — not a claim of authorship over those weights.
- Downloads last month
- -
We're not able to determine the quantization variants.
Model tree for Saugat20021/sahayak-voice-models
Base model
ResembleAI/chatterbox
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="Saugat20021/sahayak-voice-models", filename="llm/himalaya-gemma-toolcall.gguf", )