Text Generation
GGUF
English
medical
conversational
How to use from
llama.cpp
Install (macOS, Linux)
curl -LsSf https://llama.app/install.sh | sh
# Start a local OpenAI-compatible server with a web UI:
llama serve -hf Kush26/Mental_Health_ChatBot:F16
# Run inference directly in the terminal:
llama cli -hf Kush26/Mental_Health_ChatBot:F16
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama serve -hf Kush26/Mental_Health_ChatBot:F16
# Run inference directly in the terminal:
llama cli -hf Kush26/Mental_Health_ChatBot:F16
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf Kush26/Mental_Health_ChatBot:F16
# Run inference directly in the terminal:
./llama-cli -hf Kush26/Mental_Health_ChatBot:F16
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf Kush26/Mental_Health_ChatBot:F16
# Run inference directly in the terminal:
./build/bin/llama-cli -hf Kush26/Mental_Health_ChatBot:F16
Use Docker
docker model run hf.co/Kush26/Mental_Health_ChatBot:F16
Quick Links

Fine-Tuned LLaMA-3 8B Mental Health Conversational Model

Model Overview

This is a fine-tuned version of LLaMA-3 8B Instruct, specifically adapted for conversational mental health support. The model has been fine-tuned using LoRA / QLoRA techniques and quantized to 4-bit for efficient inference. It is ideal for applications requiring lightweight deployment without compromising the quality of responses.

  • Base Model: LLaMA-3 8B Instruct
  • Fine-Tuning: Mental health conversational dataset
  • Technique: LoRA / QLoRA
  • Quantization: 4-bit (GGUF)
  • File Format: model.Q4_K_M.gguf

This model is optimized for generating empathetic, safe, and context-aware responses for mental health conversations. It is intended for research, personal, or educational use.


How to Download

You can download the model using this link:


Using in LM Studio

Follow these steps to use the model in LM Studio:

  1. Install LM Studio
    Download and install LM Studio from https://lmstudio.ai.

  2. Add the Model

    • Open LM Studio.
    • Click "Add Model" or "Load Local Model".
    • Select the downloaded model.Q4_K_M.gguf file.
  3. Configure Model Settings

    • Choose appropriate context length (e.g., 2048 tokens).
    • Enable GPU acceleration if available for faster inference.
    • Adjust any sampling parameters (temperature, top-p) as needed.
  4. Start Chatting

    • Open a new chat session.
    • Interact with the model for mental health conversations or research purposes.

Notes

  • This model is not a substitute for professional mental health care.
  • Use responsibly and ensure privacy when handling sensitive conversations.
  • Compatible with LM Studio version 1.9 and above.
Downloads last month
34
GGUF
Model size
8B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

4-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Kush26/Mental_Health_ChatBot

Quantized
(660)
this model

Dataset used to train Kush26/Mental_Health_ChatBot

Space using Kush26/Mental_Health_ChatBot 1