How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf QuantFactory/ClinicalGPT-base-zh-GGUF:
# Run inference directly in the terminal:
llama-cli -hf QuantFactory/ClinicalGPT-base-zh-GGUF:
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf QuantFactory/ClinicalGPT-base-zh-GGUF:
# Run inference directly in the terminal:
llama-cli -hf QuantFactory/ClinicalGPT-base-zh-GGUF:
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf QuantFactory/ClinicalGPT-base-zh-GGUF:
# Run inference directly in the terminal:
./llama-cli -hf QuantFactory/ClinicalGPT-base-zh-GGUF:
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf QuantFactory/ClinicalGPT-base-zh-GGUF:
# Run inference directly in the terminal:
./build/bin/llama-cli -hf QuantFactory/ClinicalGPT-base-zh-GGUF:
Use Docker
docker model run hf.co/QuantFactory/ClinicalGPT-base-zh-GGUF:
Quick Links

QuantFactory/ClinicalGPT-base-zh-GGUF

This is quantized version of medicalai/ClinicalGPT-base-zh created using llama.cpp

Original Model Card

ClinicalGPT

This model card introduces ClinicalGPT model, a large language model designed and optimized for clinical scenarios. ClinicalGPT is fine-tuned on extensive and diverse medical datasets, including medical records, domain-specific knowledge, and multi-round dialogue consultations. The model is undergoing ongoing and continuous updates.

Model Fine-tuning

We set the learning rate to 5e-5, with a batch size of 128 and a maximum length of 1,024, training across 3 epochs.

How to use the model

Load the model via the transformers library:

from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("medicalai/ClinicalGPT-base-zh")
model = AutoModelForCausalLM.from_pretrained("medicalai/ClinicalGPT-base-zh")

Limitations

The project is intended for research purposes only and restricted from commercial or clinical use. The generated content by the model is subject to factors such as model computations, randomness, misinterpretation, and biases, and this project cannot guarantee its accuracy. This project assumes no legal liability for any content produced by the model. Users are advised to exercise caution and independently verify the generated results.

Downloads last month
41
GGUF
Model size
8B params
Architecture
bloom
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support