davidkim205/kollm-converations
Viewer β’ Updated β’ 1.12M β’ 177 β’ 30
How to use cockroach54/solar-sft-qlora with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="cockroach54/solar-sft-qlora") # Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("cockroach54/solar-sft-qlora")
model = AutoModelForCausalLM.from_pretrained("cockroach54/solar-sft-qlora")How to use cockroach54/solar-sft-qlora with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "cockroach54/solar-sft-qlora"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "cockroach54/solar-sft-qlora",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker model run hf.co/cockroach54/solar-sft-qlora
How to use cockroach54/solar-sft-qlora with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "cockroach54/solar-sft-qlora" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "cockroach54/solar-sft-qlora",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "cockroach54/solar-sft-qlora" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "cockroach54/solar-sft-qlora",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'How to use cockroach54/solar-sft-qlora with Docker Model Runner:
docker model run hf.co/cockroach54/solar-sft-qlora
| Tasks | Version | Filter | n-shot | Metric | Value | Stderr | |
|---|---|---|---|---|---|---|---|
| kobest_wic | 1 | none | 0 | acc | 0.5056 | Β± | 0.0141 |
| none | 0 | f1 | 0.4170 | Β± | N/A | ||
| kobest_sentineg | 1 | none | 0 | acc | 0.8791 | Β± | 0.0164 |
| none | 0 | f1 | 0.8779 | Β± | N/A | ||
| kobest_hellaswag | 1 | none | 0 | acc | 0.4540 | Β± | 0.0223 |
| none | 0 | f1 | 0.4522 | Β± | N/A | ||
| none | 0 | acc_norm | 0.5600 | Β± | 0.0222 | ||
| kobest_copa | 1 | none | 0 | acc | 0.6540 | Β± | 0.0151 |
| none | 0 | f1 | 0.6535 | Β± | N/A | ||
| kobest_boolq | 1 | none | 0 | acc | 0.5135 | Β± | 0.0133 |
| none | 0 | f1 | 0.3592 | Β± | N/A | ||
| ko_truthfulqa | 2 | none | 0 | acc | 0.2938 | Β± | 0.0159 |
| ko_hellaswag | 1 | none | 0 | acc | 0.3816 | Β± | 0.0048 |
| none | 0 | acc_norm | 0.5171 | Β± | 0.0050 | ||
| ko_common_gen | 1 | none | 0 | acc | 0.8095 | Β± | 0.0100 |
| none | 0 | acc_norm | 0.8095 | Β± | 0.0100 | ||
| ko_arc_easy | 1 | none | 0 | acc | 0.3311 | Β± | 0.0138 |
| none | 0 | acc_norm | 0.4053 | Β± | 0.0143 |