microsoft/crd3
Updated • 193 • 28
How to use cvnberk/DialoGPT-small-crd3 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="cvnberk/DialoGPT-small-crd3") # Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("cvnberk/DialoGPT-small-crd3")
model = AutoModelForCausalLM.from_pretrained("cvnberk/DialoGPT-small-crd3")How to use cvnberk/DialoGPT-small-crd3 with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "cvnberk/DialoGPT-small-crd3"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "cvnberk/DialoGPT-small-crd3",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker model run hf.co/cvnberk/DialoGPT-small-crd3
How to use cvnberk/DialoGPT-small-crd3 with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "cvnberk/DialoGPT-small-crd3" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "cvnberk/DialoGPT-small-crd3",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "cvnberk/DialoGPT-small-crd3" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "cvnberk/DialoGPT-small-crd3",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'How to use cvnberk/DialoGPT-small-crd3 with Docker Model Runner:
docker model run hf.co/cvnberk/DialoGPT-small-crd3
This model is a fine-tuned version of microsoft/DialoGPT-small on the crd3 dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 2.7219 | 0.41 | 500 | 2.5055 |
| 2.1998 | 0.82 | 1000 | 2.1320 |
| 2.0375 | 1.23 | 1500 | 2.0455 |
| 1.9719 | 1.64 | 2000 | 2.0024 |
| 1.898 | 2.05 | 2500 | 1.9783 |
| 1.8977 | 2.46 | 3000 | 1.9598 |
| 1.8336 | 2.87 | 3500 | 1.9467 |
| 1.8316 | 3.28 | 4000 | 1.9379 |
| 1.7895 | 3.69 | 4500 | 1.9316 |
| 1.777 | 4.11 | 5000 | 1.9267 |
| 1.7648 | 4.52 | 5500 | 1.9205 |
| 1.7478 | 4.93 | 6000 | 1.9181 |
| 1.752 | 5.34 | 6500 | 1.9170 |
| 1.7284 | 5.75 | 7000 | 1.9158 |
| 1.7254 | 6.16 | 7500 | 1.9158 |
| 1.7272 | 6.57 | 8000 | 1.9152 |
| 1.7252 | 6.98 | 8500 | 1.9150 |
Base model
microsoft/DialoGPT-small