Instructions to use lewtun/dummy-model with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use lewtun/dummy-model with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="lewtun/dummy-model")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("lewtun/dummy-model")
model = AutoModelForCausalLM.from_pretrained("lewtun/dummy-model", device_map="auto")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use lewtun/dummy-model with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "lewtun/dummy-model"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "lewtun/dummy-model",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/lewtun/dummy-model

SGLang

How to use lewtun/dummy-model with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "lewtun/dummy-model" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "lewtun/dummy-model",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "lewtun/dummy-model" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "lewtun/dummy-model",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use lewtun/dummy-model with Docker Model Runner:
```
docker model run hf.co/lewtun/dummy-model
```

lewtun HF Staff commited on Feb 21, 2024

Commit

fb6eba0

verified ·

1 Parent(s): 8704522

Model save

Browse files

Files changed (4) hide show

all_results.json +3 -3
train_results.json +3 -3
trainer_state.json +7 -7
training_args.bin +2 -2

all_results.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
     "epoch": 0.0,
     "train_loss": 0.6931471824645996,
-    "train_runtime": 22.2221,
-    "train_samples_per_second": 2.88,
-    "train_steps_per_second": 0.045
 }

 {
     "epoch": 0.0,
     "train_loss": 0.6931471824645996,
+    "train_runtime": 5.3639,
+    "train_samples_per_second": 11.932,
+    "train_steps_per_second": 0.186
 }

train_results.json CHANGED Viewed

@@ -1,7 +1,7 @@
 {
     "epoch": 0.0,
     "train_loss": 0.6931471824645996,
-    "train_runtime": 22.2221,
-    "train_samples_per_second": 2.88,
-    "train_steps_per_second": 0.045
 }

 {
     "epoch": 0.0,
     "train_loss": 0.6931471824645996,
+    "train_runtime": 5.3639,
+    "train_samples_per_second": 11.932,
+    "train_steps_per_second": 0.186
 }

trainer_state.json CHANGED Viewed

@@ -11,10 +11,10 @@
     {
       "epoch": 0.0,
       "learning_rate": 0.0,
-      "logits/generated": -1.3129560947418213,
-      "logits/real": -0.6997354626655579,
-      "logps/generated": -609.5880126953125,
-      "logps/real": -542.52783203125,
       "loss": 0.6931,
       "rewards/accuracies": 0.0,
       "rewards/generated": 0.0,
@@ -27,9 +27,9 @@
       "step": 1,
       "total_flos": 0.0,
       "train_loss": 0.6931471824645996,
-      "train_runtime": 22.2221,
-      "train_samples_per_second": 2.88,
-      "train_steps_per_second": 0.045
     }
   ],
   "logging_steps": 10,

     {
       "epoch": 0.0,
       "learning_rate": 0.0,
+      "logits/generated": -1.3131200075149536,
+      "logits/real": -0.6672423481941223,
+      "logps/generated": -604.0491943359375,
+      "logps/real": -541.8375854492188,
       "loss": 0.6931,
       "rewards/accuracies": 0.0,
       "rewards/generated": 0.0,
       "step": 1,
       "total_flos": 0.0,
       "train_loss": 0.6931471824645996,
+      "train_runtime": 5.3639,
+      "train_samples_per_second": 11.932,
+      "train_steps_per_second": 0.186
     }
   ],
   "logging_steps": 10,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b76d577a3000f391b9d71fe4f8351365647d08ebe72ece646311115d02467530
-size 5944

 version https://git-lfs.github.com/spec/v1
+oid sha256:6d8b6e54292bea39dbb8cbd18ed891032caf6bd8faede14faf65a67dbf8a6418
+size 4792