Instructions to use DevHunterAI/cad-lora-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use DevHunterAI/cad-lora-v1 with PEFT:

from peft import PeftModel
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-1.5B-Instruct")
model = PeftModel.from_pretrained(base_model, "DevHunterAI/cad-lora-v1")

Transformers

How to use DevHunterAI/cad-lora-v1 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="DevHunterAI/cad-lora-v1")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("DevHunterAI/cad-lora-v1", dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use DevHunterAI/cad-lora-v1 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "DevHunterAI/cad-lora-v1"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "DevHunterAI/cad-lora-v1",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/DevHunterAI/cad-lora-v1

SGLang

How to use DevHunterAI/cad-lora-v1 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "DevHunterAI/cad-lora-v1" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "DevHunterAI/cad-lora-v1",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "DevHunterAI/cad-lora-v1" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "DevHunterAI/cad-lora-v1",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use DevHunterAI/cad-lora-v1 with Docker Model Runner:
```
docker model run hf.co/DevHunterAI/cad-lora-v1
```

DevHunterAI commited on Feb 21

Commit

28eacbe

verified ·

1 Parent(s): ef36934

CAD LoRA adapter upload

Browse files

Files changed (12) hide show

adapter_config.json +1 -1
adapter_model.safetensors +1 -1
checkpoint-4/adapter_config.json +1 -1
checkpoint-4/adapter_model.safetensors +1 -1
checkpoint-4/optimizer.pt +1 -1
checkpoint-4/rng_state.pth +1 -1
checkpoint-4/trainer_state.json +1 -1
checkpoint-6/adapter_config.json +1 -1
checkpoint-6/adapter_model.safetensors +1 -1
checkpoint-6/optimizer.pt +1 -1
checkpoint-6/rng_state.pth +1 -1
checkpoint-6/trainer_state.json +7 -7

adapter_config.json CHANGED Viewed

@@ -29,8 +29,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
     "up_proj",
     "q_proj"
   ],
   "target_parameters": null,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "up_proj",
+    "v_proj",
     "q_proj"
   ],
   "target_parameters": null,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ba8d614b094e235cd267370ad57bab6b1526c0561f009a1533486986b9082e45
 size 27547336

 version https://git-lfs.github.com/spec/v1
+oid sha256:dc2cc7257ebd4cd73b1c7374f391dec191015c425a9ceb4f9259c8a8c281de02
 size 27547336

checkpoint-4/adapter_config.json CHANGED Viewed

@@ -29,8 +29,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
     "up_proj",
     "q_proj"
   ],
   "target_parameters": null,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "up_proj",
+    "v_proj",
     "q_proj"
   ],
   "target_parameters": null,

checkpoint-4/adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bf58dbe200afff2dec926b3d5d2456143b945dd6eda66cd49ef02d4923c40084
 size 27547336

 version https://git-lfs.github.com/spec/v1
+oid sha256:0068bbb55640de2e6a5b9788ef78a9c2038a60b87b2fc8904c9796b39f20580b
 size 27547336

checkpoint-4/optimizer.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:597b6281a83fc0faf04a230b279297c1649f7f87e6e6b631a8901222b5e90a4c
 size 55191482

 version https://git-lfs.github.com/spec/v1
+oid sha256:c7ee1202bdb7b019ea0256bc1aee85ac3b1eaea14b844e1abfb9813b37ad6cbb
 size 55191482

checkpoint-4/rng_state.pth CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e9837c2e4d58d01133e4668cffaa8ef6741d1ecca79e127b539e4997cd03de3e
 size 14244

 version https://git-lfs.github.com/spec/v1
+oid sha256:f0c40c4a81f200aad10927b2fb78e549faf5e14be0b5d66b8ffdc58cd6eaa7bf
 size 14244

checkpoint-4/trainer_state.json CHANGED Viewed

@@ -26,7 +26,7 @@
       "attributes": {}
     }
   },
-  "total_flos": 137739258384384.0,
   "train_batch_size": 1,
   "trial_name": null,
   "trial_params": null

       "attributes": {}
     }
   },
+  "total_flos": 145326456459264.0,
   "train_batch_size": 1,
   "trial_name": null,
   "trial_params": null

checkpoint-6/adapter_config.json CHANGED Viewed

@@ -29,8 +29,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
     "up_proj",
     "q_proj"
   ],
   "target_parameters": null,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "up_proj",
+    "v_proj",
     "q_proj"
   ],
   "target_parameters": null,

checkpoint-6/adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ba8d614b094e235cd267370ad57bab6b1526c0561f009a1533486986b9082e45
 size 27547336

 version https://git-lfs.github.com/spec/v1
+oid sha256:dc2cc7257ebd4cd73b1c7374f391dec191015c425a9ceb4f9259c8a8c281de02
 size 27547336

checkpoint-6/optimizer.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e5c4e33f62121c6d9da18ed4c0e1e6f6fb48e91ec456d1c0d139ef61860ecf7f
 size 55191482

 version https://git-lfs.github.com/spec/v1
+oid sha256:cb00c6a6c62ff7d7a6ec09215d12e3313b68eb6f4cb014a8d3cf884e8e11cdd2
 size 55191482

checkpoint-6/rng_state.pth CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:60a8fb80ac1bf80fef94772d1bdc9568443800de852553425ea39c224fe280a6
 size 14244

 version https://git-lfs.github.com/spec/v1
+oid sha256:64123b336e44c6cf78b63f66127ecea6f20a681e01f9223d7d4f5ef02fd154ce
 size 14244

checkpoint-6/trainer_state.json CHANGED Viewed

@@ -10,13 +10,13 @@
   "is_world_process_zero": true,
   "log_history": [
     {
-      "entropy": 0.8762538245430699,
-      "epoch": 2.8421052631578947,
-      "grad_norm": 0.6482176184654236,
       "learning_rate": 6.909830056250527e-05,
-      "loss": 1.3301252365112304,
-      "mean_token_accuracy": 0.7302898124412254,
-      "num_tokens": 24767.0,
       "step": 5
     }
   ],
@@ -37,7 +37,7 @@
       "attributes": {}
     }
   },
-  "total_flos": 206608887576576.0,
   "train_batch_size": 1,
   "trial_name": null,
   "trial_params": null

   "is_world_process_zero": true,
   "log_history": [
     {
+      "entropy": 0.8723009686384883,
+      "epoch": 2.8,
+      "grad_norm": 0.6077917814254761,
       "learning_rate": 6.909830056250527e-05,
+      "loss": 1.339169979095459,
+      "mean_token_accuracy": 0.7311233420457158,
+      "num_tokens": 25846.0,
       "step": 5
     }
   ],
       "attributes": {}
     }
   },
+  "total_flos": 217989684688896.0,
   "train_batch_size": 1,
   "trial_name": null,
   "trial_params": null