Instructions to use Ba2han/model-sft-q2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use Ba2han/model-sft-q2 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Ba2han/model-sft-q2", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Ba2han/model-sft-q2", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("Ba2han/model-sft-q2", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use Ba2han/model-sft-q2 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Ba2han/model-sft-q2"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Ba2han/model-sft-q2",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/Ba2han/model-sft-q2

SGLang

How to use Ba2han/model-sft-q2 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "Ba2han/model-sft-q2" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Ba2han/model-sft-q2",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "Ba2han/model-sft-q2" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "Ba2han/model-sft-q2",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Unsloth Studio

How to use Ba2han/model-sft-q2 with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for Ba2han/model-sft-q2 to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for Ba2han/model-sft-q2 to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for Ba2han/model-sft-q2 to start chatting

Load model with FastModel

pip install unsloth
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
    model_name="Ba2han/model-sft-q2",
    max_seq_length=2048,
)

Docker Model Runner
How to use Ba2han/model-sft-q2 with Docker Model Runner:
```
docker model run hf.co/Ba2han/model-sft-q2
```

Ba2han commited on 2 days ago

Commit

0aeef93

verified ·

1 Parent(s): ba722f8

Training in progress, step 503, checkpoint

Browse files

Files changed (4) hide show

last-checkpoint/model.safetensors +1 -1
last-checkpoint/optimizer.pt +1 -1
last-checkpoint/scheduler.pt +1 -1
last-checkpoint/trainer_state.json +363 -4

last-checkpoint/model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:622ecd7be560bdbf226ecf8fd61144b01025d68163856053e09ea432de0b54f2
 size 1049614696

 version https://git-lfs.github.com/spec/v1
+oid sha256:9a8012a7e4ebbcf674e7c053aab2984371f5b66d18160874ef76e47df13d7d10
 size 1049614696

last-checkpoint/optimizer.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:66eaacd2f49b512e2c61c4f12a4643bc826503e2993600f1ac775b0387e29469
 size 1372902609

 version https://git-lfs.github.com/spec/v1
+oid sha256:3767a23772d915d4f0d08aecbd8ba1007c2450b7f4264217c6dc062863bcbb62
 size 1372902609

last-checkpoint/scheduler.pt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3642db2f13fe0c8bf25dc88e523139fe4e7db636a1b7146241a3216eeebaf086
 size 1465

 version https://git-lfs.github.com/spec/v1
+oid sha256:abb0ed761d9206a280c8415a8884f5ff0e1548d6a1c2cc83a21b80bc00311e4e
 size 1465

last-checkpoint/trainer_state.json CHANGED Viewed

@@ -2,9 +2,9 @@
   "best_global_step": null,
   "best_metric": null,
   "best_model_checkpoint": null,
-  "epoch": 1.0049751243781095,
   "eval_steps": 76,
-  "global_step": 404,
   "is_hyper_param_search": false,
   "is_local_process_zero": true,
   "is_world_process_zero": true,
@@ -1462,6 +1462,365 @@
       "learning_rate": 4.4928312680573064e-05,
       "loss": 1.4603819847106934,
       "step": 404
     }
   ],
   "logging_steps": 2,
@@ -1476,12 +1835,12 @@
         "should_evaluate": false,
         "should_log": false,
         "should_save": true,
-        "should_training_stop": false
       },
       "attributes": {}
     }
   },
-  "total_flos": 9585726492508160.0,
   "train_batch_size": 4,
   "trial_name": null,
   "trial_params": null

   "best_global_step": null,
   "best_metric": null,
   "best_model_checkpoint": null,
+  "epoch": 1.2512437810945274,
   "eval_steps": 76,
+  "global_step": 503,
   "is_hyper_param_search": false,
   "is_local_process_zero": true,
   "is_world_process_zero": true,
       "learning_rate": 4.4928312680573064e-05,
       "loss": 1.4603819847106934,
       "step": 404
+    },
+    {
+      "epoch": 1.0099502487562189,
+      "grad_norm": 0.7265625,
+      "learning_rate": 4.415111107797445e-05,
+      "loss": 1.4816609621047974,
+      "step": 406
+    },
+    {
+      "epoch": 1.0149253731343284,
+      "grad_norm": 0.6640625,
+      "learning_rate": 4.332629679574566e-05,
+      "loss": 1.4633798599243164,
+      "step": 408
+    },
+    {
+      "epoch": 1.0199004975124377,
+      "grad_norm": 0.75390625,
+      "learning_rate": 4.245592045215182e-05,
+      "loss": 1.4657684564590454,
+      "step": 410
+    },
+    {
+      "epoch": 1.0248756218905473,
+      "grad_norm": 0.59765625,
+      "learning_rate": 4.154214593992149e-05,
+      "loss": 1.4662880897521973,
+      "step": 412
+    },
+    {
+      "epoch": 1.0298507462686568,
+      "grad_norm": 0.8125,
+      "learning_rate": 4.058724504646834e-05,
+      "loss": 1.4156984090805054,
+      "step": 414
+    },
+    {
+      "epoch": 1.0348258706467661,
+      "grad_norm": 0.73046875,
+      "learning_rate": 3.959359180586975e-05,
+      "loss": 1.4586745500564575,
+      "step": 416
+    },
+    {
+      "epoch": 1.0398009950248757,
+      "grad_norm": 0.58984375,
+      "learning_rate": 3.856365659664399e-05,
+      "loss": 1.4008747339248657,
+      "step": 418
+    },
+    {
+      "epoch": 1.044776119402985,
+      "grad_norm": 0.67578125,
+      "learning_rate": 3.7500000000000003e-05,
+      "loss": 1.5016967058181763,
+      "step": 420
+    },
+    {
+      "epoch": 1.0497512437810945,
+      "grad_norm": 0.6875,
+      "learning_rate": 3.6405266433829075e-05,
+      "loss": 1.420767903327942,
+      "step": 422
+    },
+    {
+      "epoch": 1.054726368159204,
+      "grad_norm": 0.796875,
+      "learning_rate": 3.5282177578265296e-05,
+      "loss": 1.4615557193756104,
+      "step": 424
+    },
+    {
+      "epoch": 1.0597014925373134,
+      "grad_norm": 0.6875,
+      "learning_rate": 3.413352560915988e-05,
+      "loss": 1.426128625869751,
+      "step": 426
+    },
+    {
+      "epoch": 1.064676616915423,
+      "grad_norm": 0.84765625,
+      "learning_rate": 3.2962166256292113e-05,
+      "loss": 1.4995248317718506,
+      "step": 428
+    },
+    {
+      "epoch": 1.0696517412935322,
+      "grad_norm": 0.73046875,
+      "learning_rate": 3.177101170357513e-05,
+      "loss": 1.544938325881958,
+      "step": 430
+    },
+    {
+      "epoch": 1.0746268656716418,
+      "grad_norm": 0.68359375,
+      "learning_rate": 3.056302334890786e-05,
+      "loss": 1.5545825958251953,
+      "step": 432
+    },
+    {
+      "epoch": 1.0796019900497513,
+      "grad_norm": 0.6875,
+      "learning_rate": 2.9341204441673266e-05,
+      "loss": 1.4614660739898682,
+      "step": 434
+    },
+    {
+      "epoch": 1.0845771144278606,
+      "grad_norm": 0.765625,
+      "learning_rate": 2.8108592616187133e-05,
+      "loss": 1.4706202745437622,
+      "step": 436
+    },
+    {
+      "epoch": 1.0895522388059702,
+      "grad_norm": 0.66796875,
+      "learning_rate": 2.686825233966061e-05,
+      "loss": 1.4637281894683838,
+      "step": 438
+    },
+    {
+      "epoch": 1.0945273631840795,
+      "grad_norm": 0.84375,
+      "learning_rate": 2.5623267293451826e-05,
+      "loss": 1.4879995584487915,
+      "step": 440
+    },
+    {
+      "epoch": 1.099502487562189,
+      "grad_norm": 0.75390625,
+      "learning_rate": 2.4376732706548183e-05,
+      "loss": 1.5506470203399658,
+      "step": 442
+    },
+    {
+      "epoch": 1.1044776119402986,
+      "grad_norm": 0.73828125,
+      "learning_rate": 2.3131747660339394e-05,
+      "loss": 1.5075286626815796,
+      "step": 444
+    },
+    {
+      "epoch": 1.109452736318408,
+      "grad_norm": 0.9453125,
+      "learning_rate": 2.189140738381288e-05,
+      "loss": 1.4878989458084106,
+      "step": 446
+    },
+    {
+      "epoch": 1.1144278606965174,
+      "grad_norm": 0.7421875,
+      "learning_rate": 2.0658795558326743e-05,
+      "loss": 1.4383412599563599,
+      "step": 448
+    },
+    {
+      "epoch": 1.1194029850746268,
+      "grad_norm": 0.7265625,
+      "learning_rate": 1.9436976651092144e-05,
+      "loss": 1.5081899166107178,
+      "step": 450
+    },
+    {
+      "epoch": 1.1243781094527363,
+      "grad_norm": 0.8203125,
+      "learning_rate": 1.8228988296424877e-05,
+      "loss": 1.497464656829834,
+      "step": 452
+    },
+    {
+      "epoch": 1.1293532338308458,
+      "grad_norm": 0.80078125,
+      "learning_rate": 1.7037833743707892e-05,
+      "loss": 1.4927465915679932,
+      "step": 454
+    },
+    {
+      "epoch": 1.1343283582089552,
+      "grad_norm": 0.7109375,
+      "learning_rate": 1.5866474390840125e-05,
+      "loss": 1.4225599765777588,
+      "step": 456
+    },
+    {
+      "epoch": 1.1343283582089552,
+      "eval_loss": 1.460990071296692,
+      "eval_runtime": 1.4591,
+      "eval_samples_per_second": 89.097,
+      "eval_steps_per_second": 11.651,
+      "step": 456
+    },
+    {
+      "epoch": 1.1393034825870647,
+      "grad_norm": 0.7109375,
+      "learning_rate": 1.4717822421734718e-05,
+      "loss": 1.3992186784744263,
+      "step": 458
+    },
+    {
+      "epoch": 1.144278606965174,
+      "grad_norm": 0.703125,
+      "learning_rate": 1.3594733566170926e-05,
+      "loss": 1.5498771667480469,
+      "step": 460
+    },
+    {
+      "epoch": 1.1492537313432836,
+      "grad_norm": 0.64453125,
+      "learning_rate": 1.2500000000000006e-05,
+      "loss": 1.4473354816436768,
+      "step": 462
+    },
+    {
+      "epoch": 1.154228855721393,
+      "grad_norm": 0.7578125,
+      "learning_rate": 1.1436343403356017e-05,
+      "loss": 1.4893980026245117,
+      "step": 464
+    },
+    {
+      "epoch": 1.1592039800995024,
+      "grad_norm": 0.75390625,
+      "learning_rate": 1.0406408194130259e-05,
+      "loss": 1.4563506841659546,
+      "step": 466
+    },
+    {
+      "epoch": 1.164179104477612,
+      "grad_norm": 0.70703125,
+      "learning_rate": 9.412754953531663e-06,
+      "loss": 1.4524166584014893,
+      "step": 468
+    },
+    {
+      "epoch": 1.1691542288557213,
+      "grad_norm": 0.7265625,
+      "learning_rate": 8.45785406007852e-06,
+      "loss": 1.4781806468963623,
+      "step": 470
+    },
+    {
+      "epoch": 1.1741293532338308,
+      "grad_norm": 0.69140625,
+      "learning_rate": 7.5440795478481815e-06,
+      "loss": 1.5440560579299927,
+      "step": 472
+    },
+    {
+      "epoch": 1.1791044776119404,
+      "grad_norm": 0.83203125,
+      "learning_rate": 6.673703204254347e-06,
+      "loss": 1.4499876499176025,
+      "step": 474
+    },
+    {
+      "epoch": 1.1840796019900497,
+      "grad_norm": 0.69140625,
+      "learning_rate": 5.848888922025553e-06,
+      "loss": 1.5100181102752686,
+      "step": 476
+    },
+    {
+      "epoch": 1.1890547263681592,
+      "grad_norm": 0.765625,
+      "learning_rate": 5.071687319426946e-06,
+      "loss": 1.4939875602722168,
+      "step": 478
+    },
+    {
+      "epoch": 1.1940298507462686,
+      "grad_norm": 0.6640625,
+      "learning_rate": 4.344030642100133e-06,
+      "loss": 1.5198827981948853,
+      "step": 480
+    },
+    {
+      "epoch": 1.199004975124378,
+      "grad_norm": 0.66015625,
+      "learning_rate": 3.66772795919611e-06,
+      "loss": 1.490272879600525,
+      "step": 482
+    },
+    {
+      "epoch": 1.2039800995024876,
+      "grad_norm": 0.765625,
+      "learning_rate": 3.044460665744284e-06,
+      "loss": 1.5192978382110596,
+      "step": 484
+    },
+    {
+      "epoch": 1.208955223880597,
+      "grad_norm": 0.73046875,
+      "learning_rate": 2.475778302439524e-06,
+      "loss": 1.4486744403839111,
+      "step": 486
+    },
+    {
+      "epoch": 1.2139303482587065,
+      "grad_norm": 0.6796875,
+      "learning_rate": 1.9630947032398067e-06,
+      "loss": 1.4685592651367188,
+      "step": 488
+    },
+    {
+      "epoch": 1.2189054726368158,
+      "grad_norm": 0.71875,
+      "learning_rate": 1.5076844803522922e-06,
+      "loss": 1.3960230350494385,
+      "step": 490
+    },
+    {
+      "epoch": 1.2238805970149254,
+      "grad_norm": 0.5859375,
+      "learning_rate": 1.1106798553464804e-06,
+      "loss": 1.4277310371398926,
+      "step": 492
+    },
+    {
+      "epoch": 1.228855721393035,
+      "grad_norm": 0.66015625,
+      "learning_rate": 7.730678442730538e-07,
+      "loss": 1.5258288383483887,
+      "step": 494
+    },
+    {
+      "epoch": 1.2338308457711442,
+      "grad_norm": 0.80859375,
+      "learning_rate": 4.956878037864043e-07,
+      "loss": 1.5295201539993286,
+      "step": 496
+    },
+    {
+      "epoch": 1.2388059701492538,
+      "grad_norm": 0.7734375,
+      "learning_rate": 2.7922934437178695e-07,
+      "loss": 1.461329698562622,
+      "step": 498
+    },
+    {
+      "epoch": 1.243781094527363,
+      "grad_norm": 0.72265625,
+      "learning_rate": 1.2423061586496477e-07,
+      "loss": 1.4992496967315674,
+      "step": 500
+    },
+    {
+      "epoch": 1.2487562189054726,
+      "grad_norm": 0.78515625,
+      "learning_rate": 3.107696952694139e-08,
+      "loss": 1.4631916284561157,
+      "step": 502
+    },
+    {
+      "epoch": 1.2512437810945274,
+      "eval_loss": 1.4604582786560059,
+      "eval_runtime": 1.4388,
+      "eval_samples_per_second": 90.352,
+      "eval_steps_per_second": 11.815,
+      "step": 503
     }
   ],
   "logging_steps": 2,
         "should_evaluate": false,
         "should_log": false,
         "should_save": true,
+        "should_training_stop": true
       },
       "attributes": {}
     }
   },
+  "total_flos": 1.1908427380948992e+16,
   "train_batch_size": 4,
   "trial_name": null,
   "trial_params": null