Ba2han committed 12 days ago

Commit 2b0502a (verified) · 1 Parent(s): 348e7b0

Training in progress, step 100, checkpoint

Files changed (5)

last-checkpoint/model.safetensors +1 -1
last-checkpoint/optimizer.pt +1 -1
last-checkpoint/rng_state.pth +1 -1
last-checkpoint/scheduler.pt +1 -1
last-checkpoint/trainer_state.json +361 -3

last-checkpoint/model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:df7f3da2bd850f33f7298372ff869be8f81c2b3405227fe6c9bd7c6f2f71a131
+oid sha256:aa0366c83d309f7521675441df462578bea7f619437923da130a73f4b3e1ef98
 size 1008303016

last-checkpoint/optimizer.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:dc159d75d63a73fbd137e19fda0856ce4e3720ab62cac1e9b38b17fedbdf4bf8
+oid sha256:44564ad6d459ee8b6824ecc63fbcda48f4afb20f4f7137194943df892c91515f
 size 1086712487

last-checkpoint/rng_state.pth CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7c800b778fa7e115e4c34de8529902de8b61c9a1b4bab3eb8295d06dafff030e
+oid sha256:9efd33af97ed562c15fc83318701d580bcf56272c251b44d09ee6d97b4cc32c1
 size 14645

last-checkpoint/scheduler.pt CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4c3b6e6139f923a24202d50bcbca49b224309c9139ea19d316d6f4729bb3d183
+oid sha256:ccc24e81b738d82c9183e6957980e5f32ed351a8d1b2f1f22e0d6adc4bee1861
 size 1465
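
The four binary checkpoint files in this commit are stored as Git LFS pointers, and in each one only the `oid sha256:` line changes while `size` stays the same. As a minimal sketch of how such a pointer relates to the file it stands for, the snippet below hashes a byte stream and prints a pointer in the same three-line layout. The function name `lfs_pointer` and the chunk size are illustrative assumptions, not part of Git LFS or of this repository.

```python
import hashlib
import io


def lfs_pointer(stream, chunk_size=1 << 20):
    """Build Git LFS pointer text for a binary stream (hypothetical helper).

    The stream is read in chunks so that large checkpoint files do not
    have to fit in memory at once.
    """
    digest = hashlib.sha256()
    size = 0
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        digest.update(chunk)
        size += len(chunk)
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{digest.hexdigest()}\n"
        f"size {size}\n"
    )


# Usage with an in-memory stream; a real file would be opened with open(path, "rb").
print(lfs_pointer(io.BytesIO(b"example bytes")), end="")
```

Comparing the printed `oid` against the pointer stored in the repository is one way to check that a downloaded checkpoint file matches the committed version.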

last-checkpoint/trainer_state.json CHANGED

@@ -2,9 +2,9 @@
   "best_global_step": null,
   "best_metric": null,
   "best_model_checkpoint": null,
-  "epoch": 0.4280936454849498,
+  "epoch": 0.8561872909698997,
   "eval_steps": 50,
-  "global_step": 50,
+  "global_step": 100,
   "is_hyper_param_search": false,
   "is_local_process_zero": true,
   "is_world_process_zero": true,
@@ -366,6 +366,364 @@
       "eval_samples_per_second": 11.856,
       "eval_steps_per_second": 2.969,
       "step": 50
+    },
+    {
+      "epoch": 0.43665551839464883,
+      "grad_norm": 0.490234375,
+      "learning_rate": 0.02154667547631338,
+      "loss": 6.548553466796875,
+      "step": 51
+    },
+    {
+      "epoch": 0.44521739130434784,
+      "grad_norm": 0.4296875,
+      "learning_rate": 0.02140867714223579,
+      "loss": 6.469072341918945,
+      "step": 52
+    },
+    {
+      "epoch": 0.4537792642140468,
+      "grad_norm": 0.369140625,
+      "learning_rate": 0.021268436096329016,
+      "loss": 6.5030717849731445,
+      "step": 53
+    },
+    {
+      "epoch": 0.4623411371237458,
+      "grad_norm": 0.3984375,
+      "learning_rate": 0.0211259876435264,
+      "loss": 6.464090347290039,
+      "step": 54
+    },
+    {
+      "epoch": 0.4709030100334448,
+      "grad_norm": 0.4140625,
+      "learning_rate": 0.020981367644464153,
+      "loss": 6.415444374084473,
+      "step": 55
+    },
+    {
+      "epoch": 0.4794648829431438,
+      "grad_norm": 0.400390625,
+      "learning_rate": 0.020834612506453645,
+      "loss": 6.36262845993042,
+      "step": 56
+    },
+    {
+      "epoch": 0.48802675585284283,
+      "grad_norm": 0.421875,
+      "learning_rate": 0.020685759174316067,
+      "loss": 6.345418930053711,
+      "step": 57
+    },
+    {
+      "epoch": 0.4965886287625418,
+      "grad_norm": 0.333984375,
+      "learning_rate": 0.02053484512108174,
+      "loss": 6.309205532073975,
+      "step": 58
+    },
+    {
+      "epoch": 0.5051505016722408,
+      "grad_norm": 0.41015625,
+      "learning_rate": 0.020381908338556534,
+      "loss": 6.3184638023376465,
+      "step": 59
+    },
+    {
+      "epoch": 0.5137123745819397,
+      "grad_norm": 0.390625,
+      "learning_rate": 0.020226987327757566,
+      "loss": 6.273001670837402,
+      "step": 60
+    },
+    {
+      "epoch": 0.5222742474916388,
+      "grad_norm": 0.40234375,
+      "learning_rate": 0.020070121089220835,
+      "loss": 6.3029961585998535,
+      "step": 61
+    },
+    {
+      "epoch": 0.5308361204013378,
+      "grad_norm": 0.365234375,
+      "learning_rate": 0.01991134911318301,
+      "loss": 6.250678062438965,
+      "step": 62
+    },
+    {
+      "epoch": 0.5393979933110368,
+      "grad_norm": 0.359375,
+      "learning_rate": 0.01975071136963998,
+      "loss": 6.183889865875244,
+      "step": 63
+    },
+    {
+      "epoch": 0.5479598662207358,
+      "grad_norm": 0.3515625,
+      "learning_rate": 0.019588248298284636,
+      "loss": 6.137822151184082,
+      "step": 64
+    },
+    {
+      "epoch": 0.5565217391304348,
+      "grad_norm": 0.3203125,
+      "learning_rate": 0.01942400079832638,
+      "loss": 6.1106767654418945,
+      "step": 65
+    },
+    {
+      "epoch": 0.5650836120401338,
+      "grad_norm": 0.333984375,
+      "learning_rate": 0.01925801021819497,
+      "loss": 6.08446741104126,
+      "step": 66
+    },
+    {
+      "epoch": 0.5736454849498328,
+      "grad_norm": 0.302734375,
+      "learning_rate": 0.01909031834513128,
+      "loss": 6.131768226623535,
+      "step": 67
+    },
+    {
+      "epoch": 0.5822073578595318,
+      "grad_norm": 0.326171875,
+      "learning_rate": 0.018920967394667584,
+      "loss": 6.065019607543945,
+      "step": 68
+    },
+    {
+      "epoch": 0.5907692307692308,
+      "grad_norm": 0.3046875,
+      "learning_rate": 0.018750000000000003,
+      "loss": 6.0482282638549805,
+      "step": 69
+    },
+    {
+      "epoch": 0.5993311036789297,
+      "grad_norm": 0.357421875,
+      "learning_rate": 0.01857745920125586,
+      "loss": 6.048603057861328,
+      "step": 70
+    },
+    {
+      "epoch": 0.6078929765886287,
+      "grad_norm": 0.365234375,
+      "learning_rate": 0.018403388434658535,
+      "loss": 5.998836517333984,
+      "step": 71
+    },
+    {
+      "epoch": 0.6164548494983277,
+      "grad_norm": 0.3515625,
+      "learning_rate": 0.01822783152159263,
+      "loss": 5.9749650955200195,
+      "step": 72
+    },
+    {
+      "epoch": 0.6250167224080267,
+      "grad_norm": 0.30859375,
+      "learning_rate": 0.018050832657572177,
+      "loss": 5.919043064117432,
+      "step": 73
+    },
+    {
+      "epoch": 0.6335785953177258,
+      "grad_norm": 0.314453125,
+      "learning_rate": 0.017872436401114647,
+      "loss": 5.9278059005737305,
+      "step": 74
+    },
+    {
+      "epoch": 0.6421404682274248,
+      "grad_norm": 0.3046875,
+      "learning_rate": 0.017692687662523583,
+      "loss": 5.9064860343933105,
+      "step": 75
+    },
+    {
+      "epoch": 0.6507023411371238,
+      "grad_norm": 0.275390625,
+      "learning_rate": 0.01751163169258267,
+      "loss": 5.921048164367676,
+      "step": 76
+    },
+    {
+      "epoch": 0.6592642140468228,
+      "grad_norm": 0.28125,
+      "learning_rate": 0.017329314071164108,
+      "loss": 5.8522515296936035,
+      "step": 77
+    },
+    {
+      "epoch": 0.6678260869565218,
+      "grad_norm": 0.29296875,
+      "learning_rate": 0.017145780695754093,
+      "loss": 5.846250057220459,
+      "step": 78
+    },
+    {
+      "epoch": 0.6763879598662207,
+      "grad_norm": 0.37109375,
+      "learning_rate": 0.016961077769898397,
+      "loss": 5.8790435791015625,
+      "step": 79
+    },
+    {
+      "epoch": 0.6849498327759197,
+      "grad_norm": 0.30078125,
+      "learning_rate": 0.016775251791570862,
+      "loss": 5.849973201751709,
+      "step": 80
+    },
+    {
+      "epoch": 0.6935117056856187,
+      "grad_norm": 0.3125,
+      "learning_rate": 0.016588349541467772,
+      "loss": 5.8167405128479,
+      "step": 81
+    },
+    {
+      "epoch": 0.7020735785953177,
+      "grad_norm": 0.27734375,
+      "learning_rate": 0.016400418071231087,
+      "loss": 5.796406269073486,
+      "step": 82
+    },
+    {
+      "epoch": 0.7106354515050167,
+      "grad_norm": 0.30859375,
+      "learning_rate": 0.01621150469160344,
+      "loss": 5.828444004058838,
+      "step": 83
+    },
+    {
+      "epoch": 0.7191973244147157,
+      "grad_norm": 0.373046875,
+      "learning_rate": 0.016021656960517872,
+      "loss": 5.763159275054932,
+      "step": 84
+    },
+    {
+      "epoch": 0.7277591973244147,
+      "grad_norm": 0.3359375,
+      "learning_rate": 0.015830922671125437,
+      "loss": 5.761376857757568,
+      "step": 85
+    },
+    {
+      "epoch": 0.7363210702341138,
+      "grad_norm": 0.3515625,
+      "learning_rate": 0.015639349839763488,
+      "loss": 5.724917411804199,
+      "step": 86
+    },
+    {
+      "epoch": 0.7448829431438128,
+      "grad_norm": 0.283203125,
+      "learning_rate": 0.015446986693867843,
+      "loss": 5.740510940551758,
+      "step": 87
+    },
+    {
+      "epoch": 0.7534448160535117,
+      "grad_norm": 0.275390625,
+      "learning_rate": 0.015253881659831759,
+      "loss": 5.739223480224609,
+      "step": 88
+    },
+    {
+      "epoch": 0.7620066889632107,
+      "grad_norm": 0.2890625,
+      "learning_rate": 0.015060083350814886,
+      "loss": 5.749945163726807,
+      "step": 89
+    },
+    {
+      "epoch": 0.7705685618729097,
+      "grad_norm": 0.294921875,
+      "learning_rate": 0.014865640554505129,
+      "loss": 5.627425670623779,
+      "step": 90
+    },
+    {
+      "epoch": 0.7791304347826087,
+      "grad_norm": 0.287109375,
+      "learning_rate": 0.014670602220836632,
+      "loss": 5.6766133308410645,
+      "step": 91
+    },
+    {
+      "epoch": 0.7876923076923077,
+      "grad_norm": 0.263671875,
+      "learning_rate": 0.014475017449666875,
+      "loss": 5.628288269042969,
+      "step": 92
+    },
+    {
+      "epoch": 0.7962541806020067,
+      "grad_norm": 0.287109375,
+      "learning_rate": 0.014278935478416067,
+      "loss": 5.660680294036865,
+      "step": 93
+    },
+    {
+      "epoch": 0.8048160535117057,
+      "grad_norm": 0.25390625,
+      "learning_rate": 0.014082405669671866,
+      "loss": 5.6099348068237305,
+      "step": 94
+    },
+    {
+      "epoch": 0.8133779264214047,
+      "grad_norm": 0.265625,
+      "learning_rate": 0.013885477498762639,
+      "loss": 5.595584869384766,
+      "step": 95
+    },
+    {
+      "epoch": 0.8219397993311037,
+      "grad_norm": 0.2431640625,
+      "learning_rate": 0.013688200541302282,
+      "loss": 5.5869598388671875,
+      "step": 96
+    },
+    {
+      "epoch": 0.8305016722408026,
+      "grad_norm": 0.259765625,
+      "learning_rate": 0.013490624460709855,
+      "loss": 5.562591552734375,
+      "step": 97
+    },
+    {
+      "epoch": 0.8390635451505016,
+      "grad_norm": 0.2734375,
+      "learning_rate": 0.013292798995707057,
+      "loss": 5.562747955322266,
+      "step": 98
+    },
+    {
+      "epoch": 0.8476254180602006,
+      "grad_norm": 0.271484375,
+      "learning_rate": 0.013094773947796783,
+      "loss": 5.608129501342773,
+      "step": 99
+    },
+    {
+      "epoch": 0.8561872909698997,
+      "grad_norm": 0.265625,
+      "learning_rate": 0.012896599168725847,
+      "loss": 5.565805912017822,
+      "step": 100
+    },
+    {
+      "epoch": 0.8561872909698997,
+      "eval_loss": 5.537259578704834,
+      "eval_runtime": 51.1142,
+      "eval_samples_per_second": 11.954,
+      "eval_steps_per_second": 2.993,
+      "step": 100
     }
   ],
   "logging_steps": 1,
@@ -385,7 +743,7 @@
       "attributes": {}
     }
   },
-  "total_flos": 7.292068282073088e+16,
+  "total_flos": 1.4579790139891507e+17,
   "train_batch_size": 4,
   "trial_name": null,
   "trial_params": null
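
The per-step entries added to `trainer_state.json` can be summarized programmatically. The sketch below is a minimal example under stated assumptions: it uses a small hand-copied subset of the values from this diff, it assumes the entries live under a `log_history` key (the diff does not show the enclosing key name), and `summarize` is a hypothetical helper rather than part of any library.

```python
# A few values copied from the step-100 checkpoint diff. The "log_history"
# key is an assumption; the diff only shows the entries, not their parent key.
state = {
    "epoch": 0.8561872909698997,
    "global_step": 100,
    "log_history": [
        {"epoch": 0.43665551839464883, "loss": 6.548553466796875, "step": 51},
        {"epoch": 0.8561872909698997, "loss": 5.565805912017822, "step": 100},
        {"epoch": 0.8561872909698997, "eval_loss": 5.537259578704834, "step": 100},
    ],
}


def summarize(state):
    """Summarize training progress from a trainer-state dict (hypothetical helper)."""
    # Training entries carry "loss"; evaluation entries carry "eval_loss".
    train = [e for e in state["log_history"] if "loss" in e]
    evals = [e for e in state["log_history"] if "eval_loss" in e]
    return {
        # Optimizer steps needed to complete one full epoch.
        "steps_per_epoch": state["global_step"] / state["epoch"],
        "first_loss": train[0]["loss"],
        "last_loss": train[-1]["loss"],
        "last_eval_loss": evals[-1]["eval_loss"] if evals else None,
    }


summary = summarize(state)
for key, value in summary.items():
    print(f"{key}: {value}")
```

With the values above, the ratio of `global_step` to `epoch` works out to about 116.8 steps per epoch, and the training loss falls from roughly 6.55 at step 51 to roughly 5.57 at step 100. To run this against the real file, the `state` dict could be replaced with the result of `json.load` on `last-checkpoint/trainer_state.json`.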