exceptions_exp2_swap_last_to_push_3591

This model is a fine-tuned version of an unspecified base model on an unknown dataset. It achieves the following results on the evaluation set:

  • Loss: 3.5616
  • Accuracy: 0.3689
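
Assuming the reported loss is a mean token-level cross-entropy in nats (the usual convention for Trainer language-modeling runs; the card itself does not say), the evaluation loss corresponds to a perplexity of roughly exp(3.5616) ≈ 35.2:

```python
# Hedged sketch: treats the reported eval loss as mean cross-entropy in nats.
import math

eval_loss = 3.5616
print(f"perplexity ≈ {math.exp(eval_loss):.2f}")  # ≈ 35.23
```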

Model description

More information needed

Intended uses & limitations

More information needed
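
The card gives no usage details. Assuming this is a causal language model published under the run name above (both the task and the hub id are assumptions; the card only reports token accuracy and cross-entropy loss, which fits language modeling), a minimal loading sketch:

```python
# Hedged sketch: the repo id and task (causal LM) are assumptions,
# since the card does not document the architecture or intended use.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "exceptions_exp2_swap_last_to_push_3591"  # hypothetical hub id
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

inputs = tokenizer("Hello", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```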

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0006
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 3591
  • gradient_accumulation_steps: 5
  • total_train_batch_size: 80
  • optimizer: AdamW (ADAMW_TORCH_FUSED) with betas=(0.9, 0.98) and epsilon=1e-08; no additional optimizer arguments
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 100
  • num_epochs: 50.0
  • mixed_precision_training: Native AMP
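
The configuration above could be reproduced, roughly, with transformers.TrainingArguments. This is a hedged sketch: the output_dir is hypothetical, and fp16=True is one plausible reading of "Native AMP" (bf16 would be the other):

```python
# Sketch only: reconstructs the hyperparameters listed above.
# output_dir is hypothetical; fp16 vs. bf16 for "Native AMP" is an assumption.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="exceptions_exp2_swap_last_to_push_3591",
    learning_rate=6e-4,
    per_device_train_batch_size=16,
    per_device_eval_batch_size=16,
    seed=3591,
    gradient_accumulation_steps=5,   # 16 * 5 = 80 effective train batch size
    optim="adamw_torch_fused",
    adam_beta1=0.9,
    adam_beta2=0.98,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    warmup_steps=100,
    num_train_epochs=50.0,
    fp16=True,
)
```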

Training results

| Training Loss | Epoch   | Step  | Validation Loss | Accuracy |
|:-------------:|:-------:|:-----:|:---------------:|:--------:|
| 4.8481        | 0.2915  | 1000  | 4.7774          | 0.2512   |
| 4.346         | 0.5830  | 2000  | 4.2926          | 0.2978   |
| 4.1791        | 0.8744  | 3000  | 4.1280          | 0.3123   |
| 4.019         | 1.1659  | 4000  | 3.9984          | 0.3241   |
| 3.9226        | 1.4573  | 5000  | 3.9232          | 0.3304   |
| 3.8872        | 1.7488  | 6000  | 3.8636          | 0.3356   |
| 3.7431        | 2.0402  | 7000  | 3.8204          | 0.3408   |
| 3.7528        | 2.3317  | 8000  | 3.7912          | 0.3428   |
| 3.7452        | 2.6232  | 9000  | 3.7602          | 0.3461   |
| 3.73          | 2.9147  | 10000 | 3.7325          | 0.3486   |
| 3.6369        | 3.2061  | 11000 | 3.7202          | 0.3508   |
| 3.6515        | 3.4976  | 12000 | 3.7034          | 0.3522   |
| 3.6544        | 3.7890  | 13000 | 3.6826          | 0.3540   |
| 3.5562        | 4.0804  | 14000 | 3.6777          | 0.3550   |
| 3.5736        | 4.3719  | 15000 | 3.6656          | 0.3564   |
| 3.5766        | 4.6634  | 16000 | 3.6514          | 0.3578   |
| 3.5778        | 4.9549  | 17000 | 3.6385          | 0.3587   |
| 3.5127        | 5.2463  | 18000 | 3.6396          | 0.3592   |
| 3.5177        | 5.5378  | 19000 | 3.6295          | 0.3601   |
| 3.5436        | 5.8293  | 20000 | 3.6188          | 0.3611   |
| 3.4484        | 6.1207  | 21000 | 3.6239          | 0.3616   |
| 3.4759        | 6.4121  | 22000 | 3.6174          | 0.3618   |
| 3.4945        | 6.7036  | 23000 | 3.6045          | 0.3632   |
| 3.4965        | 6.9951  | 24000 | 3.5957          | 0.3638   |
| 3.4315        | 7.2865  | 25000 | 3.6049          | 0.3636   |
| 3.4516        | 7.5780  | 26000 | 3.5958          | 0.3646   |
| 3.4475        | 7.8695  | 27000 | 3.5840          | 0.3653   |
| 3.3798        | 8.1609  | 28000 | 3.5933          | 0.3651   |
| 3.4275        | 8.4524  | 29000 | 3.5898          | 0.3656   |
| 3.4291        | 8.7438  | 30000 | 3.5792          | 0.3664   |
| 3.3356        | 9.0353  | 31000 | 3.5841          | 0.3664   |
| 3.3778        | 9.3267  | 32000 | 3.5817          | 0.3666   |
| 3.4029        | 9.6182  | 33000 | 3.5735          | 0.3673   |
| 3.4147        | 9.9097  | 34000 | 3.5669          | 0.3677   |
| 3.3511        | 10.2011 | 35000 | 3.5799          | 0.3673   |
| 3.3766        | 10.4926 | 36000 | 3.5717          | 0.3679   |
| 3.4025        | 10.7841 | 37000 | 3.5628          | 0.3684   |
| 3.2836        | 11.0755 | 38000 | 3.5748          | 0.3679   |
| 3.3571        | 11.3670 | 39000 | 3.5713          | 0.3684   |
| 3.3725        | 11.6584 | 40000 | 3.5616          | 0.3689   |
| 3.3797        | 11.9499 | 41000 | 3.5542          | 0.3693   |
| 3.3073        | 12.2413 | 42000 | 3.5698          | 0.3686   |
| 3.3394        | 12.5328 | 43000 | 3.5633          | 0.3692   |
| 3.3429        | 12.8243 | 44000 | 3.5545          | 0.3698   |
| 3.2604        | 13.1157 | 45000 | 3.5698          | 0.3693   |
| 3.3149        | 13.4072 | 46000 | 3.5601          | 0.3696   |
| 3.3372        | 13.6987 | 47000 | 3.5521          | 0.3703   |
| 3.346         | 13.9901 | 48000 | 3.5462          | 0.3705   |
| 3.2841        | 14.2816 | 49000 | 3.5636          | 0.3700   |
| 3.3156        | 14.5730 | 50000 | 3.5547          | 0.3704   |
| 3.3218        | 14.8645 | 51000 | 3.5459          | 0.3710   |
| 3.2466        | 15.1559 | 52000 | 3.5613          | 0.3704   |
| 3.289         | 15.4474 | 53000 | 3.5574          | 0.3706   |
| 3.3036        | 15.7389 | 54000 | 3.5460          | 0.3712   |
| 3.206         | 16.0303 | 55000 | 3.5601          | 0.3705   |
| 3.2723        | 16.3218 | 56000 | 3.5579          | 0.3710   |
| 3.2838        | 16.6133 | 57000 | 3.5520          | 0.3710   |
| 3.3059        | 16.9047 | 58000 | 3.5406          | 0.3720   |
| 3.22          | 17.1962 | 59000 | 3.5600          | 0.3710   |
| 3.2509        | 17.4876 | 60000 | 3.5512          | 0.3715   |
| 3.2746        | 17.7791 | 61000 | 3.5432          | 0.3719   |
| 3.2006        | 18.0705 | 62000 | 3.5587          | 0.3713   |
| 3.2379        | 18.3620 | 63000 | 3.5561          | 0.3714   |
| 3.2578        | 18.6535 | 64000 | 3.5497          | 0.3720   |
| 3.2822        | 18.9450 | 65000 | 3.5386          | 0.3726   |
| 3.2186        | 19.2364 | 66000 | 3.5548          | 0.3719   |
| 3.2471        | 19.5279 | 67000 | 3.5496          | 0.3718   |
| 3.2705        | 19.8193 | 68000 | 3.5387          | 0.3727   |
| 3.1865        | 20.1108 | 69000 | 3.5558          | 0.3718   |
| 3.226         | 20.4022 | 70000 | 3.5513          | 0.3721   |
| 3.2304        | 20.6937 | 71000 | 3.5463          | 0.3727   |
| 3.2683        | 20.9852 | 72000 | 3.5381          | 0.3729   |
| 3.2001        | 21.2766 | 73000 | 3.5545          | 0.3721   |
| 3.2202        | 21.5681 | 74000 | 3.5456          | 0.3727   |
| 3.2363        | 21.8596 | 75000 | 3.5372          | 0.3731   |
| 3.1743        | 22.1510 | 76000 | 3.5595          | 0.3723   |
| 3.2089        | 22.4425 | 77000 | 3.5515          | 0.3722   |
| 3.2294        | 22.7339 | 78000 | 3.5431          | 0.3732   |
| 3.1305        | 23.0254 | 79000 | 3.5553          | 0.3726   |
| 3.1863        | 23.3168 | 80000 | 3.5539          | 0.3724   |
| 3.2041        | 23.6083 | 81000 | 3.5471          | 0.3730   |
| 3.2128        | 23.8998 | 82000 | 3.5403          | 0.3737   |
| 3.161         | 24.1912 | 83000 | 3.5574          | 0.3729   |
| 3.2043        | 24.4827 | 84000 | 3.5522          | 0.3731   |
| 3.2034        | 24.7742 | 85000 | 3.5422          | 0.3735   |
| 3.1296        | 25.0656 | 86000 | 3.5571          | 0.3730   |
| 3.1738        | 25.3571 | 87000 | 3.5549          | 0.3728   |
| 3.1881        | 25.6485 | 88000 | 3.5454          | 0.3736   |
| 3.2107        | 25.9400 | 89000 | 3.5398          | 0.3739   |
| 3.151         | 26.2314 | 90000 | 3.5577          | 0.3729   |
| 3.178         | 26.5229 | 91000 | 3.5541          | 0.3731   |
| 3.2011        | 26.8144 | 92000 | 3.5412          | 0.3742   |
| 3.1188        | 27.1058 | 93000 | 3.5580          | 0.3730   |
| 3.1519        | 27.3973 | 94000 | 3.5561          | 0.3733   |
| 3.1757        | 27.6888 | 95000 | 3.5465          | 0.3737   |
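
For a quick look at the shape of the training curve, a minimal plotting sketch (the values are hand-copied sample points from the table above, not read from a log file):

```python
# Sketch: plots a few sample points from the Validation Loss column above.
import matplotlib.pyplot as plt

steps    = [1000, 10000, 20000, 40000, 60000, 95000]
val_loss = [4.7774, 3.7325, 3.6188, 3.5616, 3.5512, 3.5465]

plt.plot(steps, val_loss, marker="o")
plt.xlabel("Step")
plt.ylabel("Validation loss")
plt.show()
```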

Framework versions

  • Transformers 4.55.2
  • Pytorch 2.8.0+cu128
  • Datasets 4.0.0
  • Tokenizers 0.21.4
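
An environment with these pins could be set up along these lines (the CUDA 12.8 wheel index URL is an assumption based on the "+cu128" suffix):

```bash
pip install transformers==4.55.2 datasets==4.0.0 tokenizers==0.21.4
pip install torch==2.8.0 --index-url https://download.pytorch.org/whl/cu128
```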

Model details

  • Format: Safetensors
  • Model size: 0.1B params
  • Tensor type: F32