Instructions to use saracandu/stldec_random_128 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use saracandu/stldec_random_128 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="saracandu/stldec_random_128", trust_remote_code=True)

# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("saracandu/stldec_random_128", trust_remote_code=True, dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps Settings

vLLM

How to use saracandu/stldec_random_128 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "saracandu/stldec_random_128"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "saracandu/stldec_random_128",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/saracandu/stldec_random_128

SGLang

How to use saracandu/stldec_random_128 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "saracandu/stldec_random_128" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "saracandu/stldec_random_128",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "saracandu/stldec_random_128" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "saracandu/stldec_random_128",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use saracandu/stldec_random_128 with Docker Model Runner:
```
docker model run hf.co/saracandu/stldec_random_128
```

saracandu commited on Aug 29, 2025

Commit

fb5373d

verified ·

1 Parent(s): 8d055e9

Update checkpoint step_98000

Browse files

Files changed (3) hide show

model.safetensors +1 -1
optimizer.bin +1 -1
scheduler.bin +1 -1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9906305d73cd01f13ac9d01aa5666381767d89fdcb641a304f8eb8cfa5ded51a
 size 57165744

 version https://git-lfs.github.com/spec/v1
+oid sha256:30d84eba0dd22e31a87c3fb04819646c51751b1c2a62ffb8e1c17867adb86208
 size 57165744

optimizer.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:971ce071a8a66be3da15dee12c0c6b50123981d86d8443c495f422e1599fb16d
 size 113942475

 version https://git-lfs.github.com/spec/v1
+oid sha256:8b7288f89836a1f8bcf2ad405a0eceb9155d122f3328d7a50378c46c5f9b1c94
 size 113942475

scheduler.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d4a126036ac9ad87f78fc18bd29d0da2ceff80bd4ac92361d87c6c5cb0035f67
 size 1465

 version https://git-lfs.github.com/spec/v1
+oid sha256:b1ebe8f7c0ce9cdfa22726f4399ff3ab244160c17571f3aeea27a34da58436b7
 size 1465