Instructions to use prithivMLmods/Stark-Prompt-Extender with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use prithivMLmods/Stark-Prompt-Extender with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="prithivMLmods/Stark-Prompt-Extender")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("prithivMLmods/Stark-Prompt-Extender")
# The pipeline task above is "text-generation", so the causal-LM auto class is the matching loader.
model = AutoModelForCausalLM.from_pretrained("prithivMLmods/Stark-Prompt-Extender")
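
The snippets above load the model but stop short of generating text. The following is a minimal sketch of that next step, assuming the pipeline follows the usual Transformers text-generation interface (it returns a list of dicts with a `generated_text` key). `extend_prompt` and `load_generator` are hypothetical helper names, not part of any library.

```python
def extend_prompt(generator, short_prompt, max_new_tokens=64):
    """Return the generated text for short_prompt.

    `generator` is any callable with the text-generation pipeline interface:
    it returns a list of dicts that carry a "generated_text" key.
    """
    outputs = generator(short_prompt, max_new_tokens=max_new_tokens)
    return outputs[0]["generated_text"].strip()


def load_generator(model_id="prithivMLmods/Stark-Prompt-Extender"):
    """Build the text-generation pipeline (downloads the weights on first use)."""
    # Imported lazily so extend_prompt can be used without transformers installed.
    from transformers import pipeline

    return pipeline("text-generation", model=model_id)
```

Typical use would be `extend_prompt(load_generator(), "a castle on a hill")`, where the input string is only an illustrative prompt. The output depends on the model and sampling settings, so none is shown here.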

Local Apps

vLLM

How to use prithivMLmods/Stark-Prompt-Extender with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "prithivMLmods/Stark-Prompt-Extender"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "prithivMLmods/Stark-Prompt-Extender",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'
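
The same request can be issued from Python. This is a minimal sketch using only the standard library, assuming the server returns the usual OpenAI-style completions body with the text under `choices[0].text`. `build_completion_request` and `complete` are hypothetical helper names.

```python
import json
import urllib.request


def build_completion_request(prompt, base_url="http://localhost:8000",
                             model="prithivMLmods/Stark-Prompt-Extender",
                             max_tokens=512, temperature=0.5):
    """Build a POST request for the OpenAI-compatible /v1/completions route."""
    payload = {
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, **kwargs):
    """Send the request and return the text of the first completion."""
    request = build_completion_request(prompt, **kwargs)
    with urllib.request.urlopen(request, timeout=60) as response:
        body = json.load(response)
    # Assumption: OpenAI-style response shape, {"choices": [{"text": ...}]}.
    return body["choices"][0]["text"]
```

Calling `complete("Once upon a time,")` mirrors the curl command. Passing `base_url="http://localhost:30000"` targets a server listening on port 30000 instead.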

Use Docker

docker model run hf.co/prithivMLmods/Stark-Prompt-Extender

SGLang

How to use prithivMLmods/Stark-Prompt-Extender with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "prithivMLmods/Stark-Prompt-Extender" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "prithivMLmods/Stark-Prompt-Extender",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "prithivMLmods/Stark-Prompt-Extender" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "prithivMLmods/Stark-Prompt-Extender",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Docker Model Runner
How to use prithivMLmods/Stark-Prompt-Extender with Docker Model Runner:
```
docker model run hf.co/prithivMLmods/Stark-Prompt-Extender
```

prithivMLmods committed on Jan 27, 2025

Commit 50606f3 (verified) · 1 parent: f61ba7e

Upload folder using huggingface_hub

Files changed (8)

onnx/model.onnx +3 -0
onnx/model_bnb4.onnx +3 -0
onnx/model_fp16.onnx +3 -0
onnx/model_int8.onnx +3 -0
onnx/model_q4.onnx +3 -0
onnx/model_q4f16.onnx +3 -0
onnx/model_quantized.onnx +3 -0
onnx/model_uint8.onnx +3 -0
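
To fetch just one of these ONNX files rather than the whole repository, something like the following sketch may work. It assumes the `huggingface_hub` package is installed and relies on its `hf_hub_download` function. `onnx_filename` and `download_onnx` are hypothetical helper names, and the variant suffixes are taken from the file list above.

```python
REPO_ID = "prithivMLmods/Stark-Prompt-Extender"

# Suffixes from the files added in this commit; "" stands for onnx/model.onnx.
ONNX_VARIANTS = ("", "bnb4", "fp16", "int8", "q4", "q4f16", "quantized", "uint8")


def onnx_filename(variant=""):
    """Map a variant suffix to its path inside the repository."""
    if variant not in ONNX_VARIANTS:
        raise ValueError(f"unknown ONNX variant: {variant!r}")
    return "onnx/model.onnx" if not variant else f"onnx/model_{variant}.onnx"


def download_onnx(variant=""):
    """Download one ONNX file and return its local cache path."""
    # Imported lazily so onnx_filename works without huggingface_hub installed.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=REPO_ID, filename=onnx_filename(variant))
```

For example, `download_onnx("q4f16")` would request `onnx/model_q4f16.onnx`.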

onnx/model.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2822eb9fbb9cf6af735fbcdddb4815d5de971f04f6aa9d1f2584d2b65c81ad29
+size 503480854

onnx/model_bnb4.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:12a67810b313f6ab0d68ea35dc0128672db65842cfe5963702c97700e75ebc5e
+size 503480873

onnx/model_fp16.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7a26455fccced6031bc7d2f7e2e2ed6309fff902117fe353c51c076d916f7056
+size 251929793

onnx/model_int8.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9e0d84b88f097374609a930afe4fa57ddacc023001916cde4d88fcae9d1f2a99
+size 286326669

onnx/model_q4.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:12a67810b313f6ab0d68ea35dc0128672db65842cfe5963702c97700e75ebc5e
+size 503480873

onnx/model_q4f16.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fa06a559f383b8070f9274b9280a925ad4ee5842f68a5939952f8b33e2a95cff
+size 251929812

onnx/model_quantized.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9e0d84b88f097374609a930afe4fa57ddacc023001916cde4d88fcae9d1f2a99
+size 286326669

onnx/model_uint8.onnx ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a338bba9c828e00d1ee7c72879d9a4139a76865dac2c8b7e8791648ff680fc4d
+size 286326688
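
Each diff above adds a three-line Git LFS pointer (`version`, `oid`, `size`) rather than the ONNX binary itself. The sketch below parses such pointers, groups the files in this commit by digest, and checks a downloaded file against its pointer. The helper names are hypothetical, and the digests and sizes are copied from the diffs above.

```python
import hashlib
import os
from collections import defaultdict

LFS_VERSION = "https://git-lfs.github.com/spec/v1"

# (sha256 digest, size in bytes) for each file, as listed in this commit.
COMMIT_FILES = {
    "onnx/model.onnx": ("2822eb9fbb9cf6af735fbcdddb4815d5de971f04f6aa9d1f2584d2b65c81ad29", 503480854),
    "onnx/model_bnb4.onnx": ("12a67810b313f6ab0d68ea35dc0128672db65842cfe5963702c97700e75ebc5e", 503480873),
    "onnx/model_fp16.onnx": ("7a26455fccced6031bc7d2f7e2e2ed6309fff902117fe353c51c076d916f7056", 251929793),
    "onnx/model_int8.onnx": ("9e0d84b88f097374609a930afe4fa57ddacc023001916cde4d88fcae9d1f2a99", 286326669),
    "onnx/model_q4.onnx": ("12a67810b313f6ab0d68ea35dc0128672db65842cfe5963702c97700e75ebc5e", 503480873),
    "onnx/model_q4f16.onnx": ("fa06a559f383b8070f9274b9280a925ad4ee5842f68a5939952f8b33e2a95cff", 251929812),
    "onnx/model_quantized.onnx": ("9e0d84b88f097374609a930afe4fa57ddacc023001916cde4d88fcae9d1f2a99", 286326669),
    "onnx/model_uint8.onnx": ("a338bba9c828e00d1ee7c72879d9a4139a76865dac2c8b7e8791648ff680fc4d", 286326688),
}


def pointer_text(digest, size):
    """Render the pointer file content as it appears in the diff."""
    return f"version {LFS_VERSION}\noid sha256:{digest}\nsize {size}\n"


def parse_lfs_pointer(text):
    """Parse pointer text into a dict with version, algo, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {"version": fields["version"], "algo": algo,
            "digest": digest, "size": int(fields["size"])}


def duplicate_groups(pointers):
    """Return sorted groups of file names that share the same digest."""
    groups = defaultdict(list)
    for name, pointer in pointers.items():
        groups[pointer["digest"]].append(name)
    return sorted(sorted(names) for names in groups.values() if len(names) > 1)


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Check a local file's size and SHA-256 against a parsed pointer."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == pointer["digest"]


POINTERS = {name: parse_lfs_pointer(pointer_text(digest, size))
            for name, (digest, size) in COMMIT_FILES.items()}
```

Running `duplicate_groups(POINTERS)` on this data pairs `model_bnb4.onnx` with `model_q4.onnx` and `model_int8.onnx` with `model_quantized.onnx`. Each pair shares one digest and size, so the two files in a pair are byte-identical.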