
ZombitX64/HanumanGPT

Tags: Text Generation, Eval Results (legacy), text-generation-inference


Instructions for using ZombitX64/HanumanGPT with libraries and local apps.

Libraries

How to use ZombitX64/HanumanGPT with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="ZombitX64/HanumanGPT")

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("ZombitX64/HanumanGPT")
model = AutoModelForCausalLM.from_pretrained("ZombitX64/HanumanGPT")
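
The snippet above loads the tokenizer and model but stops before producing any text. The remaining step might look like the following minimal sketch, which assumes PyTorch is installed and that the checkpoint loads as a standard causal language model. The helper names `generation_kwargs` and `generate_text` and the sampling values are illustrative choices, not settings taken from the model card.

```python
def generation_kwargs(max_new_tokens=64, temperature=0.5):
    """Build keyword arguments for model.generate (values are illustrative)."""
    if max_new_tokens <= 0:
        raise ValueError("max_new_tokens must be positive")
    kwargs = {"max_new_tokens": max_new_tokens}
    if temperature > 0:
        # Sampling only makes sense with a positive temperature.
        kwargs.update(do_sample=True, temperature=temperature)
    else:
        # Fall back to greedy decoding.
        kwargs["do_sample"] = False
    return kwargs


def generate_text(prompt, model_id="ZombitX64/HanumanGPT", **overrides):
    """Load the model, generate a continuation, and return the decoded string."""
    # Imported lazily so generation_kwargs works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Tokenize to PyTorch tensors, generate, then decode the first sequence.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, **generation_kwargs(**overrides))
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("Once upon a time,")` downloads the weights on first use, so it needs network access and enough disk space for the checkpoint.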

Local Apps

How to use ZombitX64/HanumanGPT with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "ZombitX64/HanumanGPT"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ZombitX64/HanumanGPT",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'
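
The same request can be sent from Python instead of curl. This is a minimal sketch using only the standard library, assuming the vLLM server from the previous step is listening on `localhost:8000`. The function names `build_completion_request` and `complete` are hypothetical helpers introduced here for illustration.

```python
import json
import urllib.request


def build_completion_request(prompt, base_url="http://localhost:8000",
                             model="ZombitX64/HanumanGPT",
                             max_tokens=512, temperature=0.5):
    """Return a urllib Request that mirrors the curl call above."""
    payload = {
        "model": model,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, **kwargs):
    """Send the request and return the decoded JSON response (needs a running server)."""
    request = build_completion_request(prompt, **kwargs)
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)
```

With the server running, `complete("Once upon a time,")` returns the parsed JSON body. Keeping request construction separate from the network call makes the payload easy to inspect without a server.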

Use Docker

docker model run hf.co/ZombitX64/HanumanGPT

How to use ZombitX64/HanumanGPT with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "ZombitX64/HanumanGPT" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ZombitX64/HanumanGPT",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "ZombitX64/HanumanGPT" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ZombitX64/HanumanGPT",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'
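
Because the endpoint is OpenAI-compatible, the JSON it returns should follow the completions shape, where generated text sits under `choices[i].text`. The sketch below pulls that text out of a response body. The helper name `extract_completions` is hypothetical, and the sample body is hand-written for illustration, not real output from this model.

```python
import json


def extract_completions(response_body):
    """Return the generated text of every choice in a completions response."""
    data = json.loads(response_body)
    if "choices" not in data:
        # Anything without choices (for example an error body) is rejected.
        raise ValueError(f"unexpected response: {data!r}")
    return [choice["text"] for choice in data["choices"]]


# Hand-written sample in the OpenAI completions shape, not real model output.
sample = json.dumps({
    "object": "text_completion",
    "model": "ZombitX64/HanumanGPT",
    "choices": [
        {"index": 0, "text": " there was a model.", "finish_reason": "length"}
    ],
})
print(extract_completions(sample))
```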

Docker Model Runner
How to use ZombitX64/HanumanGPT with Docker Model Runner:
```
docker model run hf.co/ZombitX64/HanumanGPT
```

Total size: 502 MB

2 contributors

History: 13 commits

Latest commit: Upload model.safetensors (3156372, verified), JonusNattapong, 11 months ago

| File | Size | Last commit | Updated |
|---|---|---|---|
| .gitattributes | 1.52 kB | initial commit | 11 months ago |
| README.md | 5.21 kB | Update README.md | 11 months ago |
| config.json | 877 Bytes | Upload folder using huggingface_hub | 11 months ago |
| generation_config.json | 119 Bytes | Upload folder using huggingface_hub | 11 months ago |
| model.safetensors | 498 MB (xet) | Upload model.safetensors | 11 months ago |
| special_tokens_map.json | 2.7 kB | Upload 6 files | 11 months ago |
| tokenizer.json | 4.37 MB | Upload 6 files | 11 months ago |
| tokenizer_config.json | 23.5 kB | Upload 6 files | 11 months ago |
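
As a quick sanity check, the listed file sizes can be added up and compared with the 502 MB figure shown for the repository. The sketch below assumes the page uses decimal units (1 kB = 1000 bytes, 1 MB = 1,000,000 bytes). The listed sizes are rounded, so the total is approximate.

```python
# Assumption: sizes are reported in decimal (SI) units.
UNIT_BYTES = {"Bytes": 1, "kB": 10**3, "MB": 10**6}

# Sizes copied from the file listing.
LISTED_SIZES = {
    ".gitattributes": "1.52 kB",
    "README.md": "5.21 kB",
    "config.json": "877 Bytes",
    "generation_config.json": "119 Bytes",
    "model.safetensors": "498 MB",
    "special_tokens_map.json": "2.7 kB",
    "tokenizer.json": "4.37 MB",
    "tokenizer_config.json": "23.5 kB",
}


def to_bytes(size):
    """Convert a string such as '4.37 MB' to a byte count."""
    value, unit = size.split()
    return float(value) * UNIT_BYTES[unit]


total_bytes = sum(to_bytes(size) for size in LISTED_SIZES.values())
print(f"{total_bytes / 10**6:.1f} MB")  # prints "502.4 MB"
```

The weights file accounts for nearly all of the total, with the tokenizer and configuration files adding roughly 4.4 MB.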