ScottzillaSystems
/

How to use from
vLLM
Install from pip and serve model
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "ScottzillaSystems/ChatGPT-5"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "ScottzillaSystems/ChatGPT-5",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'
Use Docker
docker model run hf.co/ScottzillaSystems/ChatGPT-5
Quick Links

ChatGPT-5

Ultra-fast AI chat model based on Qwen2.5-0.5B-Instruct architecture (494M parameters).

Features

  • โšก Ultra-fast โ€” Lightweight 494M parameter model for instant responses
  • ๐Ÿ’ฌ Conversational โ€” Optimized for multi-turn chat
  • ๐Ÿ”ง Instruction Following โ€” Follows instructions accurately

Chat UI

Try it now: ChatGPT-5 Chat

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("ScottzillaSystems/ChatGPT-5")
tokenizer = AutoTokenizer.from_pretrained("ScottzillaSystems/ChatGPT-5")

messages = [{"role": "user", "content": "Hello!"}]
text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Downloads last month
1,038
Safetensors
Model size
0.5B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ScottzillaSystems/ChatGPT-5

Finetuned
(786)
this model

Space using ScottzillaSystems/ChatGPT-5 1