Tribewarez/synthetic-pot-o-challanges-22-22k
Viewer • Updated • 22.2k • 137
How to use Tribewarez/pot-o-22-slim with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-generation", model="Tribewarez/pot-o-22-slim") # Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("Tribewarez/pot-o-22-slim")
model = AutoModelForCausalLM.from_pretrained("Tribewarez/pot-o-22-slim")How to use Tribewarez/pot-o-22-slim with vLLM:
# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "Tribewarez/pot-o-22-slim"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "Tribewarez/pot-o-22-slim",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker model run hf.co/Tribewarez/pot-o-22-slim
How to use Tribewarez/pot-o-22-slim with SGLang:
# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
--model-path "Tribewarez/pot-o-22-slim" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "Tribewarez/pot-o-22-slim",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'docker run --gpus all \
--shm-size 32g \
-p 30000:30000 \
-v ~/.cache/huggingface:/root/.cache/huggingface \
--env "HF_TOKEN=<secret>" \
--ipc=host \
lmsysorg/sglang:latest \
python3 -m sglang.launch_server \
--model-path "Tribewarez/pot-o-22-slim" \
--host 0.0.0.0 \
--port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
-H "Content-Type: application/json" \
--data '{
"model": "Tribewarez/pot-o-22-slim",
"prompt": "Once upon a time,",
"max_tokens": 512,
"temperature": 0.5
}'How to use Tribewarez/pot-o-22-slim with Docker Model Runner:
docker model run hf.co/Tribewarez/pot-o-22-slim
Ultra-slim GPT-2-style causal LM (~22,200 trainable parameters) for PoT-O path / MML experiments. Pairs with the 22,222-example dataset Tribewarez/synthetic-pot-o-challanges-22-22k (signature param_signature 22.2222).
Weights are random initialization — intended for fine-tuning on PoT-O challenge → optimal_path text (see dataset card). Architecture chosen to land near 22.2k params with a 257-token byte-level tokenizer (256 bytes + <|endoftext|>).
| Architecture | GPT2LMHeadModel |
vocab_size |
257 |
n_positions |
64 (truncate long challenge strings for prefill) |
n_embd |
24 |
n_layer |
2 |
n_head |
1 |
n_inner |
96 |
| Parameters | 22,200 |
cd pot-o-22-slim
python create_model.py
pip install transformers huggingface_hub
huggingface-cli login
python upload_model.py
from transformers import AutoModelForCausalLM, AutoTokenizer
model_id = "Tribewarez/pot-o-22-slim"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
text = "tensor:shape=[32,64];dtype=float16;target_mml=0.22;ops:matmul,gelu"
inputs = tok(text, return_tensors="pt", max_length=64, truncation=True)
# ... generation
MIT licensed • Tribewarez guild • live beta