Instructions for using sasuke/gpt2-wikitext2 with libraries, inference providers, notebooks, and local apps. Follow the sections below to get started.
- Libraries
- Transformers
How to use sasuke/gpt2-wikitext2 with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("text-generation", model="sasuke/gpt2-wikitext2")

    # Load model directly
    from transformers import AutoTokenizer, AutoModelForCausalLM

    tokenizer = AutoTokenizer.from_pretrained("sasuke/gpt2-wikitext2")
    model = AutoModelForCausalLM.from_pretrained("sasuke/gpt2-wikitext2")

- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use sasuke/gpt2-wikitext2 with vLLM:
Install from pip and serve model

    # Install vLLM from pip:
    pip install vllm

    # Start the vLLM server:
    vllm serve "sasuke/gpt2-wikitext2"

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:8000/v1/completions" \
      -H "Content-Type: application/json" \
      --data '{
        "model": "sasuke/gpt2-wikitext2",
        "prompt": "Once upon a time,",
        "max_tokens": 512,
        "temperature": 0.5
      }'

Use Docker

    docker model run hf.co/sasuke/gpt2-wikitext2

- SGLang
How to use sasuke/gpt2-wikitext2 with SGLang:
Install from pip and serve model

    # Install SGLang from pip:
    pip install sglang

    # Start the SGLang server:
    python3 -m sglang.launch_server \
      --model-path "sasuke/gpt2-wikitext2" \
      --host 0.0.0.0 \
      --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/completions" \
      -H "Content-Type: application/json" \
      --data '{
        "model": "sasuke/gpt2-wikitext2",
        "prompt": "Once upon a time,",
        "max_tokens": 512,
        "temperature": 0.5
      }'

Use Docker images

    docker run --gpus all \
      --shm-size 32g \
      -p 30000:30000 \
      -v ~/.cache/huggingface:/root/.cache/huggingface \
      --env "HF_TOKEN=<secret>" \
      --ipc=host \
      lmsysorg/sglang:latest \
      python3 -m sglang.launch_server \
        --model-path "sasuke/gpt2-wikitext2" \
        --host 0.0.0.0 \
        --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/completions" \
      -H "Content-Type: application/json" \
      --data '{
        "model": "sasuke/gpt2-wikitext2",
        "prompt": "Once upon a time,",
        "max_tokens": 512,
        "temperature": 0.5
      }'

- Docker Model Runner
How to use sasuke/gpt2-wikitext2 with Docker Model Runner:

    docker model run hf.co/sasuke/gpt2-wikitext2

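
The curl calls in the vLLM and SGLang sections can also be issued from Python. The following is a minimal sketch using only the standard library, not an official client. It assumes a server from one of those sections is already running locally (port 8000 for vLLM, port 30000 for SGLang) and that the response follows the OpenAI completions shape, with the generated text under `choices[0].text`. The helper names `build_completion_request` and `complete` are hypothetical and chosen for illustration.

```python
import json
import urllib.request

MODEL_ID = "sasuke/gpt2-wikitext2"


def build_completion_request(prompt, max_tokens=512, temperature=0.5):
    """Build the same JSON payload that the curl examples send."""
    return {
        "model": MODEL_ID,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }


def complete(base_url, payload, timeout=60):
    """POST the payload to <base_url>/v1/completions and return the text.

    Assumption: the server replies in the OpenAI completions format,
    so the generated text is at body["choices"][0]["text"].
    """
    request = urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["text"]


if __name__ == "__main__":
    payload = build_completion_request("Once upon a time,")
    try:
        # Use http://localhost:30000 instead when talking to SGLang.
        print(complete("http://localhost:8000", payload))
    except OSError as error:
        # Raised (as URLError, a subclass of OSError) when no server is listening.
        print(f"Could not reach the server: {error}")
```

Keeping payload construction separate from the network call makes it possible to inspect or log the request body without a running server.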
Commit History
- Training in progress, step 6500 (872563c), committed by sasuke xie
- Training in progress, step 5500 (ff735ab), committed by sasuke xie
- Training in progress, step 5000 (9acfc0d), committed by sasuke xie
- Training in progress, step 4000 (3788954), committed by sasuke xie
- Training in progress, step 2500 (e6b34f6), committed by sasuke xie
- Training in progress, step 1500 (0460f9b), committed by sasuke xie
- Training in progress, step 500 (04d3bb0), committed by sasuke xie
- initial commit (3f81b56), committed by sasuke xie