Text Generation
Transformers
PyTorch
opt
instruction-tuning
text-generation-inference
text2text-generation
Instructions to use akoksal/LongForm-OPT-6.7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use akoksal/LongForm-OPT-6.7B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="akoksal/LongForm-OPT-6.7B")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("akoksal/LongForm-OPT-6.7B") model = AutoModelForCausalLM.from_pretrained("akoksal/LongForm-OPT-6.7B") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use akoksal/LongForm-OPT-6.7B with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "akoksal/LongForm-OPT-6.7B" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "akoksal/LongForm-OPT-6.7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/akoksal/LongForm-OPT-6.7B
- SGLang
How to use akoksal/LongForm-OPT-6.7B with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "akoksal/LongForm-OPT-6.7B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "akoksal/LongForm-OPT-6.7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "akoksal/LongForm-OPT-6.7B" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "akoksal/LongForm-OPT-6.7B", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use akoksal/LongForm-OPT-6.7B with Docker Model Runner:
docker model run hf.co/akoksal/LongForm-OPT-6.7B
Commit History
Update README.md fba531c verified
Update README.md 33bd403
Update README.md 690de87
Update README.md 1fc27eb
Update README.md 6d3f295
Update README.md 7344ca9
Update README.md 889061c
Update README.md 83ae1aa
Update README.md 902dfe0
Update README.md f4cf071
Update README.md 171dba5
first commit d21a7f7
Abdullatif Koksal commited on
first commit 38e93af
Abdullatif Koksal commited on