Instructions to use OrionStarAI/Orion-14B-Chat-Plugin with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OrionStarAI/Orion-14B-Chat-Plugin with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("text-generation", model="OrionStarAI/Orion-14B-Chat-Plugin", trust_remote_code=True)

    # Load model directly
    from transformers import AutoModelForCausalLM

    model = AutoModelForCausalLM.from_pretrained("OrionStarAI/Orion-14B-Chat-Plugin", trust_remote_code=True, dtype="auto")
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use OrionStarAI/Orion-14B-Chat-Plugin with vLLM:
Install from pip and serve model
    # Install vLLM from pip:
    pip install vllm

    # Start the vLLM server:
    vllm serve "OrionStarAI/Orion-14B-Chat-Plugin"

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:8000/v1/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "OrionStarAI/Orion-14B-Chat-Plugin",
            "prompt": "Once upon a time,",
            "max_tokens": 512,
            "temperature": 0.5
        }'
Use Docker
docker model run hf.co/OrionStarAI/Orion-14B-Chat-Plugin
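The curl call to the OpenAI-compatible completions endpoint can also be made from Python. The following is a minimal sketch using only the standard library, assuming a server is already running locally. The helper names `build_completion_request` and `complete` are hypothetical and not part of vLLM or any other library. The default port 8000 and the payload fields are taken from the curl example.

```python
import json
import urllib.request

MODEL_ID = "OrionStarAI/Orion-14B-Chat-Plugin"


def build_completion_request(prompt, base_url="http://localhost:8000",
                             max_tokens=512, temperature=0.5):
    """Build (without sending) a POST request for the /v1/completions endpoint.

    Hypothetical helper: mirrors the JSON body of the curl example.
    """
    payload = {
        "model": MODEL_ID,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def complete(prompt, **kwargs):
    """Send the request and return the generated text of the first choice.

    Assumes the server replies in the OpenAI completions format,
    where the text is found under choices[0]["text"].
    """
    request = build_completion_request(prompt, **kwargs)
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["text"]
```

With a server running, `complete("Once upon a time,")` should return the generated continuation as a string. Passing `base_url="http://localhost:30000"` points the same helper at a server listening on port 30000 instead.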
- SGLang
How to use OrionStarAI/Orion-14B-Chat-Plugin with SGLang:
Install from pip and serve model
    # Install SGLang from pip:
    pip install sglang

    # Start the SGLang server:
    python3 -m sglang.launch_server \
        --model-path "OrionStarAI/Orion-14B-Chat-Plugin" \
        --host 0.0.0.0 \
        --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "OrionStarAI/Orion-14B-Chat-Plugin",
            "prompt": "Once upon a time,",
            "max_tokens": 512,
            "temperature": 0.5
        }'
Use Docker images
    docker run --gpus all \
        --shm-size 32g \
        -p 30000:30000 \
        -v ~/.cache/huggingface:/root/.cache/huggingface \
        --env "HF_TOKEN=<secret>" \
        --ipc=host \
        lmsysorg/sglang:latest \
        python3 -m sglang.launch_server \
            --model-path "OrionStarAI/Orion-14B-Chat-Plugin" \
            --host 0.0.0.0 \
            --port 30000

    # Call the server using curl (OpenAI-compatible API):
    curl -X POST "http://localhost:30000/v1/completions" \
        -H "Content-Type: application/json" \
        --data '{
            "model": "OrionStarAI/Orion-14B-Chat-Plugin",
            "prompt": "Once upon a time,",
            "max_tokens": 512,
            "temperature": 0.5
        }'
- Docker Model Runner
How to use OrionStarAI/Orion-14B-Chat-Plugin with Docker Model Runner:
docker model run hf.co/OrionStarAI/Orion-14B-Chat-Plugin
Commit History
Update README_ja.md bf5f4e5 verified
Update README_zh.md 6783e7e verified
Update README.md 4c3dab3 verified
Update Wechat group QR code. fefc50a verified
Update tokenization_orion.py 3dcd1ca verified
Update README_zh.md ff5ad7c verified
Update README.md 20ea805 verified
Update README_ko.md 46381d8 verified
Update README_ja.md 55a2280 verified
Update README_zh.md 6aa4d3b verified
Update README.md 2c85ed6 verified
Upload 2 files 59ddd7c verified
Update README_zh.md 9555fc2 verified
Update README_ko.md 742b711 verified
Update README_ja.md 7de802b verified
Update README.md 703959b verified
Update README.md 951dfc4 verified
Update README_zh.md 2ae9a66 verified
Update README_ko.md 4813f43 verified
Update README_ja.md ca1ac7a verified
Update README.md bf7de3c verified
Upload 2 files 4ccf566 verified
Update README_zh.md 66b4194 verified
Update README_ko.md e7faaef verified
Update README_ja.md 687dd29 verified
Update README.md 3117539 verified
Update README_ko.md ccfd5e0 verified
Update README.md 5876302 verified
Upload 3 files b979b20 verified
Update README.md f74147e verified
Delete README_cn.md 486ea5a verified
Update README_cn.md 1aa10d4 verified
Update README.md 5217b9f verified
update readme 9925e22, committed by Du Chen
update readme 788a90e, committed by Du Chen
update readme 88aabdc, committed by Du Chen
update readme daa9fd0, committed by Du Chen
update readme f5377b5, committed by Du Chen
update readme and special_tokens_map.json a84426e, committed by Du Chen
initial commit 7acc177, committed by Du Chen