Instructions to use WizardLMTeam/WizardLM-13B-V1.2 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use WizardLMTeam/WizardLM-13B-V1.2 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="WizardLMTeam/WizardLM-13B-V1.2")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("WizardLMTeam/WizardLM-13B-V1.2") model = AutoModelForCausalLM.from_pretrained("WizardLMTeam/WizardLM-13B-V1.2") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use WizardLMTeam/WizardLM-13B-V1.2 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "WizardLMTeam/WizardLM-13B-V1.2" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "WizardLMTeam/WizardLM-13B-V1.2", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/WizardLMTeam/WizardLM-13B-V1.2
- SGLang
How to use WizardLMTeam/WizardLM-13B-V1.2 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "WizardLMTeam/WizardLM-13B-V1.2" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "WizardLMTeam/WizardLM-13B-V1.2", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "WizardLMTeam/WizardLM-13B-V1.2" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "WizardLMTeam/WizardLM-13B-V1.2", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use WizardLMTeam/WizardLM-13B-V1.2 with Docker Model Runner:
docker model run hf.co/WizardLMTeam/WizardLM-13B-V1.2
BOS is actually the EOS token by default
#30 opened over 2 years ago
by
dblakely
Adding Evaluation Results
#29 opened over 2 years ago
by
leaderboard-pr-bot
Adding `safetensors` variant of this model
#28 opened over 2 years ago
by
qeternity
set use_cache=true for faster decoding
#27 opened over 2 years ago
by
zxcvvxcz
Set model_max_length of tokenizer equal to the max_length of the model
#26 opened over 2 years ago
by
zxcvvxcz
Something's Wrong, It's Not Your Fault, Llama 2 Is Fundamentally Flawed
1
#25 opened over 2 years ago
by deleted
Update README.md
#16 opened over 2 years ago
by
haipeng1
mat1 and mat2 shapes cannot be multiplied (38x5120 and 1x2560)
1
#13 opened almost 3 years ago
by
LaferriereJC
Request: Wizardlm-22b
#12 opened almost 3 years ago
by
rombodawg
inference speed is considerably slow
#11 opened almost 3 years ago
by
sonald
Should the "Max Positional Embeddings" be 4096?
1
#10 opened almost 3 years ago
by
barleyspectacular
Feel free to use my datasets for further refinement!
#9 opened almost 3 years ago
by
rombodawg
Missing model card & datasets info
👍 1
#8 opened almost 3 years ago
by
markding
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
#7 opened almost 3 years ago
by
Satya93
Still non-commercial?
#6 opened almost 3 years ago
by
kalijason
RuntimeError: expected scalar type Half but found Char
2
#5 opened almost 3 years ago
by
nicolasbo
database connection ?
#4 opened almost 3 years ago
by
nobitha
What is the prompt format?
❤️ 3
1
#3 opened almost 3 years ago
by
TheBloke
What is this model based off of?
1
#2 opened almost 3 years ago
by
rombodawg
Dataset Availability?
👍 13
#1 opened almost 3 years ago
by
jonfairbanks