Text Generation
Transformers
Safetensors
MLX
starcoder2
code
Eval Results (legacy)
text-generation-inference
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("mlx-community/starcoder2-15b-4bit")
model = AutoModelForCausalLM.from_pretrained("mlx-community/starcoder2-15b-4bit")Quick Links
mlx-community/starcoder2-15b-4bit
This model was converted to MLX format from bigcode/starcoder2-15b.
Refer to the original model card for more details on the model.
Use with mlx
pip install mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("mlx-community/starcoder2-15b-4bit")
response = generate(model, tokenizer, prompt="hello", verbose=True)
- Downloads last month
- 106
Hardware compatibility
Log In to add your hardware
Quantized
Dataset used to train mlx-community/starcoder2-15b-4bit
Evaluation results
- pass@1 on CruxEval-Iself-reported48.100
- pass@1 on DS-1000self-reported33.800
- accuracy on GSM8K (PAL)self-reported65.100
- pass@1 on HumanEval+self-reported37.800
- pass@1 on HumanEvalself-reported46.300
- edit-smiliarity on RepoBench-v1.1self-reported74.080
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="mlx-community/starcoder2-15b-4bit")