# mlx-community/stable-code-3b-4bit
This model was converted to MLX format from [`stabilityai/stable-code-3b`](https://huggingface.co/stabilityai/stable-code-3b).
Refer to the [original model card](https://huggingface.co/stabilityai/stable-code-3b) for more details on the model.
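As background, MLX-format quantized checkpoints like this one are typically produced with mlx-lm's conversion utility. A minimal sketch, assuming the `mlx_lm.convert` Python API (argument names and defaults may differ between mlx-lm versions; check `python -m mlx_lm.convert --help` for yours):

```python
# Hedged sketch: convert a Hugging Face checkpoint to 4-bit MLX format.
from mlx_lm import convert

convert(
    "stabilityai/stable-code-3b",    # source Hugging Face repo
    mlx_path="stable-code-3b-4bit",  # local output directory
    quantize=True,                   # quantize weights (4-bit by default)
)
```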
## Use with mlx
```bash
pip install mlx-lm
```
```python
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/stable-code-3b-4bit")
response = generate(model, tokenizer, prompt="hello", verbose=True)
```
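The snippet above uses a toy prompt. Since stable-code-3b is a base code model, plain-text continuation prompts work for code completion without a chat template; a slightly fuller example (the prompt and `max_tokens` value are illustrative choices, not from the original card):

```python
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/stable-code-3b-4bit")

# Completion-style prompt: the model continues the function body.
prompt = "def fibonacci(n: int) -> int:\n"
response = generate(model, tokenizer, prompt=prompt, max_tokens=128, verbose=True)
```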
## Evaluation results

| Benchmark                  | Metric | Value (self-reported) |
|----------------------------|--------|-----------------------|
| MultiPL-HumanEval (Python) | pass@1 | 32.40                 |
| MultiPL-HumanEval (Python) | pass@1 | 30.90                 |
| MultiPL-HumanEval (Python) | pass@1 | 32.10                 |
| MultiPL-HumanEval (Python) | pass@1 | 32.10                 |
| MultiPL-HumanEval (Python) | pass@1 | 24.20                 |
| MultiPL-HumanEval (Python) | pass@1 | 23.00                 |
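For context on the metric: pass@1 is the standard HumanEval code-generation metric from Chen et al. (2021), reported here as a percentage. A self-contained sketch of the unbiased pass@k estimator it derives from:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator (Chen et al., 2021).

    n: completions sampled per problem
    c: completions that pass the unit tests
    """
    if n - c < k:
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# For k=1 this reduces to the passing fraction: pass_at_k(n, c, 1) == c / n
```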