# mlx-community/granite-8b-code-base-8bit

Tags: Text Generation · Transformers · Safetensors · MLX · llama · code · Eval Results · text-generation-inference
## Use with Transformers

```python
# Load model directly (note: the weights are stored in MLX's quantized
# format, so the mlx-lm path below is the primary intended usage).
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("mlx-community/granite-8b-code-base-8bit")
model = AutoModelForCausalLM.from_pretrained("mlx-community/granite-8b-code-base-8bit")
```
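If the checkpoint loads in your environment, generation follows the usual Transformers pattern. A hedged sketch; the prompt and decoding settings below are illustrative, not part of this card:

```python
# Illustrative only: standard Transformers generation once model/tokenizer
# are loaded as above. Prompt and max_new_tokens are arbitrary examples.
inputs = tokenizer("def fibonacci(n):", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```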
The model [mlx-community/granite-8b-code-base-8bit](https://huggingface.co/mlx-community/granite-8b-code-base-8bit) was converted to MLX format from [ibm-granite/granite-8b-code-base](https://huggingface.co/ibm-granite/granite-8b-code-base) using mlx-lm version 0.12.0.
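For reference, a conversion like this one can be reproduced with mlx-lm's `convert` utility. A minimal sketch, assuming the Python API of mlx-lm ~0.12; the output path is an illustrative name, and keyword names should be checked against your installed version:

```python
# Sketch of the 8-bit conversion; keyword names follow mlx-lm ~0.12.
from mlx_lm import convert

convert(
    "ibm-granite/granite-8b-code-base",    # source Hugging Face repo
    mlx_path="granite-8b-code-base-8bit",  # local output directory (illustrative)
    quantize=True,
    q_bits=8,         # 8-bit weights, matching this repo
    q_group_size=64,  # mlx-lm's default quantization group size
)
```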
## Use with mlx
```bash
pip install mlx-lm
```
```python
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/granite-8b-code-base-8bit")
response = generate(model, tokenizer, prompt="hello", verbose=True)
```
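Since this is a code base model with no chat template, plain completion prompts work best. For example, with an illustrative prompt and the `max_tokens` keyword (an assumed `mlx_lm.generate` argument, present in mlx-lm ~0.12 with a default of 100):

```python
# Code-completion style prompt; max_tokens caps the generated length.
prompt = "def fibonacci(n):"
response = generate(model, tokenizer, prompt=prompt, max_tokens=128, verbose=True)
```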
## Model details

- Downloads last month: 56
- Model size: 1B params (the Hub's count for the packed quantized weights; the base model is 8B)
- Tensor types: F16 · U32
- Quantization: 8-bit
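The F16/U32 split reflects MLX's affine quantization layout: 8-bit weight values packed four-per-uint32 word, plus per-group floating-point scales and biases. A minimal sketch using `mx.quantize`/`mx.dequantize` from mlx.core, on a toy matrix rather than this model's weights:

```python
import mlx.core as mx

# Toy weight matrix quantized the way MLX stores this model's layers.
w = mx.random.normal((8, 64))
w_q, scales, biases = mx.quantize(w, group_size=64, bits=8)
print(w_q.dtype)  # uint32: the packed "U32" tensors listed above

# Dequantize to recover an approximation of the original weights.
w_hat = mx.dequantize(w_q, scales, biases, group_size=64, bits=8)
print(mx.abs(w - w_hat).max())  # small quantization error
```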
## Evaluation results

Self-reported pass@1 scores, inherited from the upstream ibm-granite/granite-8b-code-base card:

| Benchmark | pass@1 |
|---|---|
| MBPP | 42.2 |
| MBPP+ | 49.6 |
| HumanEvalSynthesis (Python) | 43.9 |
| HumanEvalSynthesis (JavaScript) | 52.4 |
| HumanEvalSynthesis (Java) | 56.1 |
| HumanEvalSynthesis (Go) | 31.7 |
| HumanEvalSynthesis (C++) | 43.9 |
| HumanEvalSynthesis (Rust) | 32.9 |
| HumanEvalExplain (Python) | 23.5 |
| HumanEvalExplain (JavaScript) | 32.3 |
| HumanEvalExplain (Java) | 25.0 |
| HumanEvalExplain (Go) | 23.2 |
| HumanEvalExplain (C++) | 28.0 |
| HumanEvalExplain (Rust) | 19.5 |
| HumanEvalFix (Python) | 22.6 |
| HumanEvalFix (JavaScript) | 35.4 |
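For reference, pass@1 is the standard unbiased estimator from the HumanEval paper (Chen et al., 2021): sample n completions per problem, count the c that pass the unit tests, and estimate the chance that at least one of k draws is correct. A numpy sketch of that estimator:

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k: 1 - C(n-c, k) / C(n, k), computed stably."""
    if n - c < k:
        return 1.0
    return 1.0 - float(np.prod(1.0 - k / np.arange(n - c + 1, n + 1)))

# e.g. 200 samples, 88 passing -> pass@1 = 88/200 = 0.44
print(pass_at_k(200, 88, 1))
```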
You can also use the Transformers `pipeline` helper:

```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="mlx-community/granite-8b-code-base-8bit")
```
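For example (the prompt and generation arguments are illustrative):

```python
out = pipe("def fibonacci(n):", max_new_tokens=64, do_sample=False)
print(out[0]["generated_text"])
```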