How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="styalai/phi-2_quantize_gptq", trust_remote_code=True)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("styalai/phi-2_quantize_gptq", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("styalai/phi-2_quantize_gptq", trust_remote_code=True)
Quick Links

Model Card for Model ID

Model Details

It's just the model Phi-2 by microsoft quantized by gptq in 4 bits.

Downloads last month
8
Safetensors
Model size
3B params
Tensor type
I32
·
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support