How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Jorghi21/llama7b-4bit-fixed")
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Jorghi21/llama7b-4bit-fixed")
model = AutoModelForCausalLM.from_pretrained("Jorghi21/llama7b-4bit-fixed")
Quick Links

No model card

Downloads last month
5
Safetensors
Model size
1B params
Tensor type
I64
F32
I32
F16
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support