Use from the Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="mgoin/tiny-random-llama-2-quant")
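Once created, the pipeline can be called on a prompt string. A minimal sketch follows; the prompt text and max_new_tokens value are illustrative assumptions, not part of this repository.

# Call the pipeline on an example prompt (prompt and generation length are arbitrary)
output = pipe("Hello, my name is", max_new_tokens=20)
print(output[0]["generated_text"])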
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("mgoin/tiny-random-llama-2-quant")
model = AutoModelForCausalLM.from_pretrained("mgoin/tiny-random-llama-2-quant")
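With the tokenizer and model loaded, text can be generated directly. A minimal generation sketch, assuming an illustrative prompt and max_new_tokens value:

# Tokenize an example prompt, generate, and decode (prompt and generation length are arbitrary)
inputs = tokenizer("Hello, my name is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))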
Downloads last month: 5
Model size (Safetensors): 104k params
Tensor types: F32, I32, BF16, U8
Inference Providers: this model isn't deployed by any Inference Provider.