How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="kavinilavan/Llama-2-13b-chat-hf-array_8bit")
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("kavinilavan/Llama-2-13b-chat-hf-array_8bit")
model = AutoModelForCausalLM.from_pretrained("kavinilavan/Llama-2-13b-chat-hf-array_8bit")
Quick Links

No model card

Downloads last month
3
Inference Providers NEW