How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="TaylorAI/Flash-Llama-13B", trust_remote_code=True)
# Load model directly
from transformers import AutoTokenizer, AutoModelForMultimodalLM

tokenizer = AutoTokenizer.from_pretrained("TaylorAI/Flash-Llama-13B", trust_remote_code=True)
model = AutoModelForMultimodalLM.from_pretrained("TaylorAI/Flash-Llama-13B", trust_remote_code=True)
Quick Links

No model card

Downloads last month
128
Safetensors
Model size
13B params
Tensor type
F32
Β·
F16
Β·
Inference Providers NEW

Model tree for TaylorAI/Flash-Llama-13B

Quantizations
1 model

Spaces using TaylorAI/Flash-Llama-13B 30