How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="Xenova/really-tiny-falcon-testing", trust_remote_code=True)
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Xenova/really-tiny-falcon-testing", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("Xenova/really-tiny-falcon-testing", trust_remote_code=True)
Quick Links

https://huggingface.co/fxmarty/really-tiny-falcon-testing with ONNX weights to be compatible with Transformers.js.

Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using 🤗 Optimum and structuring your repo like this one (with ONNX weights located in a subfolder named onnx).

Downloads last month
9
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Xenova/really-tiny-falcon-testing

Quantized
(1)
this model