Quants
Collection
Miscellaneous quantifications, mostly GGUF โข 18 items โข Updated โข 1
How to use cstr/wmt21ct2_int8 with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("cstr/wmt21ct2_int8", dtype="auto")facebook/wmt21-dense-24-wide-en-x translation model in int8 (4.7gb), cf. the model as safetensor
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("cstr/wmt21ct2_int8", dtype="auto")