How to use DoNotChoke/output_fp16 with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("DoNotChoke/output_fp16", dtype="auto")
How to fix it?