How to use vxbrandon/pruned_model_iterative with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("question-answering", model="vxbrandon/pruned_model_iterative")
# Load model directly
from transformers import AutoTokenizer, AutoModelForQuestionAnswering

tokenizer = AutoTokenizer.from_pretrained("vxbrandon/pruned_model_iterative")
model = AutoModelForQuestionAnswering.from_pretrained("vxbrandon/pruned_model_iterative")
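When the model is loaded directly, a forward pass returns raw start and end logits, not an answer string, so the answer span has to be decoded separately. The following is a minimal sketch of that decoding step, written in plain Python so it runs without downloading the model. The helper name `best_span`, its `max_answer_len` parameter, and the toy logits are illustrative assumptions, not part of Transformers or of this model card. The question-answering pipeline does a more careful version of this, for example by excluding question and special tokens.

```python
from typing import List, Tuple


def best_span(
    start_logits: List[float],
    end_logits: List[float],
    max_answer_len: int = 15,
) -> Tuple[int, int, float]:
    """Return (start, end, score) for the highest-scoring span with start <= end.

    Hypothetical helper for illustration; the score is the sum of the
    start logit and the end logit of the candidate span.
    """
    best = (0, 0, float("-inf"))
    for start, start_score in enumerate(start_logits):
        # Only consider end positions at or after the start, within the length cap.
        last = min(start + max_answer_len, len(end_logits))
        for end in range(start, last):
            score = start_score + end_logits[end]
            if score > best[2]:
                best = (start, end, score)
    return best


# Toy logits for a 5-token input (made-up numbers, not real model output).
toy_start = [0.1, 2.0, 0.3, 0.2, 0.1]
toy_end = [0.0, 0.5, 0.1, 3.0, 0.2]
start, end, score = best_span(toy_start, toy_end)
print(start, end)
```

With the real model, the two lists would presumably come from `outputs = model(**inputs)` as `outputs.start_logits[0].tolist()` and `outputs.end_logits[0].tolist()`, where `inputs` is an assumed name for the result of calling `tokenizer(question, context, return_tensors="pt")`. The answer text can then be recovered with `tokenizer.decode(inputs["input_ids"][0][start:end + 1])`.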