How to use prithivida/flashrank with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("prithivida/flashrank", dtype="auto")
Some int8 model export requires custom forward code, If you intend to duplicate or clone this repo or download files. You have attribute to the author of this repo and add a backlink to this repo in the readme.