How to use Qdrant/gte-large-onnx-Q with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("Qdrant/gte-large-onnx-Q") model = AutoModel.from_pretrained("Qdrant/gte-large-onnx-Q")
Quantized ONNX port of thenlper/gte-large for text classification and similarity searches.