Text Ranking
Transformers
ONNX
Safetensors
sentence-transformers
Transformers.js
English
modernbert
text-classification
text-embeddings-inference
Instructions to use Alibaba-NLP/gte-reranker-modernbert-base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Alibaba-NLP/gte-reranker-modernbert-base with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("Alibaba-NLP/gte-reranker-modernbert-base") model = AutoModelForSequenceClassification.from_pretrained("Alibaba-NLP/gte-reranker-modernbert-base") - sentence-transformers
How to use Alibaba-NLP/gte-reranker-modernbert-base with sentence-transformers:
from sentence_transformers import CrossEncoder model = CrossEncoder("Alibaba-NLP/gte-reranker-modernbert-base") query = "Which planet is known as the Red Planet?" passages = [ "Venus is often called Earth's twin because of its similar size and proximity.", "Mars, known for its reddish appearance, is often referred to as the Red Planet.", "Jupiter, the largest planet in our solar system, has a prominent red spot.", "Saturn, famous for its rings, is sometimes mistaken for the Red Planet." ] scores = model.predict([(query, passage) for passage in passages]) print(scores) - Transformers.js
How to use Alibaba-NLP/gte-reranker-modernbert-base with Transformers.js:
// npm i @huggingface/transformers import { pipeline } from '@huggingface/transformers'; // Allocate pipeline const pipe = await pipeline('text-ranking', 'Alibaba-NLP/gte-reranker-modernbert-base'); - Notebooks
- Google Colab
- Kaggle
patch inference on CPU & Windows + Update README snippets
#2
by tomaarsen HF Staff - opened
Hello!
Pull Request overview
- Remove
reference_compileconfig option. When not specified in the config, it will be set dynamically based on the user's hardware and software: https://github.com/huggingface/transformers/blob/f439e28d32c9fa061c4fd90696ba0b158d273d09/src/transformers/models/modernbert/modeling_modernbert.py#L689-L718 - Update the README:
- Add tag for Sentence Transformers to boost visibility
- Add model outputs so people get a better feel for what the model does
- Remove 'trust_remote_code', not needed for ModernBERT!
- Update minimum 'transformers' to v4.48.0, as that version introduced the
modernbertarchitecture. - Mention that
flash_attnis recommended (but not required) for faster inference.
Details
Regarding the reference_compile config change: if that isn't done, then parts of the model are always compiled, even if the user does not have triton (a core requirement for compilation) or if they are running on CPU (which isn't compatible with compilation). Removing the option will help.
- Tom Aarsen
tomaarsen changed pull request status to open
thenlper changed pull request status to merged