onnx-community/NeoBERT-ONNX

Feature Extraction

Transformers.js

Xenova HF Staff committed on Jul 1

Commit 921f626 · verified · 1 Parent(s): 9e3e5cd

Update README.md

Files changed (1)

README.md +21 -0

@@ -15,6 +15,27 @@ NeoBERT is a **next-generation encoder** model for English text representation,
 - Paper: [paper](https://arxiv.org/abs/2502.19587)
 - Repository: [github](https://github.com/chandar-lab/NeoBERT).
+## Usage
+### ONNXRuntime
+```py
+from transformers import AutoTokenizer
+from huggingface_hub import hf_hub_download
+import onnxruntime as ort
+model_id = "onnx-community/NeoBERT-ONNX"
+tokenizer = AutoTokenizer.from_pretrained(model_id)
+model_file = hf_hub_download(model_id, filename="onnx/model.onnx")
+session = ort.InferenceSession(model_file)
+text = ["NeoBERT is the most efficient model of its kind!"]
+inputs = tokenizer(text, return_tensors="np").data
+outputs = session.run(None, inputs)[0]
+embeddings = outputs[:, 0, :]
+print(f"{embeddings.shape=}") # (1, 768)
+```
 ## Conversion
 The export script can be found at [./export.py](https://huggingface.co/onnx-community/NeoBERT-ONNX/blob/main/export.py).
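
The usage snippet added in this commit keeps only the first token's hidden state (`outputs[:, 0, :]`) as the sentence embedding. A common follow-up, not part of this commit, is to compare two such embeddings by cosine similarity. The sketch below is dependency-free and uses a hand-written nested list in place of the real `session.run` output; `cls_pool`, `cosine_similarity`, and the toy values are hypothetical and only illustrate the indexing.

```py
import math


def cls_pool(hidden_states):
    """Return the first-token vector of every sequence.

    hidden_states is nested as [batch][seq_len][hidden_size], mirroring
    the array that the README snippet indexes with outputs[:, 0, :].
    """
    return [sequence[0] for sequence in hidden_states]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy stand-in for model output (assumption): batch of 2, seq_len 2, hidden size 3.
toy_outputs = [
    [[1.0, 0.0, 0.0], [0.5, 0.5, 0.5]],
    [[0.0, 2.0, 0.0], [0.1, 0.2, 0.3]],
]

embeddings = cls_pool(toy_outputs)
similarity = cosine_similarity(embeddings[0], embeddings[1])
print(len(embeddings), len(embeddings[0]))
print(similarity)
```

With real model output, the same comparison would apply to the rows of `embeddings` from the README snippet, where each row has 768 entries rather than 3.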