broadfield-dev
/

bert-small-ner-pii-mobile

Token Classification

Model card Files Files and versions

broadfield-dev commited on Dec 27, 2025

Commit

911426b

·

verified ·

1 Parent(s): 72bd10f

Update README.md

Files changed (1) hide show

README.md +16 -11

README.md CHANGED Viewed

@@ -33,24 +33,29 @@ pip install onnxruntime transformers
 ### Python Example
 ```python
-from transformers import AutoTokenizer
 import onnxruntime as ort
 import numpy as np
-# 1. Load Tokenizer
-tokenizer = AutoTokenizer.from_pretrained("broadfield-dev/bert-small-ner-pii-tuned-12261022-onnx")
-# 2. Load Model
 session = ort.InferenceSession("model.onnx")
-# 3. Preprocess
-text = "This is a test sentence."
-inputs = tokenizer(text, return_tensors="np")
-# 4. Inference
-outputs = session.run(None, dict(inputs))
-print(outputs[0])
 ```

 ### Python Example
 ```python
+from tokenizers import Tokenizer
 import onnxruntime as ort
 import numpy as np
+# 1. Load the lightweight tokenizer (No Transformers dependency needed)
+tokenizer = Tokenizer.from_pretrained("broadfield-dev/bert-small-ner-pii-tuned-12261022-onnx")
+# 2. Load the ONNX model
 session = ort.InferenceSession("model.onnx")
+# 3. Preprocess (Simple text encoding)
+text = "Run inference on mobile!"
+encoding = tokenizer.encode(text)
+# Prepare inputs (Exact names vary by model, usually input_ids + attention_mask)
+inputs = {{
+    "input_ids": np.array([encoding.ids], dtype=np.int64),
+    "attention_mask": np.array([encoding.attention_mask], dtype=np.int64)
+}}
+# 4. Run Inference
+outputs = session.run(None, inputs)
+print("Output logits shape:", outputs[0].shape)
 ```