temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3
Third QA release candidate for Irish core PII detection with OpenMed mLiteClinical.
This repository should be evaluated against:
- current public release: `temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v1`
- previous RC: `temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2`
- this repository: `temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3`
This RC is the first version in the IrishCorePII line that matches or beats temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 on every main suite used in this workspace for release gating:
- Irish core multilabel suite
- Irish PPSN and phone edge suite
- exact numeric QA suite
- exact gap-feedback QA suite
- user PPSN regression suite
- multilingual PPSN suite
- strict exact-span boundary suite
This repository also includes a stronger bundled ONNX q8 artifact for CPU inference in onnx/model_quantized.onnx.
Included Variants
| Variant | Artifact | Backend | Recommended Thresholds | Intended Use |
|---|---|---|---|---|
| Full checkpoint | repo root | transformers | ppsn=0.55, other=0.40 | highest-fidelity evaluation and deployment |
| Quantized checkpoint | onnx/model_quantized.onnx | ONNX Runtime dynamic int8 | ppsn=0.60, other=0.45 | CPU-oriented deployment |
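The two thresholds per variant can be read as a per-label score cut-off: one for `PPSN` spans and one for every other label. The following is a minimal sketch of that filtering step, not the logic of the bundled scripts. The span dictionary layout (`label`, `score`) and the variant keys `full` and `onnx_q8` are assumptions made for illustration; only the threshold values come from the table above.

```python
from typing import Dict, List

# Threshold values copied from the variants table.
# The variant keys ("full", "onnx_q8") are hypothetical names for this sketch.
THRESHOLDS: Dict[str, Dict[str, float]] = {
    "full": {"ppsn": 0.55, "other": 0.40},
    "onnx_q8": {"ppsn": 0.60, "other": 0.45},
}


def keep_span(span: Dict, variant: str) -> bool:
    """Return True if the span's score clears the threshold for its label group."""
    limits = THRESHOLDS[variant]
    group = "ppsn" if span["label"].upper() == "PPSN" else "other"
    return span["score"] >= limits[group]


def filter_spans(spans: List[Dict], variant: str) -> List[Dict]:
    """Drop every span that falls below the variant's recommended threshold."""
    return [s for s in spans if keep_span(s, variant)]


# Assumed span layout: a label name plus a confidence score in [0, 1].
candidates = [
    {"label": "PPSN", "score": 0.57},
    {"label": "email", "score": 0.42},
]
kept_full = filter_spans(candidates, "full")
kept_q8 = filter_spans(candidates, "onnx_q8")
```

With these example scores, both spans pass the full-checkpoint thresholds and neither passes the stricter q8 thresholds, which shows why the two variants should not share one threshold setting.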
Coverage
- `PPSN`
- `account_number`
- `bank_routing_number`
- `credit_debit_card`
- `PASSPORT_NUMBER`
- `postcode`
- `phone_number`
- `email`
- `first_name`
- `last_name`
- `swift_bic`
Recommended Inference
Full checkpoint:
python3 inference_mask.py --model temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 --ppsn-min-score 0.55 --other-min-score 0.40 --text "My sort code is 90-00-17 for AIB." --json
Fast CPU path with the bundled ONNX q8 artifact:
python3 inference_mask_onnx.py --model temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 --ppsn-min-score 0.60 --other-min-score 0.45 --text "Please provide your passport: NN5123456." --json
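For readers who want to post-process detections themselves, masking can be sketched as replacing each detected character span with a label placeholder. This is an illustrative sketch only: the JSON output schema of `inference_mask.py` is not documented here, so the span layout (`start`, `end`, `label`), the `[LABEL]` placeholder style, and the mapping of a sort code to `bank_routing_number` are all assumptions.

```python
from typing import Dict, List


def mask_text(text: str, spans: List[Dict]) -> str:
    """Replace each [start, end) span with a bracketed label placeholder.

    Spans are applied right to left so that earlier offsets stay valid
    after each replacement. Overlapping spans are not handled.
    """
    masked = text
    for span in sorted(spans, key=lambda s: s["start"], reverse=True):
        placeholder = f"[{span['label'].upper()}]"
        masked = masked[: span["start"]] + placeholder + masked[span["end"]:]
    return masked


# Example sentence taken from the command above; the span is hand-built
# for illustration and is not real model output.
text = "My sort code is 90-00-17 for AIB."
start = text.index("90-00-17")
spans = [{"start": start, "end": start + len("90-00-17"), "label": "bank_routing_number"}]
masked = mask_text(text, spans)
```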
What Improved Versus temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2
Full checkpoint:
- core suite F1: 0.9554 -> 0.9806
- edge suite F1: 0.9500 -> 1.0000
- exact numeric QA F1: 0.8966 -> 0.9333
- exact gap-feedback QA F1: 0.8696 -> 0.9167
- user PPSN regression F1: 0.8571 -> 1.0000
- multilingual PPSN F1: 0.8038 -> 0.9333
Bundled ONNX q8:
- core suite F1: 0.9677 -> 0.9677
- edge suite F1: 0.9500 -> 1.0000
- exact numeric QA F1: 0.8667 -> 0.9333
- exact gap-feedback QA F1: 0.8696 -> 0.9167
- user PPSN regression F1: 0.8571 -> 1.0000
- multilingual PPSN F1: 0.8077 -> 0.9333
Quality Snapshot
Key label changes on the Irish core suite for the full checkpoint:
- `PASSPORT_NUMBER`: 0.8000 -> 1.0000
- `POSTCODE`: 0.7500 -> 1.0000
- `PPSN`: 0.8571 -> 0.9231
- `PHONE_NUMBER` on the numeric QA suite: 0.7500 -> 0.8889
Benchmark Tables
Broader CPU benchmarks:
| Variant | User PPSN | Core | Edge | Multilingual PPSN | Strict Remaining IoU=1.0 |
|---|---|---|---|---|---|
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 full | 0.8571 | 0.9554 | 0.9500 | 0.8038 | 0.4000 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 full | 1.0000 | 0.9806 | 1.0000 | 0.9333 | 0.4444 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 ONNX q8 | 0.8571 | 0.9677 | 0.9500 | 0.8077 | 0.6000 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 ONNX q8 | 1.0000 | 0.9677 | 1.0000 | 0.9333 | 0.6667 |
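The last column scores spans at IoU=1.0, which is read here as exact character-boundary agreement between a predicted and a gold span. The sketch below shows how span IoU and an exact-match F1 could be computed under that reading. The tuple layout `(start, end, label)` and the function names are assumptions; the actual scoring harness in `eval/` may differ.

```python
from typing import List, Tuple

Span = Tuple[int, int, str]  # (start, end, label), end exclusive; assumed layout


def span_iou(a: Tuple[int, int], b: Tuple[int, int]) -> float:
    """Intersection over union of two half-open character ranges."""
    inter = max(0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union else 0.0


def exact_match_f1(gold: List[Span], pred: List[Span]) -> float:
    """F1 where a prediction counts only if start, end, and label all match."""
    true_pos = len(set(gold) & set(pred))
    precision = true_pos / len(pred) if pred else 0.0
    recall = true_pos / len(gold) if gold else 0.0
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# "Call me on 0851234567 tomorrow." has the phone number at characters 11..21.
gold = [(11, 21, "phone_number")]
trimmed = [(11, 20, "phone_number")]  # final digit dropped
iou = span_iou(gold[0][:2], trimmed[0][:2])
strict = exact_match_f1(gold, trimmed)
```

A span that loses one of ten digits still has an IoU of 0.9, yet it scores zero under the exact-match criterion, which is why strict boundary numbers sit well below the regular F1 columns.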
Exact QA suites:
| Variant | Numeric v2 F1 | Numeric Passport | Numeric Routing | Numeric Phone | Gap F1 | Gap Passport | Gap Routing | Gap Phone |
|---|---|---|---|---|---|---|---|---|
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 full | 0.8966 | 0.9091 | 1.0000 | 0.7500 | 0.8696 | 0.8889 | 1.0000 | 0.6667 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 full | 0.9333 | 0.9091 | 1.0000 | 0.8889 | 0.9167 | 0.8889 | 1.0000 | 0.8571 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 ONNX q8 | 0.8667 | 0.8333 | 1.0000 | 0.7500 | 0.8696 | 0.8889 | 1.0000 | 0.6667 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 ONNX q8 | 0.9333 | 0.9091 | 1.0000 | 0.8889 | 0.9167 | 0.8889 | 1.0000 | 0.8571 |
CPU throughput:
| Variant | Core ex/s | Edge ex/s | Multilingual PPSN ex/s |
|---|---|---|---|
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 full | 58.3521 | 63.1592 | 128.1891 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 full | 64.8768 | 62.4612 | 126.4996 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 ONNX q8 | 44.9578 | 45.5786 | 297.5262 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 ONNX q8 | 44.9398 | 41.7212 | 190.8299 |
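The ex/s columns report examples processed per second on CPU. To reproduce a comparable number on your own hardware, a simple timing loop is usually enough. This sketch assumes a hypothetical `predict` callable that takes one text at a time; the harness used for the published figures is not described here and may batch or warm up differently.

```python
import time
from typing import Callable, List


def examples_per_second(
    predict: Callable[[str], object], texts: List[str], warmup: int = 1
) -> float:
    """Time single-example inference and return throughput in examples/second."""
    # Warm-up calls are excluded from timing (lazy initialisation, caches).
    for text in texts[:warmup]:
        predict(text)

    start = time.perf_counter()
    for text in texts:
        predict(text)
    elapsed = time.perf_counter() - start
    return len(texts) / elapsed if elapsed > 0 else float("inf")


# Stand-in predictor for illustration; replace with a real model call.
rate = examples_per_second(lambda t: t.upper(), ["Call me on 0851234567 tomorrow."] * 100)
```

Throughput depends on text length, thread settings, and machine load, so compare variants on the same suite and the same machine.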
Quantized Artifact
The bundled quantized artifact is:
onnx/model_quantized.onnx
For this release line, the promoted q8 recipe remains the standard dynamic int8 ONNX export with per-channel quantization. Two alternative q8 recipes were tested locally and not promoted because they reduced multilingual PPSN quality.
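A dynamic int8 export with per-channel quantization can be sketched with ONNX Runtime's `quantize_dynamic` helper. This is an approximation of the recipe described above, not the exact script used for this release: the choice of `QuantType.QInt8` for weights, the function name, and both file paths are assumptions.

```python
def quantize_dynamic_int8(fp32_path: str, q8_path: str) -> None:
    """Write a dynamically quantized int8 copy of an fp32 ONNX model.

    onnxruntime is imported lazily so this module can be loaded
    on machines where it is not installed.
    """
    from onnxruntime.quantization import QuantType, quantize_dynamic

    quantize_dynamic(
        model_input=fp32_path,
        model_output=q8_path,
        per_channel=True,             # per-channel weight scales, as stated above
        weight_type=QuantType.QInt8,  # assumption: signed int8 weights
    )


# Hypothetical usage; "onnx/model.onnx" is an assumed fp32 export path:
# quantize_dynamic_int8("onnx/model.onnx", "onnx/model_quantized.onnx")
```

After any re-quantization, rerun the multilingual PPSN suite, since that is the metric on which the alternative recipes regressed.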
Known Limits
This is still a raw-model-only release candidate. QA should still test these carefully:
- `Passport PA 1234567 was used to board the flight.`
- `Usaideadh pas PA 1234567 chun dul ar bord an eitilt.`
- `My PPSN is 1234567T and my sort code is 90-00-17.`
- the full checkpoint can still trim the final digit in some short English compact-phone contexts such as `Call me on 0851234567 tomorrow.`
- multilingual PPSN quality is stronger than `temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2`, but still below `temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v1` on the multilingual PPSN suite
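The trimmed-digit issue can be caught mechanically in QA by checking whether a predicted span starts or ends inside a run of digits. The helper below is a hypothetical check written for this purpose, assuming predictions are available as character offsets; it is not part of the bundled scripts.

```python
def cuts_digit_run(text: str, start: int, end: int) -> bool:
    """Return True if the half-open span [start, end) splits a run of digits."""
    cuts_left = (
        0 < start < len(text)
        and text[start - 1].isdigit()
        and text[start].isdigit()
    )
    cuts_right = (
        0 < end < len(text)
        and text[end - 1].isdigit()
        and text[end].isdigit()
    )
    return cuts_left or cuts_right


sample = "Call me on 0851234567 tomorrow."
trimmed_flagged = cuts_digit_run(sample, 11, 20)  # span stops before the final "7"
full_flagged = cuts_digit_run(sample, 11, 21)     # span covers the whole number
```

Flagged spans are candidates for manual review rather than automatic failures, since some formats legitimately mix digits across label boundaries.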
Included Files
- full `transformers` checkpoint in the repo root
- dynamic int8 ONNX artifact in `onnx/model_quantized.onnx`
- `inference_mask.py`
- `inference_mask_onnx.py`
- `qa_config.json`
- `training_sources.json`
- benchmark summaries in `eval/`
License And Attribution
- release license: Apache-2.0
- base model: `OpenMed/OpenMed-PII-mLiteClinical-Base-135M-v1`
- upstream attributed data: `joelniklaus/mapa`, `gretelai/synthetic_pii_finance_multilingual`
- synthetic Irish training and replay data created in this workspace
See NOTICE for attribution details.
Portfolio Comparison
Updated: 2026-03-15.
Use this section for the fastest public comparison across the temsa PII masking portfolio.
- The first core table only includes public checkpoints that ship both comparable q8 accuracy and q8 CPU throughput.
- The first PPSN table only includes public artifacts that ship comparable PPSN accuracy and CPU throughput.
- Missing cells in the archive tables mean the older release did not ship that metric in its public bundle.
- DiffMask rows use the reconciled `clean_single_pass` harness that matches the deployed runtime.
- GlobalPointer rows use the public raw-only span-matrix release bundle and its packaged q8 ONNX artifact.
- The same content is shipped as `PORTFOLIO_COMPARISON.md` inside each public model repo.
Irish Core PII: Comparable Public Checkpoints
| Repo | Stack | Full Core F1 | Q8 Core F1 | Q8 Multilingual PPSN F1 | Q8 Core ex/s |
|---|---|---|---|---|---|
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc17 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 125.5 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc16 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 125.5 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc15 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 125.5 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc14 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 119.2 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc13 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 126.1 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc12 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 73.6 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc11 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 94.1 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc10 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 125.8 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc9 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 119.8 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc8 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 128.9 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc7 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 89.0 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc6 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 89.0 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc5 | GlobalPointer raw-only + context labels | 1.0000 | 1.0000 | 0.9333 | 84.5 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc4 | GlobalPointer raw-only + context labels | 0.9935 | 0.9935 | 0.9333 | 61.5 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc3 | GlobalPointer raw-only + context labels | 0.9935 | 0.9935 | 0.9333 | 61.5 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc2 | GlobalPointer raw-only + context labels | 0.9935 | 0.9935 | 0.9222 | 61.5 |
| temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc1 | GlobalPointer raw-only + context labels | 0.9935 | 0.9935 | 0.9222 | 61.5 |
| temsa/IrishCore-GlobalPointer-135M-v1-rc4 | GlobalPointer raw-only span-matrix | 1.0000 | 1.0000 | 0.9333 | 221.6 |
| temsa/IrishCore-GlobalPointer-135M-v1-rc3 | GlobalPointer raw-only span-matrix | 1.0000 | 1.0000 | 0.9213 | 204.9 |
| temsa/IrishCore-GlobalPointer-135M-v1-rc2 | GlobalPointer raw-only span-matrix | 0.9934 | 0.9934 | 0.9326 | 231.2 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc8 | Raw-only token-span | 0.9737 | 0.9737 | 0.9176 | 46.1 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc7 | Hybrid classifier + generated scanner spec | 1.0000 | 0.9934 | 1.0000 | 30.0 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc6 | Hybrid classifier + repair decoders | 1.0000 | 0.9934 | 1.0000 | 29.5 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc5 | Hybrid classifier + repair decoders | 0.9737 | 0.9669 | 0.9333 | 34.4 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc4 | Hybrid classifier + repair decoders | 0.9870 | 0.9740 | 0.9600 | 114.2 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 | Hybrid classifier + repair decoders | 0.9806 | 0.9677 | 0.9333 | 44.9 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 | Hybrid classifier + repair decoders | 0.9554 | 0.9615 | 0.7887 | 119.1 |
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v1 | Hybrid classifier baseline | 0.9530 | 0.9333 | 0.9882 | 103.3 |
| temsa/IrishCore-DiffMask-135M-v1-rc6 | DiffMask token-span, scanner-free | 0.9801 | 0.9733 | 0.9274 | 130.3 |
| temsa/IrishCore-DiffMask-135M-v1-rc5 | DiffMask token-span, scanner-free | 0.9733 | 0.9733 | 0.9379 | 249.2 |
| temsa/IrishCore-DiffMask-135M-v1-rc4 | DiffMask token-span, scanner-free | 0.9733 | 0.9733 | 0.9371 | 29.5 |
| temsa/IrishCore-DiffMask-135M-v1-rc3 | DiffMask token-span, scanner-free | 0.9664 | 0.9664 | 0.9591 | 30.0 |
| temsa/IrishCore-DiffMask-135M-v1-rc2 | DiffMask token-span, scanner-free | 0.9664 | 0.9664 | 0.9212 | 247.1 |
| temsa/IrishCore-DiffMask-135M-v1-rc1 | DiffMask token-span, scanner-free | 0.9801 | 0.9934 | 0.9412 | 251.2 |
Irish Core PII: Other Public Checkpoints
| Repo | Stack | Full Core F1 | Q8 Core F1 | Q8 Multilingual PPSN F1 | Notes |
|---|---|---|---|---|---|
| temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc1 | Hybrid classifier prototype | 0.9487 | n/a | n/a | Predates the public q8 artifact. |
Finance-boundary q8 F1 is 1.0000 for OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc6, OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc7, OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc8, and all public IrishCore-DiffMask releases from rc1 to rc6. OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc5 ships 0.8750 on that public q8 suite.
PPSN-Only: Comparable Public Artifacts
| Repo | Artifact | Irish Large F1 | Multilingual PPSN F1 | User Raw F1 | QA v8 F1 | CPU ex/s |
|---|---|---|---|---|---|---|
| temsa/OpenMed-mLiteClinical-IrishPPSN-135M-v1 | fp32 canonical checkpoint | 0.8979 | 0.9704 | 0.8000 | 0.7385 | 57.4 |
| temsa/OpenMed-mLiteClinical-IrishPPSN-135M-v1-fp16 | fp16 CPU/GPU artifact | n/a | 0.9704 | 0.8000 | 0.7385 | 45.8 |
| temsa/OpenMed-mLiteClinical-IrishPPSN-135M-v1-q8 | dynamic int8 CPU artifact | n/a | 0.9040 | n/a | n/a | 132.1 |
PPSN-Only: Historical Public Checkpoints
| Repo | Main Published Metrics | Notes |
|---|---|---|
| temsa/OpenMed-PPSN-mLiteClinical-v1 | same as canonical fp32 repo: multilingual 0.9704, user raw 0.8000 | Legacy alias; prefer temsa/OpenMed-mLiteClinical-IrishPPSN-135M-v1. |
| temsa/OpenMed-PPSN-v6-raw-rc2 | irish_reg_v5 0.8750; user_raw 0.8000; qa_v8 0.7385 | Raw PPSN-only research checkpoint; no packaged multilingual CPU benchmark row. |
| temsa/OpenMed-PPSN-v5_1 | irish_large_v2 raw 0.9285; qa_v6 hybrid strict 1.0000 | Hybrid PPSN-only checkpoint; predates the canonical multilingual suite packaging. |
| temsa/OpenMed-PPSN-v5 | irish_reg_v5 raw 0.8235; irish_reg_v5 hybrid strict 1.0000 | Hybrid PPSN-only checkpoint; predates the canonical multilingual suite packaging. |
| temsa/OpenMed-PPSN-v4 | synthetic non-PPSN drift check only | Predates the current PPSN eval suite; no packaged apples-to-apples multilingual CPU row. |
If you need the strongest current raw-only Irish core model, start with IrishCore-GlobalPointer-135M-v1-rc4. If you need the fastest CPU-first raw-only line, compare it against IrishCore-DiffMask-135M-v1-rc6. If you need a PPSN-only artifact, compare the canonical fp32, fp16, and q8 variants of OpenMed-mLiteClinical-IrishPPSN-135M-v1 directly in the table above.