temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3

Third QA release candidate for Irish core PII detection with OpenMed mLiteClinical.

This repository should be evaluated against:

  • current public release: temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v1
  • previous RC: temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2
  • this repository: temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3

This RC is the first version in the IrishCorePII line that is stronger than temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 across the main suites used in this workspace for release gating:

  • Irish core multilabel suite
  • Irish PPSN and phone edge suite
  • exact numeric QA suite
  • exact gap-feedback QA suite
  • user PPSN regression suite
  • multilingual PPSN suite
  • strict exact-span boundary suite

This repository also includes a stronger bundled ONNX q8 artifact for CPU inference in onnx/model_quantized.onnx.

Included Variants

Variant Artifact Backend Recommended Thresholds Intended Use
Full checkpoint repo root transformers ppsn=0.55, other=0.40 highest-fidelity evaluation and deployment
Quantized checkpoint onnx/model_quantized.onnx ONNX Runtime dynamic int8 ppsn=0.60, other=0.45 CPU-oriented deployment

Coverage

  • PPSN
  • account_number
  • bank_routing_number
  • credit_debit_card
  • PASSPORT_NUMBER
  • postcode
  • phone_number
  • email
  • first_name
  • last_name
  • swift_bic

Recommended Inference

Full checkpoint:

python3 inference_mask.py   --model temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3   --ppsn-min-score 0.55   --other-min-score 0.40   --text "My sort code is 90-00-17 for AIB."   --json

Fast CPU path with the bundled ONNX q8 artifact:

python3 inference_mask_onnx.py   --model temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3   --ppsn-min-score 0.60   --other-min-score 0.45   --text "Please provide your passport: NN5123456."   --json

What Improved Versus temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2

Full checkpoint:

  • core suite F1: 0.9554 -> 0.9806
  • edge suite F1: 0.9500 -> 1.0000
  • exact numeric QA F1: 0.8966 -> 0.9333
  • exact gap-feedback QA F1: 0.8696 -> 0.9167
  • user PPSN regression F1: 0.8571 -> 1.0000
  • multilingual PPSN F1: 0.8038 -> 0.9333

Bundled ONNX q8:

  • core suite F1: 0.9677 -> 0.9677
  • edge suite F1: 0.9500 -> 1.0000
  • exact numeric QA F1: 0.8667 -> 0.9333
  • exact gap-feedback QA F1: 0.8696 -> 0.9167
  • user PPSN regression F1: 0.8571 -> 1.0000
  • multilingual PPSN F1: 0.8077 -> 0.9333

Quality Snapshot

Key label changes on the Irish core suite for the full checkpoint:

  • PASSPORT_NUMBER: 0.8000 -> 1.0000
  • POSTCODE: 0.7500 -> 1.0000
  • PPSN: 0.8571 -> 0.9231
  • PHONE_NUMBER on the numeric QA suite: 0.7500 -> 0.8889

Benchmark Tables

Broader CPU benchmarks:

Variant User PPSN Core Edge Multilingual PPSN Strict Remaining IoU=1.0
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 full 0.8571 0.9554 0.9500 0.8038 0.4000
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 full 1.0000 0.9806 1.0000 0.9333 0.4444
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 ONNX q8 0.8571 0.9677 0.9500 0.8077 0.6000
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 ONNX q8 1.0000 0.9677 1.0000 0.9333 0.6667

Exact QA suites:

Variant Numeric v2 Passport Routing Phone Gap Passport Routing Phone
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 full 0.8966 0.9091 1.0000 0.7500 0.8696 0.8889 1.0000 0.6667
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 full 0.9333 0.9091 1.0000 0.8889 0.9167 0.8889 1.0000 0.8571
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 ONNX q8 0.8667 0.8333 1.0000 0.7500 0.8696 0.8889 1.0000 0.6667
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 ONNX q8 0.9333 0.9091 1.0000 0.8889 0.9167 0.8889 1.0000 0.8571

CPU throughput:

Variant Core ex/s Edge ex/s Multilingual PPSN ex/s
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 full 58.3521 63.1592 128.1891
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 full 64.8768 62.4612 126.4996
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 ONNX q8 44.9578 45.5786 297.5262
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 ONNX q8 44.9398 41.7212 190.8299

Quantized Artifact

The bundled quantized artifact is:

  • onnx/model_quantized.onnx

For this release line, the promoted q8 recipe remains the standard dynamic int8 ONNX export with per-channel quantization. Two alternative q8 recipes were tested locally and not promoted because they reduced multilingual PPSN quality.

Known Limits

This is still a raw-model-only release candidate. QA should still test these carefully:

  • Passport PA 1234567 was used to board the flight.
  • Usaideadh pas PA 1234567 chun dul ar bord an eitilt.
  • My PPSN is 1234567T and my sort code is 90-00-17.
  • the full checkpoint can still trim the final digit in some short English compact-phone contexts such as Call me on 0851234567 tomorrow.
  • multilingual PPSN quality is stronger than temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2, but still below temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v1 on the multilingual PPSN suite

Included Files

  • full transformers checkpoint in the repo root
  • dynamic int8 ONNX artifact in onnx/model_quantized.onnx
  • inference_mask.py
  • inference_mask_onnx.py
  • qa_config.json
  • training_sources.json
  • benchmark summaries in eval/

License And Attribution

  • release license: Apache-2.0
  • base model: OpenMed/OpenMed-PII-mLiteClinical-Base-135M-v1
  • upstream attributed data: joelniklaus/mapa, gretelai/synthetic_pii_finance_multilingual
  • synthetic Irish training and replay data created in this workspace

See NOTICE for attribution details.

Portfolio Comparison

Updated: 2026-03-15.

Use this section for the fastest public comparison across the temsa PII masking portfolio.

  • The first core table only includes public checkpoints that ship both comparable q8 accuracy and q8 CPU throughput.
  • The first PPSN table only includes public artifacts that ship comparable PPSN accuracy and CPU throughput.
  • Missing cells in the archive tables mean the older release did not ship that metric in its public bundle.
  • DiffMask rows use the reconciled clean_single_pass harness that matches the deployed runtime.
  • GlobalPointer rows use the public raw-only span-matrix release bundle and its packaged q8 ONNX artifact.
  • The same content is shipped as PORTFOLIO_COMPARISON.md inside each public model repo.

Irish Core PII: Comparable Public Checkpoints

Repo Stack Full Core F1 Q8 Core F1 Q8 Multilingual PPSN F1 Q8 Core ex/s
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc17 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 125.5
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc16 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 125.5
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc15 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 125.5
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc14 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 119.2
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc13 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 126.1
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc12 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 73.6
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc11 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 94.1
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc10 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 125.8
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc9 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 119.8
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc8 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 128.9
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc7 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 89.0
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc6 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 89.0
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc5 GlobalPointer raw-only + context labels 1.0000 1.0000 0.9333 84.5
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc4 GlobalPointer raw-only + context labels 0.9935 0.9935 0.9333 61.5
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc3 GlobalPointer raw-only + context labels 0.9935 0.9935 0.9333 61.5
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc2 GlobalPointer raw-only + context labels 0.9935 0.9935 0.9222 61.5
temsa/IrishCore-GlobalPointer-ContextPII-135M-v1-rc1 GlobalPointer raw-only + context labels 0.9935 0.9935 0.9222 61.5
temsa/IrishCore-GlobalPointer-135M-v1-rc4 GlobalPointer raw-only span-matrix 1.0000 1.0000 0.9333 221.6
temsa/IrishCore-GlobalPointer-135M-v1-rc3 GlobalPointer raw-only span-matrix 1.0000 1.0000 0.9213 204.9
temsa/IrishCore-GlobalPointer-135M-v1-rc2 GlobalPointer raw-only span-matrix 0.9934 0.9934 0.9326 231.2
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc8 Raw-only token-span 0.9737 0.9737 0.9176 46.1
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc7 Hybrid classifier + generated scanner spec 1.0000 0.9934 1.0000 30.0
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc6 Hybrid classifier + repair decoders 1.0000 0.9934 1.0000 29.5
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc5 Hybrid classifier + repair decoders 0.9737 0.9669 0.9333 34.4
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc4 Hybrid classifier + repair decoders 0.9870 0.9740 0.9600 114.2
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3 Hybrid classifier + repair decoders 0.9806 0.9677 0.9333 44.9
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc2 Hybrid classifier + repair decoders 0.9554 0.9615 0.7887 119.1
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v1 Hybrid classifier baseline 0.9530 0.9333 0.9882 103.3
temsa/IrishCore-DiffMask-135M-v1-rc6 DiffMask token-span, scanner-free 0.9801 0.9733 0.9274 130.3
temsa/IrishCore-DiffMask-135M-v1-rc5 DiffMask token-span, scanner-free 0.9733 0.9733 0.9379 249.2
temsa/IrishCore-DiffMask-135M-v1-rc4 DiffMask token-span, scanner-free 0.9733 0.9733 0.9371 29.5
temsa/IrishCore-DiffMask-135M-v1-rc3 DiffMask token-span, scanner-free 0.9664 0.9664 0.9591 30.0
temsa/IrishCore-DiffMask-135M-v1-rc2 DiffMask token-span, scanner-free 0.9664 0.9664 0.9212 247.1
temsa/IrishCore-DiffMask-135M-v1-rc1 DiffMask token-span, scanner-free 0.9801 0.9934 0.9412 251.2

Irish Core PII: Other Public Checkpoints

Repo Stack Full Core F1 Q8 Core F1 Q8 Multilingual PPSN F1 Notes
temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc1 Hybrid classifier prototype 0.9487 โ€” โ€” Predates the public q8 artifact.

Finance-boundary q8 F1 is 1.0000 for OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc6, OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc7, OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc8, and all public IrishCore-DiffMask releases from rc1 to rc6. OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc5 ships 0.8750 on that public q8 suite.

PPSN-Only: Comparable Public Artifacts

Repo Artifact Irish Large F1 Multilingual PPSN F1 User Raw F1 QA v8 F1 CPU ex/s
temsa/OpenMed-mLiteClinical-IrishPPSN-135M-v1 fp32 canonical checkpoint 0.8979 0.9704 0.8000 0.7385 57.4
temsa/OpenMed-mLiteClinical-IrishPPSN-135M-v1-fp16 fp16 CPU/GPU artifact โ€” 0.9704 0.8000 0.7385 45.8
temsa/OpenMed-mLiteClinical-IrishPPSN-135M-v1-q8 dynamic int8 CPU artifact โ€” 0.9040 โ€” โ€” 132.1

PPSN-Only: Historical Public Checkpoints

Repo Main Published Metrics Notes
temsa/OpenMed-PPSN-mLiteClinical-v1 same as canonical fp32 repo: multilingual 0.9704, user raw 0.8000 Legacy alias; prefer temsa/OpenMed-mLiteClinical-IrishPPSN-135M-v1.
temsa/OpenMed-PPSN-v6-raw-rc2 irish_reg_v5 0.8750; user_raw 0.8000; qa_v8 0.7385 Raw PPSN-only research checkpoint; no packaged multilingual CPU benchmark row.
temsa/OpenMed-PPSN-v5_1 irish_large_v2 raw 0.9285; qa_v6 hybrid strict 1.0000 Hybrid PPSN-only checkpoint; predates the canonical multilingual suite packaging.
temsa/OpenMed-PPSN-v5 irish_reg_v5 raw 0.8235; irish_reg_v5 hybrid strict 1.0000 Hybrid PPSN-only checkpoint; predates the canonical multilingual suite packaging.
temsa/OpenMed-PPSN-v4 synthetic non-PPSN drift check only Predates the current PPSN eval suite; no packaged apples-to-apples multilingual CPU row.

If you need the strongest current raw-only Irish core model, start with IrishCore-GlobalPointer-135M-v1-rc4. If you need the fastest CPU-first raw-only line, compare it against IrishCore-DiffMask-135M-v1-rc6. If you need a PPSN-only artifact, compare the canonical fp32, fp16, and q8 variants of OpenMed-mLiteClinical-IrishPPSN-135M-v1 directly in the table above.

Downloads last month
112
Safetensors
Model size
0.1B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for temsa/OpenMed-mLiteClinical-IrishCorePII-135M-v2-rc3