Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,37 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
language:
|
| 4 |
+
- en
|
| 5 |
+
---
|
| 6 |
+
|
| 7 |
+
# DistilBERT-Reconciler (v1)
|
| 8 |
+
|
| 9 |
+
Fine-tuned **DistilBERT** on 3.2 M labelled *post-trade break* descriptions +
|
| 10 |
+
resolution actions (ISO 20022 & proprietary logs).
|
| 11 |
+
|
| 12 |
+
| split | accuracy | micro-F1 | macro-F1 |
|
| 13 |
+
|-------|----------|----------|----------|
|
| 14 |
+
| hold-out (20 %) | **0.88** | **0.88** | **0.85** |
|
| 15 |
+
|
| 16 |
+
## Intended use
|
| 17 |
+
|
| 18 |
+
Automated classification of reconciliation exceptions in fixed-income
|
| 19 |
+
settlement workflows (CUSIP/ISIN). Produces `label_id` then mapped to human
|
| 20 |
+
root-cause & recommended next action.
|
| 21 |
+
|
| 22 |
+
Not for retail investment advice.
|
| 23 |
+
|
| 24 |
+
## Training details
|
| 25 |
+
* **Base** : `distilbert-base-uncased`
|
| 26 |
+
* **Epochs** : 4 • lr = 3e-5 • max_len = 256
|
| 27 |
+
* **Hardware** : 2× A100 40 GB
|
| 28 |
+
* **Loss curve & confusion matrix** : see `/training_artifacts/`.
|
| 29 |
+
|
| 30 |
+
## Limitations & bias
|
| 31 |
+
|
| 32 |
+
Labels derived from North-American corporate-bond desks (2019–2025). May
|
| 33 |
+
under-perform on equities or non-USD repos without re-training.
|
| 34 |
+
|
| 35 |
+
## Citation
|
| 36 |
+
|
| 37 |
+
> Kelvin Musodza, *Exception Handling & Reconciliation for Fixed-Income Trading*, Coreledger (2025). DOI: 10.5281/zenodo.1234567
|