Update README.md
Browse files
README.md
CHANGED
|
@@ -48,7 +48,7 @@ BankAI-BERT was fine-tuned on a manually annotated dataset comprising sentences
|
|
| 48 |
| Hardware | Google Colab GPU (T4) |
|
| 49 |
|
| 50 |
## Evaluation & Robustness
|
| 51 |
-
* Benchmarked against Logistic Regression, Naive Bayes, Random Forest, and XGBoost (TF-IDF features);
|
| 52 |
* Calibration checked via Brier Score (0 = perfect).
|
| 53 |
* SHAP analysis shows the model focuses on meaningful cues (e.g., machine learning, AI-powered)—not noise—ensuring interpretability and trust.
|
| 54 |
* Robust to:
|
|
@@ -60,12 +60,15 @@ BankAI-BERT was fine-tuned on a manually annotated dataset comprising sentences
|
|
| 60 |
- `config.json`, `tokenizer.json`, `vocab.txt`, `model.safetensors`: Model files
|
| 61 |
- `tokenizer_config.json`, `special_tokens_map.json`: Tokenizer configuration
|
| 62 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 63 |
## Citation
|
| 64 |
Please cite my paper if you use this model:
|
| 65 |
-
- **Zafar, M. B. (2025).
|
| 66 |
|
| 67 |
-
## Contact
|
| 68 |
-
For questions or feedback, please contact me at bilalezafar@gmail.com
|
| 69 |
|
| 70 |
## Usage
|
| 71 |
```python
|
|
|
|
| 48 |
| Hardware | Google Colab GPU (T4) |
|
| 49 |
|
| 50 |
## Evaluation & Robustness
|
| 51 |
+
* Benchmarked against Logistic Regression, Naive Bayes, Random Forest, and XGBoost (TF-IDF features); BankAI-BERT scored highest on F1.
|
| 52 |
* Calibration checked via Brier Score (0 = perfect).
|
| 53 |
* SHAP analysis shows the model focuses on meaningful cues (e.g., machine learning, AI-powered)—not noise—ensuring interpretability and trust.
|
| 54 |
* Robust to:
|
|
|
|
| 60 |
- `config.json`, `tokenizer.json`, `vocab.txt`, `model.safetensors`: Model files
|
| 61 |
- `tokenizer_config.json`, `special_tokens_map.json`: Tokenizer configuration
|
| 62 |
|
| 63 |
+
## GitHub Repository
|
| 64 |
+
|
| 65 |
+
For full pipeline, data, and visualizations, see the **[GitHub repository]**(https://github.com/bilalezafar/BankAI-BERT).
|
| 66 |
+
.
|
| 67 |
+
|
| 68 |
## Citation
|
| 69 |
Please cite my paper if you use this model:
|
| 70 |
+
- **Zafar, M. B. (2025). AI in Banking Disclosures: A BERT Classifier and Corpus-Level Thematic Mapping**
|
| 71 |
|
|
|
|
|
|
|
| 72 |
|
| 73 |
## Usage
|
| 74 |
```python
|