Update README.md
Browse files
README.md
CHANGED
|
@@ -11,7 +11,7 @@ metrics:
|
|
| 11 |
model-index:
|
| 12 |
- name: hindi-hindiroberta-edu-classifier
|
| 13 |
results: []
|
| 14 |
-
license:
|
| 15 |
datasets:
|
| 16 |
- Polygl0t/hindi-edu-qwen-annotations
|
| 17 |
language:
|
|
@@ -33,7 +33,11 @@ hindi-roberta-edu-classifier is a [HindRoBERTa](https://huggingface.co/l3cube-pu
|
|
| 33 |
- **Learning Rate:** 3e-4
|
| 34 |
- **Eval Metric:** `f1-score`
|
| 35 |
|
| 36 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 37 |
|
| 38 |
| | **1** | **2** | **3** | **4** | **5** |
|
| 39 |
|-------|-------|-------|-------|-------|-------|
|
|
@@ -43,6 +47,32 @@ hindi-roberta-edu-classifier is a [HindRoBERTa](https://huggingface.co/l3cube-pu
|
|
| 43 |
| **4** | 3 | 56 | 325 | 174 | 3 |
|
| 44 |
| **5** | 0 | 0 | 2 | 1 | 0 |
|
| 45 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 46 |
## Usage
|
| 47 |
|
| 48 |
Here's an example of how to use the Edu Classifier:
|
|
@@ -84,8 +114,10 @@ print({
|
|
| 84 |
|
| 85 |
## Aknowlegments
|
| 86 |
|
| 87 |
-
|
|
|
|
|
|
|
| 88 |
|
| 89 |
## License
|
| 90 |
|
| 91 |
-
hindi-roberta-
|
|
|
|
| 11 |
model-index:
|
| 12 |
- name: hindi-hindiroberta-edu-classifier
|
| 13 |
results: []
|
| 14 |
+
license: other
|
| 15 |
datasets:
|
| 16 |
- Polygl0t/hindi-edu-qwen-annotations
|
| 17 |
language:
|
|
|
|
| 33 |
- **Learning Rate:** 3e-4
|
| 34 |
- **Eval Metric:** `f1-score`
|
| 35 |
|
| 36 |
+
This repository has the [source code](xxx) used to train this model.
|
| 37 |
+
|
| 38 |
+
### Evaluation Results
|
| 39 |
+
|
| 40 |
+
#### Confusion Matrix
|
| 41 |
|
| 42 |
| | **1** | **2** | **3** | **4** | **5** |
|
| 43 |
|-------|-------|-------|-------|-------|-------|
|
|
|
|
| 47 |
| **4** | 3 | 56 | 325 | 174 | 3 |
|
| 48 |
| **5** | 0 | 0 | 2 | 1 | 0 |
|
| 49 |
|
| 50 |
+
- Precision: 0.52385
|
| 51 |
+
- Recall: 0.46223
|
| 52 |
+
- F1 Macro: 0.47855
|
| 53 |
+
- Accuracy: 0.69985
|
| 54 |
+
|
| 55 |
+
|
| 56 |
+
#### Binary Confusion Matrix (< 3 vs ≥ 3)
|
| 57 |
+
|
| 58 |
+
Converting to binary classification where:
|
| 59 |
+
|
| 60 |
+
- Class 0: Labels 1, 2 (< 3)
|
| 61 |
+
- Class 1: Labels 3, 4, 5 (≥ 3)
|
| 62 |
+
|
| 63 |
+
We get the following confusion matrix:
|
| 64 |
+
|
| 65 |
+
| | **< 3** | **≥ 3** |
|
| 66 |
+
|-----------|---------|---------|
|
| 67 |
+
| **< 3** | 16453 | 669 |
|
| 68 |
+
| **≥ 3** | 1072 | 1806 |
|
| 69 |
+
|
| 70 |
+
- Precision:
|
| 71 |
+
- Recall:
|
| 72 |
+
- F1 Score:
|
| 73 |
+
- Accuracy:
|
| 74 |
+
|
| 75 |
+
|
| 76 |
## Usage
|
| 77 |
|
| 78 |
Here's an example of how to use the Edu Classifier:
|
|
|
|
| 114 |
|
| 115 |
## Aknowlegments
|
| 116 |
|
| 117 |
+
Polyglot is a project funded by the Federal Ministry of Education and Research (BMBF) and the Ministry of Culture and Science of the State of North Rhine-Westphalia (MWK) as part of TRA Sustainable Futures (University of Bonn) and the Excellence Strategy of the federal and state governments.
|
| 118 |
+
|
| 119 |
+
We also gratefully acknowledge the granted access to the [Marvin cluster](https://www.hpc.uni-bonn.de/en/systems/marvin) hosted by [University of Bonn](https://www.uni-bonn.de/en) along with the support provided by its High Performance Computing & Analytics Lab.
|
| 120 |
|
| 121 |
## License
|
| 122 |
|
| 123 |
+
According to [l3cube-pune/hindi-roberta](https://huggingface.co/l3cube-pune/hindi-roberta), the model is released under [cc-by-4.0](https://spdx.org/licenses/CC-BY-4.0). For any queries, please get in touch with the authors of the original paper tied to [hindi-roberta](https://huggingface.co/l3cube-pune).
|