Update README.md
README.md
CHANGED
@@ -4,19 +4,8 @@ license: mit
 
 
 
-
-* **
-* **Size:** 2,405 sentences
-
-**Class distribution:**
-* `neutral`: 1,068 (44.41%)
-* `positive`: 1,026 (42.66%)
-* `negative`: 311 (12.93%)
-
-**Splits** (row-wise, stratified by label):
-* **train:** 1,924
-* **validation:** 240
-* **test:** 241
 
 ---
 
+
+**Training data:** The dataset consists of **2,405** custom, *manually annotated* sentences related to Central Bank Digital Currencies (CBDCs). The class distribution is **neutral**: *1,068* (44.41%), **positive**: *1,026* (42.66%), and **negative**: *311* (12.93%). The data is split **row-wise**, stratified by label, into **train**: *1,924*, **validation**: *240*, and **test**: *241* examples.
 
 ---
 
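
The figures in the added **Training data** paragraph can be sanity-checked with a short script. This is a minimal sketch and not part of the commit: the variable names are illustrative, and the counts are copied from the README text above. It checks that the class counts and the split sizes both sum to the stated total, and recomputes the percentages.

```python
# Counts copied from the README text; the dictionary names are illustrative.
class_counts = {"neutral": 1068, "positive": 1026, "negative": 311}
split_sizes = {"train": 1924, "validation": 240, "test": 241}

# Total number of sentences implied by the class counts.
total = sum(class_counts.values())

# Percentage share of each class, rounded to two decimals as in the README.
class_pct = {label: round(100 * n / total, 2) for label, n in class_counts.items()}

# Percentage share of each split, to see how close it is to an 80/10/10 split.
split_pct = {name: round(100 * n / total, 2) for name, n in split_sizes.items()}

# Both breakdowns should describe the same number of rows.
splits_match_total = sum(split_sizes.values()) == total

print(total)
print(class_pct)
print(split_pct)
print(splits_match_total)
```

Under these counts the split works out to roughly 80/10/10, with the one leftover row landing in the test set.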