Update README.md
Browse files
README.md
CHANGED
|
@@ -12,5 +12,7 @@ tags:
|
|
| 12 |
---
|
| 13 |
|
| 14 |
|
| 15 |
-
We used GPT4.1-nano to classify generic texts from OSCAR as non-medical/medical. We labeled 400.000 texts, with about 40.000 labeled as positive.
|
| 16 |
-
We then trained a SequenceClassifier on 80.000 samples with a 50/50 class ratio.
|
|
|
|
|
|
|
|
|
| 12 |
---
|
| 13 |
|
| 14 |
|
| 15 |
+
We used GPT4.1-nano to classify generic texts from OSCAR as non-medical/medical using [PubScience](https://github.com/bramiozo/PubScience/tree/main/pubscience/label). We labeled 400.000 texts, with about 40.000 labeled as positive.
|
| 16 |
+
We then trained a SequenceClassifier on 80.000 samples with a 50/50 class ratio.
|
| 17 |
+
|
| 18 |
+
This can be used e.g.
|