UMCU committed on
Commit 5a5e26d · verified · 1 Parent(s): eba0180

Update README.md

Files changed (1)
  1. README.md +17 -11
README.md CHANGED
@@ -1,8 +1,9 @@
  ---
  id: CardioNER.nl_128xtokenWindow
  name: CardioNER.nl_128xtokenWindow
- description: CardioBERTa.nl_clinical finetuned for multilabel NER task with tokenwindow
- of 128
+ description: >-
+ CardioBERTa.nl_clinical finetuned for multilabel NER task with tokenwindow of
+ 128
  license: gpl-3.0
  language: nl
  tags:
@@ -16,21 +17,28 @@ tags:
  - bionlp
  base_model: UMCU/CardioBERTa.nl_clinical
  pipeline_tag: token-classification
+ datasets:
+ - DT4H/CardioCCC
+ - UMCU/cardioccc_dutch
  ---

- # Model Card for Cardioner.Nl 128Xtokenwindow
-
-
+ # Model Card for Cardioner.nl 128

  This a UMCU/CardioBERTa.nl_clinical base model finetuned for span classification. For this model
  we used IOB-tagging. Using the IOB-tagging schema facilitates the aggregation of predictions
  over sequences. This specific model is trained on a batch of about 500 span-labeled documents.

+ This is version was trained with context windows of 128 tokens. For the chunking we used a paragraph-based splitter.
+
  ### Expected input and output
- The input should be a string with **Dutch** clinical text related to **cardiology**
+ The input should be a string with **Dutch** clinical text related to **cardiology**.

- CardioNER.nl_128xtokenWindow is a multiclass span classification model.
- The classes that can be predicted are **procedure**, **medication**, **disease**, **symptom**.
+ CardioNER.nl_128 is a multiclass span classification model.
+ The classes that can be predicted are
+ * **procedure**,
+ * **medication**,
+ * **disease**,
+ * **symptom**.

  #### Extracting span classification from CardioNER.nl_128xtokenWindow

@@ -69,6 +77,4 @@ This is part of the [DT4H project](https://www.datatools4heart.eu/).


  For more details about training/eval and other scripts, see CardioNER [github repo](https://github.com/DataTools4Heart/CardioNER).
- and for more information on the background, see Datatools4Heart [Huggingface](https://huggingface.co/DT4H)/[Website](https://www.datatools4heart.eu/)
-
-
+ and for more information on the background, see Datatools4Heart [Huggingface](https://huggingface.co/DT4H)/[Website](https://www.datatools4heart.eu/)
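
The README text in this diff says that IOB tagging facilitates aggregating predictions over sequences. As a rough illustration of what that aggregation can look like, here is a minimal sketch in plain Python. It is not taken from the CardioNER repository: the function name `iob_to_spans` and the tag strings (`B-MEDICATION`, `I-DISEASE`, and so on) are hypothetical, and the model's actual label names may differ.

```python
from typing import List, Tuple


def iob_to_spans(tags: List[str]) -> List[Tuple[str, int, int]]:
    """Merge token-level IOB tags into (label, start, end) spans.

    `end` is exclusive. The tag strings are assumed to look like
    "B-LABEL", "I-LABEL" or "O"; this format is an assumption, not
    something confirmed by the model card.
    """
    spans: List[Tuple[str, int, int]] = []
    label = None  # label of the span currently open, if any
    start = 0     # token index where the open span began

    for i, tag in enumerate(tags):
        prefix, _, name = tag.partition("-")

        # An I- tag with the same label extends the open span.
        if prefix == "I" and name == label:
            continue

        # Any other tag closes the open span, if there is one.
        if label is not None:
            spans.append((label, start, i))
            label = None

        # B- opens a new span. A stray I- (no matching open span) is
        # treated leniently as the start of a new span as well.
        if prefix in ("B", "I"):
            label, start = name, i

    # Close a span that runs to the end of the sequence.
    if label is not None:
        spans.append((label, start, len(tags)))
    return spans


# Hypothetical example sentence and tags, for illustration only.
tokens = ["Patiënt", "gebruikt", "metoprolol", "wegens", "atrium", "fibrilleren"]
tags = ["O", "O", "B-MEDICATION", "O", "B-DISEASE", "I-DISEASE"]
for label, start, end in iob_to_spans(tags):
    print(label, " ".join(tokens[start:end]))
```

Because each span is recovered from the `B-`/`I-` prefixes alone, adjacent entities of the same class stay separate, which is the main reason to prefer IOB over plain per-token class labels when merging predictions.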