RaThorat committed on
Commit
97dfebf
·
verified ·
1 Parent(s): 8b4777f

Update README.md

Files changed (1)
  1. README.md +7 -4
README.md CHANGED
@@ -78,7 +78,7 @@ Use the code below to get started with the model.
78
 
79
  ### Training Data
80
 
81
- <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
82
 
83
  [More Information Needed]
84
 
@@ -88,12 +88,12 @@ Use the code below to get started with the model.
88
 
89
  #### Preprocessing [optional]
90
 
91
- [More Information Needed]
92
 
93
 
94
  #### Training Hyperparameters
95
 
96
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
97
 
98
  #### Speeds, Sizes, Times [optional]
99
 
@@ -131,6 +131,9 @@ Use the code below to get started with the model.
131
 
132
  #### Summary
133
 
 
 
 
134
 
135
 
136
  ## Model Examination [optional]
@@ -163,7 +166,7 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
163
 
164
  #### Hardware
165
 
166
- [More Information Needed]
167
 
168
  #### Software
169
 
 
78
 
79
  ### Training Data
80
 
81
+ <!-- 46 txt, pdf, and odt documents from the DUS-I website were used to create chunks (200 words per chunk) in JSON format. -->
82
 
83
  [More Information Needed]
84
 
 
88
 
89
  #### Preprocessing [optional]
90
 
91
+ [Documents grouped (groeperen_segment_text_to_jsonl.py) under labels such as: PROJECT, HANDLEIDING, OVEREENKOMST, PLAN, BELEID, SUBSIDIE.]
92
 
93
 
94
  #### Training Hyperparameters
95
 
96
+ - **Training regime:** [Run with the GroNLP/bert-base-dutch-cased model (110 million parameters).] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
97
 
98
  #### Speeds, Sizes, Times [optional]
99
 
 
131
 
132
  #### Summary
133
 
134
+ Categorization:
135
+
136
+ Script for the textcat model: train_textcat_model.py.
137
 
138
 
139
  ## Model Examination [optional]
 
166
 
167
  #### Hardware
168
 
169
+ [8 vCPUs and 64 GB of RAM were required.]
170
 
171
  #### Software
172
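
The Training Data note in this diff mentions splitting the source documents into chunks of 200 words and storing them as JSON. The commit does not show how that was done, so the following is only a minimal sketch of one possible approach, using the Python standard library. The function names `chunk_words` and `to_jsonl` and the record fields `text` and `label` are hypothetical and are not taken from groeperen_segment_text_to_jsonl.py.

```python
import json


def chunk_words(text, chunk_size=200):
    """Split text into consecutive chunks of at most chunk_size words."""
    words = text.split()
    return [
        " ".join(words[i:i + chunk_size])
        for i in range(0, len(words), chunk_size)
    ]


def to_jsonl(chunks, label):
    """Serialize chunks as JSON Lines, one record per chunk.

    The "text" and "label" field names are assumptions for illustration.
    """
    return "\n".join(
        json.dumps({"text": chunk, "label": label}, ensure_ascii=False)
        for chunk in chunks
    )


if __name__ == "__main__":
    # Synthetic 450-word sample; real input would be text extracted
    # from the txt, pdf, and odt documents.
    sample = " ".join(f"woord{i}" for i in range(450))
    chunks = chunk_words(sample)
    print([len(chunk.split()) for chunk in chunks])
    print(to_jsonl(chunks[:1], "SUBSIDIE")[:80])
```

Splitting on whitespace keeps the sketch simple; the last chunk of a document may be shorter than 200 words, and a real pipeline might merge or drop such remainders.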