UMCU commited on
Commit
532f5e7
·
verified ·
1 Parent(s): fbe9aec

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -3
README.md CHANGED
@@ -9,7 +9,7 @@ license: afl-3.0
9
  ---
10
 
11
 
12
- Useful HF resources and contributors for Dutch NLP are
13
 
14
  ## Individuals
15
  * [Pieter Delobelle](https://huggingface.co/pdelobelle), [homepage](https://pieter.ai/) and [git](https://github.com/ipieter)
@@ -42,7 +42,7 @@ Useful HF resources and contributors for Dutch NLP are
42
  * [Clinlp](https://github.com/umcu/clinlp)
43
 
44
  ## Encoder models
45
- * [RobBERT v2](https://huggingface.co/pdelobelle/robbert-v2-dutch-base)
46
  * [BERTje](https://huggingface.co/GroNLP/bert-base-dutch-cased)
47
  * [BelabBERT](https://huggingface.co/jwouts/belabBERT_115k)
48
  * [MedRoBERTa.nl](https://huggingface.co/CLTL/MedRoBERTa.nl)
@@ -51,6 +51,7 @@ Useful HF resources and contributors for Dutch NLP are
51
  * [GPT-2 on mC4](https://huggingface.co/yhavinga/gpt2-large-dutch), [GPT-2 finetuned on ](https://huggingface.co/GroNLP/gpt2-medium-dutch-embeddings)
52
  * [GPT-neo on mC4](https://huggingface.co/yhavinga/gpt-neo-1.3B-dutch)
53
  * [GEITje (based on Mistral)](https://github.com/Rijgersberg/GEITje)
 
54
 
55
  ## NTMs
56
  * [NLLB200](https://huggingface.co/facebook/nllb-200-3.3B)
@@ -64,4 +65,6 @@ Useful HF resources and contributors for Dutch NLP are
64
  * [COW](https://rolandschaefer.net/archives/142)
65
  * [mc4 cleaned](https://huggingface.co/datasets/yhavinga/mc4_nl_cleaned)
66
  * [TWnC](https://research.utwente.nl/en/publications/twnc-a-multifaceted-dutch-news-corpus)
67
- * [Gigacorpus](http://gigacorpus.nl/)
 
 
 
9
  ---
10
 
11
 
12
+ Useful HF resources and fantastic contributors for Dutch NLP are
13
 
14
  ## Individuals
15
  * [Pieter Delobelle](https://huggingface.co/pdelobelle), [homepage](https://pieter.ai/) and [git](https://github.com/ipieter)
 
42
  * [Clinlp](https://github.com/umcu/clinlp)
43
 
44
  ## Encoder models
45
+ * [RobBERT 2023](https://huggingface.co/DTAI-KULeuven/robbert-2023-dutch-base)
46
  * [BERTje](https://huggingface.co/GroNLP/bert-base-dutch-cased)
47
  * [BelabBERT](https://huggingface.co/jwouts/belabBERT_115k)
48
  * [MedRoBERTa.nl](https://huggingface.co/CLTL/MedRoBERTa.nl)
 
51
  * [GPT-2 on mC4](https://huggingface.co/yhavinga/gpt2-large-dutch), [GPT-2 finetuned on ](https://huggingface.co/GroNLP/gpt2-medium-dutch-embeddings)
52
  * [GPT-neo on mC4](https://huggingface.co/yhavinga/gpt-neo-1.3B-dutch)
53
  * [GEITje (based on Mistral)](https://github.com/Rijgersberg/GEITje)
54
+ * [Fietje (based on Phi-2)](https://huggingface.co/BramVanroy/fietje-2)
55
 
56
  ## NTMs
57
  * [NLLB200](https://huggingface.co/facebook/nllb-200-3.3B)
 
65
  * [COW](https://rolandschaefer.net/archives/142)
66
  * [mc4 cleaned](https://huggingface.co/datasets/yhavinga/mc4_nl_cleaned)
67
  * [TWnC](https://research.utwente.nl/en/publications/twnc-a-multifaceted-dutch-news-corpus)
68
+ * [Gigacorpus](http://gigacorpus.nl/)
69
+ * [CulturaX](https://huggingface.co/datasets/uonlp/CulturaX)
70
+ * [FineWeb](https://huggingface.co/datasets/HuggingFaceFW/fineweb)