Update README.md
README.md
CHANGED
@@ -9,7 +9,7 @@ license: afl-3.0
 ---
 
 
-Useful HF resources and contributors for Dutch NLP are
+Useful HF resources and fantastic contributors for Dutch NLP are
 
 ## Individuals
 * [Pieter Delobelle](https://huggingface.co/pdelobelle), [homepage](https://pieter.ai/) and [git](https://github.com/ipieter)
@@ -42,7 +42,7 @@ Useful HF resources and contributors for Dutch NLP are
 * [Clinlp](https://github.com/umcu/clinlp)
 
 ## Encoder models
-* [RobBERT
+* [RobBERT 2023](https://huggingface.co/DTAI-KULeuven/robbert-2023-dutch-base)
 * [BERTje](https://huggingface.co/GroNLP/bert-base-dutch-cased)
 * [BelabBERT](https://huggingface.co/jwouts/belabBERT_115k)
 * [MedRoBERTa.nl](https://huggingface.co/CLTL/MedRoBERTa.nl)
@@ -51,6 +51,7 @@ Useful HF resources and contributors for Dutch NLP are
 * [GPT-2 on mC4](https://huggingface.co/yhavinga/gpt2-large-dutch), [GPT-2 finetuned on ](https://huggingface.co/GroNLP/gpt2-medium-dutch-embeddings)
 * [GPT-neo on mC4](https://huggingface.co/yhavinga/gpt-neo-1.3B-dutch)
 * [GEITje (based on Mistral)](https://github.com/Rijgersberg/GEITje)
+* [Fietje (based on Phi-2)](https://huggingface.co/BramVanroy/fietje-2)
 
 ## NTMs
 * [NLLB200](https://huggingface.co/facebook/nllb-200-3.3B)
@@ -64,4 +65,6 @@ Useful HF resources and contributors for Dutch NLP are
 * [COW](https://rolandschaefer.net/archives/142)
 * [mc4 cleaned](https://huggingface.co/datasets/yhavinga/mc4_nl_cleaned)
 * [TWnC](https://research.utwente.nl/en/publications/twnc-a-multifaceted-dutch-news-corpus)
-* [Gigacorpus](http://gigacorpus.nl/)
+* [Gigacorpus](http://gigacorpus.nl/)
+* [CulturaX](https://huggingface.co/datasets/uonlp/CulturaX)
+* [FineWeb](https://huggingface.co/datasets/HuggingFaceFW/fineweb)