Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -21,6 +21,7 @@ Individuals
|
|
| 21 |
* [François Remy](https://huggingface.co/FremyCompany), [homepage](http://fremycompany.com) and [git](https://github.com/FremyCompany)
|
| 22 |
|
| 23 |
Organisations
|
|
|
|
| 24 |
* [NLPtown](https://huggingface.co/nlptown) and [homepage](http://nlp.town/)
|
| 25 |
* [doc2query](https://huggingface.co/doc2query)
|
| 26 |
* [LT3, language and translation technology team, University of Gent](https://huggingface.co/LT3) and [homepage](https://lt3.ugent.be/)
|
|
@@ -31,7 +32,10 @@ Organisations
|
|
| 31 |
* [GroNLP](https://huggingface.co/GroNLP), [homepage](https://www.rug.nl/research/clcg/research/cl/)
|
| 32 |
* [CLTL](https://huggingface.co/CLTL), [homepage](http://cltl.nl) and [git](https://github.com/CLTL)
|
| 33 |
* [Nederlands Forensic Institute](https://huggingface.co/NetherlandsForensicInstitute), [homepage](https://forensicinstitute.nl/) and [git](https://github.com/NetherlandsForensicInstitute)
|
|
|
|
| 34 |
|
|
|
|
|
|
|
| 35 |
|
| 36 |
Encoder models
|
| 37 |
* [RobBERT v2](https://huggingface.co/pdelobelle/robbert-v2-dutch-base)
|
|
@@ -41,6 +45,7 @@ Encoder models
|
|
| 41 |
Decoder models
|
| 42 |
* [GPT-2 on mC4](https://huggingface.co/yhavinga/gpt2-large-dutch), [GPT-2 finetuned on ](https://huggingface.co/GroNLP/gpt2-medium-dutch-embeddings)
|
| 43 |
* [GPT-neo on mC4](https://huggingface.co/yhavinga/gpt-neo-1.3B-dutch)
|
|
|
|
| 44 |
|
| 45 |
NTMs
|
| 46 |
* [NLLB200](https://huggingface.co/facebook/nllb-200-3.3B)
|
|
@@ -48,4 +53,10 @@ NTMs
|
|
| 48 |
* [OPUS MT, en-nl](https://huggingface.co/Helsinki-NLP/opus-mt-en-nl), [OPUS MT, nl-en](https://huggingface.co/Helsinki-NLP/opus-mt-nl-en)
|
| 49 |
* [Llama 2 MT, nl-en](https://huggingface.co/kaitchup/Llama-2-7b-mt-Dutch-to-English)
|
| 50 |
|
| 51 |
-
Still to add; sentence similarity and data sets.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
* [François Remy](https://huggingface.co/FremyCompany), [homepage](http://fremycompany.com) and [git](https://github.com/FremyCompany)
|
| 22 |
|
| 23 |
Organisations
|
| 24 |
+
* [University Medical Center Utrecht](https://github.com/umcu)
|
| 25 |
* [NLPtown](https://huggingface.co/nlptown) and [homepage](http://nlp.town/)
|
| 26 |
* [doc2query](https://huggingface.co/doc2query)
|
| 27 |
* [LT3, language and translation technology team, University of Gent](https://huggingface.co/LT3) and [homepage](https://lt3.ugent.be/)
|
|
|
|
| 32 |
* [GroNLP](https://huggingface.co/GroNLP), [homepage](https://www.rug.nl/research/clcg/research/cl/)
|
| 33 |
* [CLTL](https://huggingface.co/CLTL), [homepage](http://cltl.nl) and [git](https://github.com/CLTL)
|
| 34 |
* [Nederlands Forensic Institute](https://huggingface.co/NetherlandsForensicInstitute), [homepage](https://forensicinstitute.nl/) and [git](https://github.com/NetherlandsForensicInstitute)
|
| 35 |
+
* [Integraal Kanker centrum Nederland (iKNL)](https://github.com/iknl)
|
| 36 |
|
| 37 |
+
Libraries:
|
| 38 |
+
* [Clinlp](https://github.com/umcu/clinlp)
|
| 39 |
|
| 40 |
Encoder models
|
| 41 |
* [RobBERT v2](https://huggingface.co/pdelobelle/robbert-v2-dutch-base)
|
|
|
|
| 45 |
Decoder models
|
| 46 |
* [GPT-2 on mC4](https://huggingface.co/yhavinga/gpt2-large-dutch), [GPT-2 finetuned on ](https://huggingface.co/GroNLP/gpt2-medium-dutch-embeddings)
|
| 47 |
* [GPT-neo on mC4](https://huggingface.co/yhavinga/gpt-neo-1.3B-dutch)
|
| 48 |
+
* [Geitje (based on Mistral)](https://github.com/Rijgersberg/GEITje)
|
| 49 |
|
| 50 |
NTMs
|
| 51 |
* [NLLB200](https://huggingface.co/facebook/nllb-200-3.3B)
|
|
|
|
| 53 |
* [OPUS MT, en-nl](https://huggingface.co/Helsinki-NLP/opus-mt-en-nl), [OPUS MT, nl-en](https://huggingface.co/Helsinki-NLP/opus-mt-nl-en)
|
| 54 |
* [Llama 2 MT, nl-en](https://huggingface.co/kaitchup/Llama-2-7b-mt-Dutch-to-English)
|
| 55 |
|
| 56 |
+
Still to add; sentence similarity and data sets.
|
| 57 |
+
|
| 58 |
+
* [SoNaR](https://taalmaterialen.ivdnt.org/download/tstc-sonar-corpus/)
|
| 59 |
+
* [COW](https://rolandschaefer.net/archives/142)
|
| 60 |
+
* [mc4 cleaned](https://huggingface.co/datasets/yhavinga/mc4_nl_cleaned)
|
| 61 |
+
* [TWnC](https://research.utwente.nl/en/publications/twnc-a-multifaceted-dutch-news-corpus)
|
| 62 |
+
* [Gigacorpus](http://gigacorpus.nl/)
|