yhavinga
/

modernbert-dutch-base-wide

masked-language-model

Model card Files Files and versions

yhavinga commited on Dec 26, 2025

Commit

2b0c587

·

verified ·

1 Parent(s): e21462a

Update README.md

Files changed (1) hide show

README.md +0 -12

README.md CHANGED Viewed

@@ -53,18 +53,6 @@ predictions = tokenizer.decode(outputs.logits[0, 4].topk(5).indices[0])
 # Expected: "hoofdstad" (capital)
 ```
-## Model Architecture Differences
-This model (`1024h-22L-2`) differs from the earlier `1024h-22L` variant:
-| Parameter | 1024h-22L | 1024h-22L-2 (this model) |
-|-----------|-----------|--------------------------|
-| `intermediate_size` | 4096 | **1536** |
-| `tokenizer` | `jhu-clsp/mmBERT-small` | **`yhavinga/dutch-llama-tokenizer`** |
-| `vocab_size` | 256,000 | **32,128** |
-The smaller intermediate MLP size and Dutch-specific tokenizer make this model more efficient while maintaining strong Dutch language understanding.
 ## Citation
 If you use this model, please cite:

 # Expected: "hoofdstad" (capital)
 ```
 ## Citation
 If you use this model, please cite: