KRLabsOrg
/

ModernBERT-base-hu

Model card Files Files and versions

ModernBERT-base-hu / README.md

adaamko's picture

Update README.md

4c9495c verified 10 months ago

|

history blame contribute delete

453 Bytes

	---
	license: apache-2.0
	language:
	- hu
	base_model:
	- answerdotai/ModernBERT-base
	library_name: transformers
	---

	# Hungarian ModernBERT

	We've used the [transtokenizer](https://github.com/LAGoM-NLP/transtokenizer) repo to create a mapped ModernBERT-base model to Hungarian. To create the data, we've used the [OpenSubtitles](https://opus.nlpl.eu/OpenSubtitles/corpus/version/OpenSubtitles) corpus to obtain EN-HU parallel sentences to train the mapping.