Add model card for EuroBERT-210M related to MLM vs CLM paper

#1 opened by nielsr

This PR adds a comprehensive model card for the EuroBERT-210M model. The model is associated with the paper "Should We Still Pretrain Encoders with Masked Language Modeling?", which investigates whether masked language modeling (MLM) or causal language modeling (CLM) is the better pretraining objective for encoders.

The model card now includes:

  • Essential metadata (pipeline_tag, library_name, license).
  • A concise summary of the paper's key findings and contributions.
  • Direct links to the paper, the project page (https://hf.co/MLMvsCLM), and the underlying GitHub repository (https://github.com/Nicolas-BZRD/EuroBERT).
  • A practical usage example demonstrating how to load the model and extract text features with the transformers library, including the trust_remote_code=True flag required by this custom architecture (see the sketch after this list).
  • A BibTeX citation for the paper.
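
For reference, here is a minimal sketch of the kind of usage example the card includes, assuming the Hub model id EuroBERT/EuroBERT-210M and simple mean pooling over the last hidden state as the feature-extraction step (the exact snippet in the card may differ):

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Assumed Hub model id; the card's own example may differ.
model_id = "EuroBERT/EuroBERT-210M"

tokenizer = AutoTokenizer.from_pretrained(model_id)
# trust_remote_code=True is needed because EuroBERT ships a custom architecture.
model = AutoModel.from_pretrained(model_id, trust_remote_code=True)

inputs = tokenizer("The capital of France is Paris.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Mean-pool the token representations into one feature vector per sentence.
features = outputs.last_hidden_state.mean(dim=1)
print(features.shape)  # torch.Size([1, hidden_size])
```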

This update significantly improves the model's discoverability and provides users with critical information for understanding, using, and citing the artifact.

