Should You Mask 15% in Masked Language Modeling?
Paper: arXiv 2202.08005
This is a model checkpoint for "Should You Mask 15% in Masked Language Modeling" (code). The original checkpoint is available at princeton-nlp/efficient_mlm_m0.80. Unfortunately, that checkpoint depends on code that isn't part of the official transformers library, and it also contains unused weights due to a bug. This checkpoint fixes the unused-weights issue and uses the RobertaPreLayerNorm model from the transformers library.

```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("andreasmadsen/efficient_mlm_m0.80")
model = AutoModelForMaskedLM.from_pretrained("andreasmadsen/efficient_mlm_m0.80")
```
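With the tokenizer and model loaded, you can run a fill-mask prediction manually. This is a minimal sketch, not from the model card: the example sentence is illustrative, and it assumes the RoBERTa-style `<mask>` token used by this tokenizer.

```python
import torch

# The tokenizer is RoBERTa-style, so the mask token is "<mask>".
inputs = tokenizer("The capital of France is <mask>.", return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Locate the mask position and take the highest-scoring token there.
mask_index = (inputs.input_ids == tokenizer.mask_token_id).nonzero(as_tuple=True)[1]
predicted_id = logits[0, mask_index].argmax(dim=-1)
print(tokenizer.decode(predicted_id))
```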
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("fill-mask", model="andreasmadsen/efficient_mlm_m0.80")
```
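For example (the sentence below is illustrative, assuming the standard transformers fill-mask output: a list of candidate dicts with `token_str` and `score` keys):

```python
# Print the top predicted tokens for the masked position with their scores.
for prediction in pipe("The capital of France is <mask>."):
    print(prediction["token_str"], round(prediction["score"], 3))
```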