<!-- Provide a quick summary of what the model is/does. -->

This medium-sized BERT model was created using the [Masked Latent Semantic Modeling](https://aclanthology.org/2023.findings-acl.876/) (MLSM) pre-training objective, which is a sample-efficient alternative to classic Masked Language Modeling (MLM).
During MLSM, the objective is to recover the latent semantic profile of the masked tokens, as opposed to recovering their exact identity.
The contextualized latent semantic profile during pre-training is determined by performing sparse coding of the hidden representation of an already pre-trained model (a base-sized BERT model in this particular case).
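The sparse-coding step described above can be sketched with scikit-learn's `SparseCoder`. Everything below is a toy illustration, not this model's actual pre-training code: the dictionary, dimensions, and penalty are made-up values, and random vectors stand in for the teacher BERT's hidden states.

```python
import numpy as np
from sklearn.decomposition import SparseCoder

# Toy stand-in for the teacher model's hidden size and a small sparse
# dictionary (made-up sizes; the actual MLSM settings are in the paper).
rng = np.random.default_rng(0)
hidden_size, n_atoms = 768, 32

# Dictionary of latent semantic "atoms", rows normalized to unit length.
dictionary = rng.standard_normal((n_atoms, hidden_size))
dictionary /= np.linalg.norm(dictionary, axis=1, keepdims=True)

# Synthetic hidden states: each is dominated by a single atom, standing in
# for the teacher's contextualized representation of a masked token.
hidden_states = 5.0 * dictionary[:4] + 0.05 * rng.standard_normal((4, hidden_size))

# Non-negative sparse coding of each hidden state over the dictionary.
coder = SparseCoder(
    dictionary=dictionary,
    transform_algorithm="lasso_lars",
    transform_alpha=0.001,
    positive_code=True,
)
codes = coder.transform(hidden_states)  # shape: (4, n_atoms)

# Normalizing each code gives a distribution over dictionary atoms: the
# latent semantic profile the student is trained to predict.
profiles = codes / codes.sum(axis=1, keepdims=True)
```

During pre-training the student's output for a masked position would then be scored against `profiles` (a soft target distribution) rather than against the one-hot identity of the original token.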