mosaicml/mosaic-bert-base-seqlen-1024

550 MB

3 contributors

History: 14 commits

Create LICENSE (#2)

b9fba86 verified over 2 years ago

.gitattributes (1.48 kB): initial commit, about 3 years ago
LICENSE (11.3 kB): Create LICENSE (#2), over 2 years ago
README.md (13.9 kB): Update citation in README, over 2 years ago
bert_layers.py (47.3 kB): Upload BertForMaskedLM, about 3 years ago
bert_padding.py (6.26 kB): Upload BertForMaskedLM, about 3 years ago
config.json (845 Bytes): Change attention_probs_dropout_prob to 0.1 so that triton FlashAttention dependencies are avoided, over 2 years ago
configuration_bert.py (1.01 kB): Upload BertForMaskedLM, about 3 years ago
flash_attn_triton.py (42.7 kB): Upload BertForMaskedLM, about 3 years ago
pytorch_model.bin (550 MB): Upload BertForMaskedLM, about 3 years ago

Detected Pickle imports (3) in pytorch_model.bin:
- "collections.OrderedDict"
- "torch._utils._rebuild_tensor_v2"
- "torch.FloatStorage"
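
The pickle imports listed for pytorch_model.bin are the module-level names the pickle stream asks the loader to import. As a rough illustration of how such a list can be produced without unpickling anything, the sketch below walks the opcodes of a small pickle with the standard-library `pickletools` module. The helper name `list_pickle_imports` is hypothetical, the sketch handles only the `GLOBAL` opcode (protocols 0 to 3, not the `STACK_GLOBAL` opcode of protocol 4 and later), and it is not the scanner the hosting site actually runs.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> list[str]:
    """Return the sorted 'module.name' targets of GLOBAL opcodes in a pickle.

    Hypothetical helper for illustration only. It inspects opcodes and never
    executes the pickle. STACK_GLOBAL (protocol 4+) is deliberately ignored.
    """
    imports = set()
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
    return sorted(imports)


# Protocol 2 keeps the example on the GLOBAL opcode; fix_imports=False
# disables the Python 2 compatibility renaming of module names.
payload = pickle.dumps(OrderedDict(a=1), protocol=2, fix_imports=False)
print(list_pickle_imports(payload))
```

A payload made only of built-in containers, such as a plain dict of integers, yields an empty list, because nothing needs to be imported to rebuild it.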