Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

flax-community
/
roberta-pretraining-hindi

Fill-Mask
Transformers
PyTorch
JAX
TensorBoard
roberta
Model card Files Files and versions
xet
Metrics Training metrics Community
1
roberta-pretraining-hindi
1.04 GB
  • 5 contributors
History: 38 commits
dk-crazydiv's picture
dk-crazydiv
mc4 epoch14
9142843 over 4 years ago
  • .gitattributes
    737 Bytes
    Saving weights and logs of epoch 2 over 4 years ago
  • README.md
    162 Bytes
    Updated README over 4 years ago
  • config.json
    670 Bytes
    Updated torch model and fixed typo in conversion script over 4 years ago
  • create_config.py
    147 Bytes
    All set to train over 4 years ago
  • events.out.tfevents.1625416432.t1v-n-9df4ce0e-w-0.447041.3.v2
    40 Bytes
    xet
    Saving weights and logs of epoch 1 over 4 years ago
  • events.out.tfevents.1625418057.t1v-n-9df4ce0e-w-0.452509.3.v2
    41.6 MB
    xet
    Saving weights and logs of epoch 8 over 4 years ago
  • flax_model.msgpack
    499 MB
    xet
    mc4 epoch14 over 4 years ago
  • flax_to_torch.py
    805 Bytes
    Updated torch model and fixed typo in conversion script over 4 years ago
  • pytorch_model.bin
    499 MB
    xet
    Converted latest model to torch model over 4 years ago
  • run.sh
    562 Bytes
    Updated code to have different seed and reduced lr over 4 years ago
  • run_mlm_flax.py
    28.7 kB
    Fix on lr over 4 years ago
  • tokenizer.json
    2.86 MB
    mc4 epoch2 over 4 years ago
  • train_tokenizer.py
    776 Bytes
    tzz over 4 years ago