Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Bereketab
/
multihed1attention_dropout_balanced

PyTorch
CharTokenModel
Model card Files Files and versions
xet
Community
1
multihed1attention_dropout_balanced
3.34 GB
  • 1 contributor
History: 4 commits
SFconvertbot's picture
SFconvertbot
Adding `safetensors` variant of this model
87b17a7 verified about 1 year ago
  • .gitattributes
    1.52 kB
    initial commit about 1 year ago
  • best_model.pth

    Detected Pickle imports (3)

    • "torch.FloatStorage",
    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict"

    What is a pickle import?

    1.11 GB
    xet
    Add CharTokenModel weights and config about 1 year ago
  • config.json
    76 Bytes
    Add CharTokenModel weights and config about 1 year ago
  • learning_curve.png
    49.6 kB
    Add CharTokenModel weights and config about 1 year ago
  • model.safetensors
    1.11 GB
    xet
    Adding `safetensors` variant of this model about 1 year ago
  • pytorch_model.bin

    Detected Pickle imports (3)

    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict",
    • "torch.FloatStorage"

    What is a pickle import?

    1.11 GB
    xet
    Add CharTokenModel weights and config about 1 year ago
  • training_log.txt
    400 Bytes
    Add CharTokenModel weights and config about 1 year ago