Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Lanni-ni
/
forgetting_pile_2layer

Text Generation
Transformers
Safetensors
forgetting_transformer
custom_code
Model card Files Files and versions
xet
Community
forgetting_pile_2layer / ops /__pycache__
63.9 kB
  • 1 contributor
History: 1 commit
Lanni-ni's picture
Lanni-ni
add remote code + model files
15063d0 verified 28 days ago
  • __init__.cpython-310.pyc
    208 Bytes
    add remote code + model files 28 days ago
  • direction_sensitive_geometric.cpython-310.pyc
    5.28 kB
    add remote code + model files 28 days ago
  • forgetting_attention.cpython-310.pyc
    25.1 kB
    add remote code + model files 28 days ago
  • forgetting_attention_std.cpython-310.pyc
    1.84 kB
    add remote code + model files 28 days ago
  • framework_mock.cpython-310.pyc
    1.01 kB
    add remote code + model files 28 days ago
  • geometric_attention_final.cpython-310.pyc
    2.16 kB
    add remote code + model files 28 days ago
  • geometric_attention_std.cpython-310.pyc
    3.89 kB
    add remote code + model files 28 days ago
  • layer_with_visualization.cpython-310.pyc
    2.17 kB
    add remote code + model files 28 days ago
  • multi_head_attention.cpython-310.pyc
    6.92 kB
    add remote code + model files 28 days ago
  • multi_head_relative_pos_attention.cpython-310.pyc
    8.08 kB
    add remote code + model files 28 days ago
  • sliding_window_attention_std.cpython-310.pyc
    2.07 kB
    add remote code + model files 28 days ago
  • stickbreaking_attention_std.cpython-310.pyc
    1.14 kB
    add remote code + model files 28 days ago
  • vanilla_attention_std.cpython-310.pyc
    3.95 kB
    add remote code + model files 28 days ago