Lanni-ni/forgetting_pile_2layer
Tags: Text Generation, Transformers, Safetensors, forgetting_transformer, custom_code, arxiv:1910.09700
Branch: main
Path: `forgetting_pile_2layer/ops/__pycache__`
Directory size: 63.9 kB (1 contributor, history: 1 commit)
Latest commit: 15063d0 (verified) by Lanni-ni, 28 days ago: "add remote code + model files"
| File | Scan | Size | Last commit | Updated |
| --- | --- | --- | --- | --- |
| `__init__.cpython-310.pyc` | Safe | 208 Bytes | add remote code + model files | 28 days ago |
| `direction_sensitive_geometric.cpython-310.pyc` | Safe | 5.28 kB | add remote code + model files | 28 days ago |
| `forgetting_attention.cpython-310.pyc` | Safe | 25.1 kB | add remote code + model files | 28 days ago |
| `forgetting_attention_std.cpython-310.pyc` | Safe | 1.84 kB | add remote code + model files | 28 days ago |
| `framework_mock.cpython-310.pyc` | Safe | 1.01 kB | add remote code + model files | 28 days ago |
| `geometric_attention_final.cpython-310.pyc` | Safe | 2.16 kB | add remote code + model files | 28 days ago |
| `geometric_attention_std.cpython-310.pyc` | Safe | 3.89 kB | add remote code + model files | 28 days ago |
| `layer_with_visualization.cpython-310.pyc` | Safe | 2.17 kB | add remote code + model files | 28 days ago |
| `multi_head_attention.cpython-310.pyc` | Safe | 6.92 kB | add remote code + model files | 28 days ago |
| `multi_head_relative_pos_attention.cpython-310.pyc` | Safe | 8.08 kB | add remote code + model files | 28 days ago |
| `sliding_window_attention_std.cpython-310.pyc` | Safe | 2.07 kB | add remote code + model files | 28 days ago |
| `stickbreaking_attention_std.cpython-310.pyc` | Safe | 1.14 kB | add remote code + model files | 28 days ago |
| `vanilla_attention_std.cpython-310.pyc` | Safe | 3.95 kB | add remote code + model files | 28 days ago |