Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Lanni-ni
/
forgetting_gate_4_6_384_pile
like
0
Text Generation
Transformers
Safetensors
forgetting_transformer
custom_code
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
forgetting_gate_4_6_384_pile
/
ops
261 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
Lanni-ni
Copy Pile model from forgetting_gate_4_6_384_
4531c12
verified
5 months ago
.ipynb_checkpoints
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
__pycache__
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
geometric_attention
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
__init__.py
Safe
69 Bytes
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
direction_sensitive_geometric.py
Safe
5.97 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
direction_sensitive_geometric.py.bak
Safe
5.9 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
forgetting_attention.py
Safe
47.3 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
forgetting_attention_std.py
Safe
2.19 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
framework_mock.py
Safe
520 Bytes
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
geometric_attention_final.py
Safe
2.82 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
geometric_attention_std.py
Safe
5.8 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
layer_with_visualization.py
Safe
1.26 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
multi_head_attention.py
Safe
7.09 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
multi_head_relative_pos_attention.py
Safe
9.64 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
multi_head_relative_pos_attention.py.bak
Safe
9.56 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
sliding_window_attention_std.py
Safe
2.39 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
stickbreaking_attention_std.py
Safe
1.11 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
transformer.py
Safe
7.3 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago
vanilla_attention_std.py
Safe
5.64 kB
Copy Pile model from forgetting_gate_4_6_384_
5 months ago