Kristijan/gpt2_wt103_12-layer
Tags: PyTorch, English, gpt2, language-model, transformer, wikitext-103, Eval Results (legacy)
arXiv: 2210.13569
Branch: main
Repository size: 1.3 GB
Contributors: 1
History: 5 commits
Latest commit: 5667178, "Update README.md", by Kristijan, over 2 years ago
File                Size       Storage  Last commit                                                            Updated
.gitattributes      1.48 kB    -        initial commit                                                         almost 3 years ago
README.md           1.87 kB    -        Update README.md                                                       over 2 years ago
config.json         686 Bytes  -        initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)  almost 3 years ago
optimizer.pt        862 MB     xet      initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)  almost 3 years ago
pytorch_model.bin   443 MB     xet      initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)  almost 3 years ago
rng_state.pth       14.5 kB    xet      initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)  almost 3 years ago
scaler.pt           559 Bytes  xet      initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)  almost 3 years ago
scheduler.pt        623 Bytes  xet      initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)  almost 3 years ago
trainer_state.json  15.2 kB    -        initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)  almost 3 years ago
training_args.bin   2.48 kB    xet      initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)  almost 3 years ago
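
As a rough sanity check on the listing, the per-file sizes can be summed and compared with the 1.3 GB shown for the repository. The sketch below is only illustrative: the `LISTED_SIZES` dictionary is transcribed by hand from the listing, the helper `to_bytes` is a hypothetical name, and it assumes the page uses decimal units (1 kB = 1,000 bytes, 1 MB = 1,000,000 bytes), which the page itself does not state.

```python
# Sizes copied by hand from the file listing (not fetched from the Hub).
LISTED_SIZES = {
    ".gitattributes": "1.48 kB",
    "README.md": "1.87 kB",
    "config.json": "686 Bytes",
    "optimizer.pt": "862 MB",
    "pytorch_model.bin": "443 MB",
    "rng_state.pth": "14.5 kB",
    "scaler.pt": "559 Bytes",
    "scheduler.pt": "623 Bytes",
    "trainer_state.json": "15.2 kB",
    "training_args.bin": "2.48 kB",
}

# Assumption: decimal (SI) units; the listing does not say which convention it uses.
UNIT_FACTORS = {"Bytes": 1, "kB": 1_000, "MB": 1_000_000, "GB": 1_000_000_000}


def to_bytes(size: str) -> float:
    """Convert a display string such as '14.5 kB' into a byte count."""
    value, unit = size.split()
    return float(value) * UNIT_FACTORS[unit]


# Total of all listed files, and the single largest file.
total_bytes = sum(to_bytes(size) for size in LISTED_SIZES.values())
largest = max(LISTED_SIZES, key=lambda name: to_bytes(LISTED_SIZES[name]))

print(f"total: about {total_bytes / 1e9:.1f} GB")
print(f"largest file: {largest}")
```

Under that unit assumption the two large files, optimizer.pt and pytorch_model.bin, account for nearly all of the total. The file names match what the transformers `Trainer` writes when it saves a checkpoint, so optimizer.pt, scheduler.pt, scaler.pt, rng_state.pth, trainer_state.json and training_args.bin are most likely training-resume state rather than something needed to run the model.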