Kristijan/gpt2_wt103_12-layer
Tags: PyTorch, English, gpt2, language-model, transformer, wikitext-103, Eval Results, arxiv:2210.13569
Total size: 1.3 GB
1 contributor
History: 2 commits
Latest commit: bcb66e3 by Kristijan, almost 3 years ago: initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)
File                 Size              Last commit                                                             Updated
.gitattributes       1.48 kB           initial commit                                                          almost 3 years ago
config.json          686 Bytes         initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)   almost 3 years ago
optimizer.pt         862 MB (xet)      initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)   almost 3 years ago
pytorch_model.bin    443 MB (xet)      initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)   almost 3 years ago
rng_state.pth        14.5 kB (xet)     initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)   almost 3 years ago
scaler.pt            559 Bytes (xet)   initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)   almost 3 years ago
scheduler.pt         623 Bytes (xet)   initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)   almost 3 years ago
trainer_state.json   15.2 kB           initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)   almost 3 years ago
training_args.bin    2.48 kB (xet)     initial commit, for 12-layer gpt2-like transformer (checkpoint 27500)   almost 3 years ago