Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
crumb
/
768d-init
like
0
Text Generation
Transformers
PyTorch
gpt2a
custom_code
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
768d-init
/
README.md
crumb
Update README.md
07f04b1
about 2 years ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
169 Bytes
`31,870,464`
non-embedding params,
`38,598,913`
embedding params,
`70,469,377`
total.
```
"n_embd": 768
"n_head": 6
"n_inner": 1920
"n_layer": 6
"n_positions": 4096
```