File size: 184 Bytes
3e21791 |
1 2 3 4 5 6 7 8 9 |
---
{}
---
# Model Details
- **Architecture**: Basic/default GPT-2, decoder only
- **Num params**: ~810M
- **Num tokens seen**: ~2 B
- **Dataset**: PubMed Abstracts subset of The Pile
|