File size: 184 Bytes
3e21791
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
---
{}
---
# Model Details
- **Architecture**: Basic/default GPT-2, decoder only
- **Num params**: ~810M
- **Num tokens seen**: ~2 B
- **Dataset**: PubMed Abstracts subset of The Pile