| {} | |
| # Model Details | |
| - **Architecture**: Basic/default GPT-2, decoder only | |
| - **Num params**: ~810M | |
| - **Num tokens seen**: ~2 B | |
| - **Dataset**: PubMed Abstracts subset of The Pile | |
| {} | |
| # Model Details | |
| - **Architecture**: Basic/default GPT-2, decoder only | |
| - **Num params**: ~810M | |
| - **Num tokens seen**: ~2 B | |
| - **Dataset**: PubMed Abstracts subset of The Pile | |