| {} | |
| # Model Details | |
| - **Architecture**: Basic/default GPT-2, decoder only | |
| - **Num params**: ~204M | |
| - **Num tokens seen**: ~1.3 B | |
| - **Dataset**: USPTO subset of The Pile |
| {} | |
| # Model Details | |
| - **Architecture**: Basic/default GPT-2, decoder only | |
| - **Num params**: ~204M | |
| - **Num tokens seen**: ~1.3 B | |
| - **Dataset**: USPTO subset of The Pile |