---
{}
---

# Model Details

- **Architecture**: Basic/default GPT-2, decoder only
- **Num params**: ~204M
- **Num tokens seen**: ~1.3B
- **Dataset**: USPTO subset of The Pile