Add detailed base model info, pre-training datasets, and research context 925c5eb verified ronnengmail commited on about 9 hours ago
Clarify: 20M is SFT tokens, base model pre-trained on 9.8B tokens 4dd04f7 verified ronnengmail commited on about 18 hours ago
Upload tokenizer.model with huggingface_hub 544443d verified ronnengmail commited on about 19 hours ago