Fix model card: document original 8K-param model in best_model.json, note jld2 files are v2 a188fdb verified LisaMegaWatts commited on 5 days ago
Update best model - val loss 2.3610 (step 2450, CPU-optimized training) e27b2c2 verified LisaMegaWatts commited on 10 days ago
Update training data: 28-char vocab (a-z + space + period) a5ce96f verified LisaMegaWatts commited on 11 days ago