pankajmathur/nanochat-d34-rl
Task: Text Generation
Datasets: HuggingFaceTB/smol-smoltalk, openai/gsm8k
Language: English
Tags: nanochat, gpt, conversational, rl, grpo, gsm8k, math, reinforcement-learning
License: mit
Branch: main
Size: 8.59 GB
Contributors: 1
History: 9 commits
Latest commit: Update README.md (89b3a56, verified), pankajmathur, 3 months ago
Name                                      Size      Last commit                                      Updated
chatrl_checkpoints                        -         Upload model_000466.pt                           3 months ago
logs                                      -         Upload d34_rl.log                                3 months ago
report                                    -         Upload 4 files                                   3 months ago
tokenizer                                 -         Upload 2 files                                   3 months ago
.gitattributes                            1.64 kB   Upload Screenshot 2025-12-08 at 5.19.32 PM.png   3 months ago
README.md                                 3.6 kB    Update README.md                                 3 months ago
Screenshot 2025-12-08 at 5.19.32 PM.png   626 kB    Upload Screenshot 2025-12-08 at 5.19.32 PM.png   3 months ago