Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
pankajmathur
/
nanochat-d34-rl
like
0
Text Generation
HuggingFaceTB/smol-smoltalk
openai/gsm8k
English
nanochat
gpt
conversational
rl
grpo
gsm8k
math
reinforcement-learning
License:
mit
Model card
Files
Files and versions
xet
Community
main
nanochat-d34-rl
/
report
3.25 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
pankajmathur
Upload 4 files
52e26ac
verified
4 months ago
chat-evaluation-rl.md
Safe
403 Bytes
Upload 4 files
4 months ago
chat-rl.md
Safe
396 Bytes
Upload 4 files
4 months ago
header.md
Safe
651 Bytes
Upload 4 files
4 months ago
report.md
Safe
1.8 kB
Upload 4 files
4 months ago