pankajmathur/nanochat-d34-rl

Text Generation
English
nanochat
gpt
conversational
rl
grpo
gsm8k
math
reinforcement-learning
nanochat-d34-rl / report
3.25 kB
  • 1 contributor
History: 1 commit
pankajmathur
Upload 4 files
52e26ac verified 4 months ago
  • chat-evaluation-rl.md
    403 Bytes
    Upload 4 files 4 months ago
  • chat-rl.md
    396 Bytes
    Upload 4 files 4 months ago
  • header.md
    651 Bytes
    Upload 4 files 4 months ago
  • report.md
    1.8 kB
    Upload 4 files 4 months ago