Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
prithivMLmods
/
SmolLM2_135M_Grpo_Checkpoint
like
1
Text Generation
Transformers
Safetensors
openai/gsm8k
English
llama
text-generation-inference
GRPO
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
SmolLM2_135M_Grpo_Checkpoint
Commit History
Update README.md
13f0063
verified
prithivMLmods
commited on
Feb 17, 2025
Update README.md
17f5f78
verified
prithivMLmods
commited on
Feb 17, 2025
Update README.md
b87f9c2
verified
prithivMLmods
commited on
Feb 17, 2025
Update README.md
cdd099e
verified
prithivMLmods
commited on
Feb 17, 2025
Update README.md
937dac8
verified
prithivMLmods
commited on
Feb 17, 2025
Update README.md
7a4989e
verified
prithivMLmods
commited on
Feb 17, 2025
Upload SmolLM_x_Grpo.ipynb
9b868a8
verified
prithivMLmods
commited on
Feb 17, 2025
Create README.md
343f3dd
verified
prithivMLmods
commited on
Feb 17, 2025
Upload folder using huggingface_hub
7797c34
verified
prithivMLmods
commited on
Feb 17, 2025
Delete final_model
1dbe979
verified
prithivMLmods
commited on
Feb 17, 2025
Upload folder using huggingface_hub
2d43f42
verified
prithivMLmods
commited on
Feb 17, 2025
initial commit
61c8346
verified
prithivMLmods
commited on
Feb 17, 2025