prithivMLmods/Bellatrix-Tiny-1B-R1
Text Generation
Transformers
Safetensors
English
llama
GRPO
Reinforcement learning
trl
SFT
conversational
text-generation-inference
License: llama3.2
Bellatrix-Tiny-1B-R1/generation_config.json (main)
Commit History
Add files using upload-large-folder tool
02f95bc
verified
prithivMLmods committed on Jan 31