Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
eundain
/
grpo_model_output
like
0
Transformers
Safetensors
Generated from Trainer
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
grpo_model_output
/
final_model
122 MB
1 contributor
History:
1 commit
eundain
eundain/gemma2-2b-ecg-thinking
9ac96cc
verified
10 months ago
README.md
5.09 kB
eundain/gemma2-2b-ecg-thinking
10 months ago
adapter_config.json
793 Bytes
eundain/gemma2-2b-ecg-thinking
10 months ago
adapter_model.safetensors
83.1 MB
xet
eundain/gemma2-2b-ecg-thinking
10 months ago
special_tokens_map.json
522 Bytes
eundain/gemma2-2b-ecg-thinking
10 months ago
tokenizer.json
34.4 MB
xet
eundain/gemma2-2b-ecg-thinking
10 months ago
tokenizer.model
4.24 MB
xet
eundain/gemma2-2b-ecg-thinking
10 months ago
tokenizer_config.json
47 kB
eundain/gemma2-2b-ecg-thinking
10 months ago