Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

kovidritesh
/

Qwen2.5_GRPO_RL

Text Generation

Model card Files Files and versions

Instructions to use kovidritesh/Qwen2.5_GRPO_RL with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries
PEFT
How to use kovidritesh/Qwen2.5_GRPO_RL with PEFT:
```
Base model is not found.
```
Notebooks
Google Colab
Kaggle

Qwen2.5_GRPO_RL

128 MB

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

kovidritesh's picture

Upload folder using huggingface_hub

9561c5a verified 6 months ago

.gitattributes

1.52 kB
initial commit 6 months ago
README.md

5.17 kB
Upload folder using huggingface_hub 6 months ago
adapter_config.json

1.1 kB
Upload folder using huggingface_hub 6 months ago
adapter_model.safetensors

120 MB
xet

Upload folder using huggingface_hub 6 months ago
value_head.pt
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage"
What is a pickle import?
8.4 MB
xet

Upload folder using huggingface_hub 6 months ago