| base_model: unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit | |
| tags: | |
| - text-generation-inference | |
| - transformers | |
| - unsloth | |
| - qwen2 | |
| - trl | |
| - grpo | |
| license: apache-2.0 | |
| language: | |
| - en | |
| In development |
| base_model: unsloth/qwen2.5-3b-instruct-unsloth-bnb-4bit | |
| tags: | |
| - text-generation-inference | |
| - transformers | |
| - unsloth | |
| - qwen2 | |
| - trl | |
| - grpo | |
| license: apache-2.0 | |
| language: | |
| - en | |
| In development |