Instructions to use HumorR1/policy-e2b-grpo-thinking with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use HumorR1/policy-e2b-grpo-thinking with PEFT:
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the base model first, then attach the adapter weights on top of it.
base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-VL-2B-Thinking")
model = PeftModel.from_pretrained(base_model, "HumorR1/policy-e2b-grpo-thinking")
```

- Notebooks
- Google Colab
- Kaggle
File size: 129 Bytes
b5ae70e
version https://git-lfs.github.com/spec/v1
oid sha256:9b58498f5c0b9297dffe3c5269aa516b68c7e2266979cd4f963c297338e66a05
size 7377
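The file shown here is a Git LFS pointer rather than the tracked file itself: three `key value` lines giving the spec version, the object ID, and the size in bytes of the real file. As a minimal sketch of how such a pointer could be read, the helper below splits each line on its first space. The function name `parse_lfs_pointer` and the returned dictionary keys are illustrative assumptions, not part of any library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line has the form "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid value is stored as "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),  # size of the real file, in bytes
    }


# Pointer contents copied from the file above.
pointer_text = """version https://git-lfs.github.com/spec/v1
oid sha256:9b58498f5c0b9297dffe3c5269aa516b68c7e2266979cd4f963c297338e66a05
size 7377
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["oid_algo"], pointer["size"])
```

The `size` field describes the tracked file (7377 bytes), while the 129-byte file size reported above is the size of the pointer text itself.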