Instructions to use HumorR1/policy-e3-dpo-no-thinking with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use HumorR1/policy-e3-dpo-no-thinking with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-VL-2B-Instruct") model = PeftModel.from_pretrained(base_model, "HumorR1/policy-e3-dpo-no-thinking") - Notebooks
- Google Colab
- Kaggle
- Xet hash:
- 98330571d5df35d0919d330712cb062f195c6013cab35bc59995a33560d01df8
- Size of remote file:
- 5.91 kB
- SHA256:
- 47087f54e8be6e96ca67a869de0d393f2661fcffe79908930c5b5eedabae591f
·
Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.