--- license: apache-2.0 base_model: Qwen/Qwen3-VL-4B-Instruct tags: - reward_model - rfm - preference_comparisons library_name: transformers --- # rewardfm/libero_90_prog_pref_4frames_fixdata ## Model Details - **Base Model**: Qwen/Qwen3-VL-4B-Instruct - **Model Type**: qwen3_vl ## Training Run - **Wandb Run**: [libero_ablation_prog_pref_4frames_fixeddata](https://wandb.ai/clvr/rfm/runs/r7yk4zqg) - **Wandb ID**: `r7yk4zqg` - **Project**: rfm - **Notes**: libero prog only ## Citation If you use this model, please cite: