--- license: apache-2.0 base_model: Qwen/Qwen3-VL-4B-Instruct tags: - reward_model - rfm - preference_comparisons library_name: transformers --- # rewardfm/libero_testset_prog_4frames_fixdata ## Model Details - **Base Model**: Qwen/Qwen3-VL-4B-Instruct - **Model Type**: qwen3_vl ## Training Run - **Wandb Run**: [libero_ablation_prog_4frames_fixdata](https://wandb.ai/clvr/rfm/runs/ds6utsjz) - **Wandb ID**: `ds6utsjz` - **Project**: rfm - **Notes**: libero prog only ## Citation If you use this model, please cite: