metadata
license: apache-2.0
base_model: Qwen/Qwen3-VL-4B-Instruct
tags:
- reward_model
- rbm
- preference_comparisons
library_name: transformers
amburger66/robometer-4b-lora-robotsmith-task03-v2
Model Details
- Base Model: Qwen/Qwen3-VL-4B-Instruct
- Model Type: qwen3_vl
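The card does not say whether this repository holds only LoRA adapter weights or a merged full checkpoint, so the right loading path is not clear from the details above. A minimal sketch for checking this before loading is given here. It assumes the usual PEFT convention that an adapter repository contains an `adapter_config.json` file. The helper names `classify_checkpoint` and `inspect_repo` are hypothetical and are not part of Robometer or of any library.

```python
from typing import Iterable

# Both identifiers are taken from the model card.
REPO_ID = "amburger66/robometer-4b-lora-robotsmith-task03-v2"
BASE_MODEL = "Qwen/Qwen3-VL-4B-Instruct"


def classify_checkpoint(files: Iterable[str]) -> str:
    """Guess the checkpoint layout from a list of repository file names.

    Assumption: a PEFT/LoRA adapter repo ships an `adapter_config.json`,
    while a full checkpoint ships weight files without it.
    """
    # Compare on base names so files in subfolders are also recognised.
    names = {f.rsplit("/", 1)[-1] for f in files}
    if "adapter_config.json" in names:
        return "peft_adapter"
    if any(n.endswith((".safetensors", ".bin")) for n in names):
        return "full_model"
    return "unknown"


def inspect_repo(repo_id: str = REPO_ID) -> str:
    """List the files of a Hub repo and classify its layout (needs network)."""
    # Imported lazily so classify_checkpoint stays usable offline.
    from huggingface_hub import list_repo_files

    return classify_checkpoint(list_repo_files(repo_id))
```

Calling `inspect_repo()` requires network access and the `huggingface_hub` package. If the result is `"peft_adapter"`, the weights would normally be applied on top of the base model listed above. Whether Robometer adds its own reward heads that need a custom model class is not stated in this card.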
Training Run
- Wandb Run: lora_task03
- Wandb ID: t2aag6u7
- Project: rbm-finetune-robotsmith
- Notes: fine-tuning Robometer on RobotSmith
Citation
If you use this model, please cite: