amburger66's picture
LoRA fine-tune on RobotSmith task03 - correlation fixed
1d7080b verified
---
license: apache-2.0
base_model: Qwen/Qwen3-VL-4B-Instruct
tags:
- reward_model
- rbm
- preference_comparisons
library_name: transformers
---
# amburger66/robometer-4b-lora-robotsmith-task03-v2
## Model Details
- **Base Model**: Qwen/Qwen3-VL-4B-Instruct
- **Model Type**: qwen3_vl
## Training Run
- **Wandb Run**: [lora_task03](https://wandb.ai/r-pad/rbm-finetune-robotsmith/runs/t2aag6u7)
- **Wandb ID**: `t2aag6u7`
- **Project**: rbm-finetune-robotsmith
- **Notes**: fine-tuning Robometer on RobotSmith
## Citation
If you use this model, please cite: