amburger66
/

robometer-4b-lora-robotsmith-task02

preference_comparisons

Model card Files Files and versions

robometer-4b-lora-robotsmith-task02 / README.md

amburger66's picture

LoRA fine-tune on RobotSmith task02 reaching

933e925 verified 13 days ago

|

history blame contribute delete

561 Bytes

license: apache-2.0
base_model: Qwen/Qwen3-VL-4B-Instruct
tags:
  - reward_model
  - rbm
  - preference_comparisons
library_name: transformers

amburger66/robometer-4b-lora-robotsmith-task02

Model Details

Base Model: Qwen/Qwen3-VL-4B-Instruct
Model Type: qwen3_vl

Training Run

Wandb Run: lora_task02
Wandb ID: k51jvvii
Project: rbm-finetune-robotsmith
Notes: fine-tuning Robometer on RobotSmith

Citation

If you use this model, please cite: