LoRA fine-tune on RobotSmith task03 - correlation fixed

1d7080b verified 20 days ago

564 Bytes

	---
	license: apache-2.0
	base_model: Qwen/Qwen3-VL-4B-Instruct
	tags:
	- reward_model
	- rbm
	- preference_comparisons
	library_name: transformers
	---

	# amburger66/robometer-4b-lora-robotsmith-task03-v2

	## Model Details

	- Base Model: Qwen/Qwen3-VL-4B-Instruct
	- Model Type: qwen3_vl

	## Training Run

	- Wandb Run: [lora_task03](https://wandb.ai/r-pad/rbm-finetune-robotsmith/runs/t2aag6u7)
	- Wandb ID: `t2aag6u7`
	- Project: rbm-finetune-robotsmith
	- Notes: fine-tuning Robometer on RobotSmith

	## Citation

	If you use this model, please cite: