LoRA fine-tune on RobotSmith real world task00 ball in ring

11f786b verified 5 days ago

561 Bytes

	---
	license: apache-2.0
	base_model: Qwen/Qwen3-VL-4B-Instruct
	tags:
	- reward_model
	- rbm
	- preference_comparisons
	library_name: transformers
	---

	# amburger66/robometer-4b-lora-robotsmith-task00

	## Model Details

	- Base Model: Qwen/Qwen3-VL-4B-Instruct
	- Model Type: qwen3_vl

	## Training Run

	- Wandb Run: [lora_task00](https://wandb.ai/r-pad/rbm-finetune-robotsmith/runs/tbeldoz6)
	- Wandb ID: `tbeldoz6`
	- Project: rbm-finetune-robotsmith
	- Notes: fine-tuning Robometer on RobotSmith

	## Citation

	If you use this model, please cite: