---
tags:
  - gui-agent
  - qwen2.5-vl
  - cooperative-lora
  - gui-360
license: apache-2.0
base_model: Qwen/Qwen2.5-VL-7B-Instruct
datasets:
  - Stevenshuqing/gui360-balanced
---

# gui360-fullparam-sft-step250

Full-parameter SFT on GUI-360 balanced 2K data (Qwen2.5-VL-7B-Instruct). TSR=22.2%, StepSR=69.3%. Best overall. ~epoch 3.7, LLaMA-Factory+ZeRO-3.

## Base Model
- **Qwen2.5-VL-7B-Instruct**

## Training Data
- GUI-360 balanced 2K episodes (17,264 steps)
- Action types: click, type, swipe (balanced sampling)

## Evaluation (GUI-360 test 1K balanced)

| Metric | Value |
|--------|-------|
| TSR (Task Success Rate) | 22.2% |
| StepSR (Step Success Rate) | 69.3% |
| Progress | 35.3% |

## Full Ranking

| # | Method | TSR | Params |
|---|--------|----:|--------|
| 1 | Full-param SFT step-250 | 22.2% | 7.6B |
| 2 | **V15 Cooperative RL step-25** | **20.8%** | ~142M |
| 3 | PEFT Cooperative r=128 (SVD) | 18.6% | ~67M |
| 4 | PEFT Standard r=128 (SVD) | 18.1% | ~67M |
| 5 | Base model (zero-shot) | 2.4% | — |

## Citation

Part of the Cooperative LoRA research for GUI agents.