| --- |
| tags: |
| - gui-agent |
| - qwen2.5-vl |
| - cooperative-lora |
| - gui-360 |
| license: apache-2.0 |
| base_model: Qwen/Qwen2.5-VL-7B-Instruct |
| datasets: |
| - Stevenshuqing/gui360-balanced |
| --- |
| |
| # gui360-fullparam-sft-step250 |
|
|
| Full-parameter SFT on GUI-360 balanced 2K data (Qwen2.5-VL-7B-Instruct). TSR=22.2%, StepSR=69.3%. Best overall. ~epoch 3.7, LLaMA-Factory+ZeRO-3. |
|
|
| ## Base Model |
| - **Qwen2.5-VL-7B-Instruct** |
|
|
| ## Training Data |
| - GUI-360 balanced 2K episodes (17,264 steps) |
| - Action types: click, type, swipe (balanced sampling) |
|
|
| ## Evaluation (GUI-360 test 1K balanced) |
|
|
| | Metric | Value | |
| |--------|-------| |
| | TSR (Task Success Rate) | 22.2% | |
| | StepSR (Step Success Rate) | 69.3% | |
| | Progress | 35.3% | |
|
|
| ## Full Ranking |
|
|
| | # | Method | TSR | Params | |
| |---|--------|----:|--------| |
| | 1 | Full-param SFT step-250 | 22.2% | 7.6B | |
| | 2 | **V15 Cooperative RL step-25** | **20.8%** | ~142M | |
| | 3 | PEFT Cooperative r=128 (SVD) | 18.6% | ~67M | |
| | 4 | PEFT Standard r=128 (SVD) | 18.1% | ~67M | |
| | 5 | Base model (zero-shot) | 2.4% | — | |
|
|
| ## Citation |
|
|
| Part of the Cooperative LoRA research for GUI agents. |
|
|