Stevenshuqing/gui360-balanced
Viewer • Updated • 2.57k • 72
Cooperative LoRA after RL (GSPO) training. TSR=20.8%, StepSR=67.9%. Best LoRA result, 94% of full SFT performance with ~142M params.
| Metric | Value |
|---|---|
| TSR (Task Success Rate) | 20.8% |
| StepSR (Step Success Rate) | 67.9% |
| Progress | 34.5% |
| # | Method | TSR | Params |
|---|---|---|---|
| 1 | Full-param SFT step-250 | 22.2% | 7.6B |
| 2 | V15 Cooperative RL step-25 | 20.8% | ~142M |
| 3 | PEFT Cooperative r=128 (SVD) | 18.6% | ~67M |
| 4 | PEFT Standard r=128 (SVD) | 18.1% | ~67M |
| 5 | Base model (zero-shot) | 2.4% | — |
Part of the Cooperative LoRA research for GUI agents.
Base model
Qwen/Qwen2.5-VL-7B-Instruct