---
language:
- en
base_model: openai/clip-vit-base-patch32
---

`GUIClip` is a vision-language model for the GUI domain, based on `openai/clip-vit-base-patch32`.

Code and dataset are available at https://github.com/Jl-wei/guing
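
As a rough illustration of how a CLIP-style model can be used to search GUI screenshots with text, the sketch below ranks screenshots by cosine similarity between a query embedding and screenshot embeddings. It is a minimal sketch under stated assumptions, not the authors' pipeline: `rank_screenshots` and `clip_similarity` are hypothetical helper names, the toy vectors in the usage example are made up, and `clip_similarity` assumes the checkpoint loads with the standard `transformers` CLIP classes. This card does not state the Hub model id, so it is left as a parameter.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_screenshots(query_embedding, screenshot_embeddings):
    """Return (name, score) pairs sorted from best to worst match.

    screenshot_embeddings maps a screenshot name to its embedding vector.
    """
    scored = [
        (name, cosine_similarity(query_embedding, emb))
        for name, emb in screenshot_embeddings.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def clip_similarity(model_id, queries, image_paths):
    """Score every text query against every image with a CLIP-style checkpoint.

    model_id is a placeholder supplied by the caller (assumption: the
    checkpoint is compatible with CLIPModel and CLIPProcessor).
    Requires torch, transformers, and Pillow.
    """
    import torch
    from PIL import Image
    from transformers import CLIPModel, CLIPProcessor

    model = CLIPModel.from_pretrained(model_id)
    processor = CLIPProcessor.from_pretrained(model_id)
    images = [Image.open(path).convert("RGB") for path in image_paths]
    inputs = processor(text=queries, images=images, return_tensors="pt", padding=True)
    with torch.no_grad():
        outputs = model(**inputs)
    # Shape (len(queries), len(images)); higher means a closer match.
    return outputs.logits_per_text


if __name__ == "__main__":
    # Toy 2-D embeddings, for illustration only.
    query = [1.0, 0.0]
    screenshots = {
        "login": [0.9, 0.1],
        "settings": [0.0, 1.0],
        "signup": [0.5, 0.5],
    }
    for name, score in rank_screenshots(query, screenshots):
        print(f"{name}: {score:.3f}")
```

With a real checkpoint, `clip_similarity` would replace the toy vectors: each row of the returned matrix scores one query against all screenshots, so sorting a row gives a ranking for that query.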

If you find our work useful, please cite our paper:

```bibtex
@article{wei2024guing,
  author = {Wei, Jialiang and Courbis, Anne-Lise and Lambolais, Thomas and Xu, Binbin and Bernard, Pierre Louis and Dray, G\'{e}rard and Maalej, Walid},
  title = {GUing: A Mobile GUI Search Engine using a Vision-Language Model},
  year = {2025},
  volume = {34},
  number = {4},
  doi = {10.1145/3702993},
  journal = {ACM Trans. Softw. Eng. Methodol.},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA}
}
```

Please note that the model can only be used for academic purposes.