Add pipeline tag, license, links to paper, project page and Github repo
#1, opened by nielsr (HF Staff)

README.md CHANGED
@@ -1,34 +1,36 @@
 ---
-library_name: transformers
 base_model: Qwen/Qwen-VL-Chat
+library_name: transformers
+pipeline_tag: image-text-to-text
+license: cc-by-4.0
 ---
 
 # Model Card for Model ID
 
 <!-- Provide a quick summary of what the model is/does. -->
 
-
+This model is a UI-VLM trained with the AutoGUI-625k training data.
 
 ## Model Details
 
 ### Model Description
 
 <!-- Provide a longer summary of what this model is. -->
-The UI-VLM trained with the AutoGUI-625k training data.
-
+The UI-VLM trained with the AutoGUI-625k training data as described in [AutoGUI: Scaling GUI Grounding with Automatic Functionality Annotations from LLMs](https://huggingface.co/papers/2502.01977).
 
 - **Developed by:** [Hongxin Li]
 - **Model type:** [Vision-language model]
 - **Language(s) (NLP):** [English]
-- **License:** [
+- **License:** [CC-BY-4.0]
 - **Finetuned from model [optional]:** [Qwen-VL-Chat]
 
 ### Model Sources [optional]
 
 <!-- Provide the basic links for the model. -->
 
-- **Repository:** [
-- **Paper [optional]:** [
+- **Repository:** [https://github.com/BraveGroup/AutoGUI](https://github.com/BraveGroup/AutoGUI)
+- **Paper [optional]:** [https://huggingface.co/papers/2502.01977](https://huggingface.co/papers/2502.01977)
+- **Project page [optional]:** [https://autogui-project.github.io/](https://autogui-project.github.io/)
 - **Demo [optional]:** [More Information Needed]
 
 ## Uses
@@ -87,7 +89,6 @@ Use the code below to get started with the model.
 
 [More Information Needed]
 
-
 #### Training Hyperparameters
 
 - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
@@ -128,8 +129,6 @@ Use the code below to get started with the model.
 
 #### Summary
 
-
-
 ## Model Examination [optional]
 
 <!-- Relevant interpretability work for the model goes here -->