This is the model repository for the paper "EDGE: Enhanced Grounded GUI Understanding with Enriched Multi-Granularity Synthetic Data".
The model is fine-tuned from Monkey. To speed up training, we also made some minor modifications:
Dataloader in PyTorch.
The training dataset (i.e., all training QAs in .jsonl format, excluding images) is published in the EDGE-Dataset repository.
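Since the training QAs are distributed as .jsonl (one JSON object per line), a minimal sketch for reading such a file might look like the following. The helper name `iter_jsonl` is hypothetical and not part of the EDGE code, and the sketch makes no assumption about which fields each record contains.

```python
import json
from pathlib import Path
from typing import Iterator


def iter_jsonl(path: str) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of a .jsonl file.

    Hypothetical helper for illustration; not taken from the EDGE repository.
    """
    with Path(path).open("r", encoding="utf-8") as f:
        for line_no, line in enumerate(f, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines between records
            try:
                yield json.loads(line)
            except json.JSONDecodeError as exc:
                # Report the offending line so bad records are easy to locate
                raise ValueError(f"{path}:{line_no}: invalid JSON") from exc
```

Calling `next(iter_jsonl(path))` on a downloaded file and inspecting the keys of the returned record is a quick way to see the actual schema before writing a dataloader around it.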
The model training and inference scripts are published in the anonymous repository EDGE.
Base model
echo840/Monkey-Chat