luodi-7/internvl_memeloc_stage1
Tags: Safetensors, internvl_chat, custom_code
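Model description
Fine-tuned from internvl2_5-4B.
The goal is to fine-tune the model so that, given a string of meme text and a blank meme template image, it learns to output [text box positions + the text to place at each position].
The approach follows earlier InternVL fine-tuning for visual grounding: detect abstract dialogue boxes, then decide what text goes in each box.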
Current task: Writable text area[[xmin,ymin, xmax, ymax]]
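A minimal sketch of how a response in the bracketed format above could be parsed into boxes. The function name `parse_boxes` and the regular expression are illustrative assumptions, not part of this repository, and the card does not say whether coordinates are in pixels or normalized, so the values are returned as-is.

```python
import re
from typing import List, Tuple

Box = Tuple[float, float, float, float]

# Matches one "[xmin, ymin, xmax, ymax]" group of four numbers.
# Assumption: the model emits plain integers or decimals separated by commas.
_NUM = r"(-?\d+(?:\.\d+)?)"
_BOX_RE = re.compile(
    r"\[\s*" + _NUM + r"\s*,\s*" + _NUM + r"\s*,\s*" + _NUM + r"\s*,\s*" + _NUM + r"\s*\]"
)


def parse_boxes(response: str) -> List[Box]:
    """Return every [xmin, ymin, xmax, ymax] group found in a response string."""
    boxes: List[Box] = []
    for match in _BOX_RE.finditer(response):
        xmin, ymin, xmax, ymax = (float(v) for v in match.groups())
        boxes.append((xmin, ymin, xmax, ymax))
    return boxes


if __name__ == "__main__":
    # Hypothetical response string, made up for illustration.
    print(parse_boxes("Writable text area[[10,20, 110,60]]"))
```

A response with no well-formed box yields an empty list, so malformed outputs can be counted separately rather than raising an error.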
Test results
avg_iou=0.3192
avg_similarity=0.9698
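The card does not state how avg_iou was computed. A sketch, assuming it is the standard intersection-over-union between each predicted box and its ground-truth box, averaged over all pairs. The names `box_iou` and `mean_iou` are hypothetical, and the one-to-one pairing of predictions with ground truth is also an assumption.

```python
from typing import Sequence, Tuple

Box = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax)


def box_iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    # Width and height of the overlap, clamped to zero when the boxes are disjoint.
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter

    # Degenerate (zero-area) boxes give a union of 0; report 0.0 instead of dividing.
    return inter / union if union > 0 else 0.0


def mean_iou(pairs: Sequence[Tuple[Box, Box]]) -> float:
    """Average IoU over (predicted, ground-truth) box pairs; 0.0 for an empty input."""
    if not pairs:
        return 0.0
    return sum(box_iou(pred, gt) for pred, gt in pairs) / len(pairs)


if __name__ == "__main__":
    # Made-up boxes for illustration only, not data from this model's test set.
    print(mean_iou([((0, 0, 10, 10), (0, 0, 10, 10)), ((0, 0, 10, 10), (20, 20, 30, 30))]))
```

The metric behind avg_similarity is likewise not specified in the card, so it is not sketched here.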