guo mi
guo1006
AI & ML interests
NLP, computer vision, multimodal, Agent
Organizations
None yet
multimodel
- guo1006/layoutlmv2-base-uncased-finetuned-docvqa_1200_examples
  Document Question Answering • 0.2B • Updated • 26
- guo1006/git-base-pokemon-captioning-generate
  Image-to-Text • 0.2B • Updated
- guo1006/vilt-b32-mlm-finetuned-vqa-800
  Visual Question Answering • 0.1B • Updated • 1
- guo1006/speecht5-finetuned-voxpopuli_nl
  Text-to-Audio • 0.1B • Updated • 2
audio
computer vision
nlp