xjs521's Collections

multi-model

updated Oct 17, 2024

  • 01-ai/Yi-VL-6B

    Image-Text-to-Text • Updated Jun 26, 2024 • 83 • 124

  • Qwen/Qwen-VL-Chat

    Text Generation • Updated Jan 25, 2024 • 132k • 381

  • llava-hf/llava-v1.6-mistral-7b-hf

    Image-Text-to-Text • 8B • Updated Dec 22, 2025 • 597k • 303

  • Insanelyfastwhisper

    Space • Running • Convert audio to subtitles • 62


  • zai-org/cogvlm2-llama3-chat-19B

    Text Generation • 20B • Updated Sep 3, 2024 • 1.98k • 219

  • zai-org/cogvlm2-llama3-caption

    Video-Text-to-Text • Updated May 14, 2025 • 9.6k • 113