Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lbourdois 's Collections
French packs
French Courses Translations
FAT5
Breton packs
French NER
French QA
French paraphrase dataset
French summarization datasets
French prompts datasets
French DPO and conversation datasets
French think and toolcalling datasets
French embedding datasets
French VQA datasets
French caption datasets
French OCR datasets
French retriever datasets
French table-to-text datasets
French audio datasets (pretraining)

French caption datasets

updated 18 days ago

Datasets I cleaned with an image, a prompt question (like "describe this image") and an answer. Can be used to train VLMs.

Upvote
-

  • lbourdois/caption-wit_base_french

    Viewer • Updated 17 days ago • 1.38M • 92

  • lbourdois/caption-maya-multimodal-pretrain-clean

    Viewer • Updated Jul 15, 2025 • 551k • 290

  • lbourdois/caption-localized_narratives

    Viewer • Updated 18 days ago • 200k • 53

  • lbourdois/caption-textcaps

    Viewer • Updated 18 days ago • 22k • 62

  • lbourdois/caption-vsr

    Viewer • Updated 18 days ago • 1.37k • 37

  • CATIE-AQ/caption-vidore-vdsid_french-clean

    Viewer • Updated Jul 15, 2025 • 5k • 28

  • CATIE-AQ/caption-vidore-tabfquad_test_subsampled-clean

    Viewer • Updated Jul 15, 2025 • 280 • 33

  • CATIE-AQ/caption-floschne-xm3600-clean

    Viewer • Updated Jul 15, 2025 • 8.56k • 27

  • CATIE-AQ/caption-manu-tabfquad_retrieving-clean

    Viewer • Updated Jul 15, 2025 • 1.83k • 26
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs