tiendung's Collections
pretrain

Updated Oct 27, 2024

  • cfli/pretrain_wiki

    Viewer • Updated Feb 14, 2024 • 10.4M • 629

  • cfli/bge-full-data

    Updated Oct 11, 2024 • 828 • 43

  • pszemraj/infinity-instruct-7m-T2T_en

    Viewer • Updated Dec 29, 2025 • 15.2M • 114 • 4

  • tiendung/ZIN01

    Viewer • Updated Oct 2, 2023 • 5.87M • 8

  • tiendung/cnen_novels

    Viewer • Updated Sep 24, 2023 • 283 • 13

  • tiendung/vi-books_tve-4u.org

    Updated Sep 8, 2023 • 109

  • tiendung/novels

    Viewer • Updated Jan 29, 2024 • 283 • 162 • 1

  • tiendung/myzinz

    Viewer • Updated Dec 26, 2023 • 100 • 68

  • 5CD-AI/Vietnamese-nampdn-ai-tiny-webtext-gg-translated

    Viewer • Updated Feb 27, 2024 • 1.84M • 36 • 10

  • VTSNLP/vietnamese_curated_dataset

    Viewer • Updated Nov 24, 2024 • 12.2M • 916 • 74

  • neuralwork/arxiver

    Viewer • Updated Nov 1, 2024 • 63.4k • 2.98k • 368

  • opencsg/chinese-fineweb-edu-v2

    Viewer • Updated Dec 12, 2025 • 188M • 4.13k • 74