Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
magibu 's Collections
Pretrain Datasets
papers
Ekip karışık verileri
Fine-tuned LLMs
Turkish Language Healthcare Datasets

Pretrain Datasets

updated Oct 2

Datasets we use for pretraining large language models

Upvote
-

  • omarkamali/wikipedia-monthly

    Viewer • Updated 3 days ago • 132M • 18.6k • 45

  • alibayram/hukuk_soru_cevap

    Viewer • Updated Nov 6, 2024 • 2.08k • 91 • 12

  • umutertugrul/turkish-hospital-medical-articles

    Viewer • Updated Oct 2 • 24.6k • 213 • 6

  • umutertugrul/turkish-medical-articles

    Viewer • Updated Oct 2 • 42.8k • 47 • 3

  • alibayram/tr-books

    Viewer • Updated 12 days ago • 3.7k • 32

  • selimfirat/bilkent-turkish-writings-dataset

    Viewer • Updated May 24 • 25.1k • 136 • 8

  • umutertugrul/turkish-academic-theses-dataset

    Viewer • Updated Aug 18 • 649k • 294 • 8

  • alibayram/onedio_haberler

    Viewer • Updated Jun 18, 2024 • 66.7k • 7 • 5

  • habanoz/news-tr-1.8M

    Viewer • Updated Oct 6, 2024 • 1.85M • 474 • 7

  • alibayram/hepsiburada_yorumlar

    Viewer • Updated Jun 18, 2024 • 2.66M • 54 • 13

  • alibayram/kitapyurdu_yorumlar

    Viewer • Updated Jun 18, 2024 • 405k • 21

  • alibayram/beyazperde_yorumlar

    Viewer • Updated Jun 18, 2024 • 192k • 21 • 5
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs