NLP Text
updated
0x22almostEvil/russe-semantics-sim
Viewer
• Updated • 201k • 10
0x22almostEvil/semantics-ws-qna-oa
Viewer
• Updated • 1.81k • 16
0x22almostEvil/ws-semantics-simnrel
Viewer
• Updated • 1.81k • 7
Abdelkareem/Arabic-article-summarization-30-000
Viewer
• Updated • 8.38k • 11
Abdelkareem/arabic-article-summarization
Viewer
• Updated • 5.87k • 9
Abdelkareem/arabic-bbc-news
Viewer
• Updated • 9.38k • 38
• 3
Abdelkareem/arabic_articles
Abdelkareem/arabic_summarization_text
Preview
• Updated • 22
Abdelkareem/arabic_tweets_classification
Viewer
• Updated • 13.2k • 9
• 1
Abdelkareem/rwkv_articles_30_000
Abdelkareem/rwkv_articles_xp3all
Viewer
• Updated • 741 • 153
• 4
AlekseyKorshuk/dalio-book-handwritten-io
Viewer
• Updated • 767 • 3
AlekseyKorshuk/dalio-book-handwritten-io-sorted
Viewer
• Updated • 767 • 3
AlekseyKorshuk/dalio-book-handwritten-io-sorted-v2
Viewer
• Updated • 973 • 4
AlekseyKorshuk/drama-books
Viewer
• Updated • 1.11k • 76
• 4
AlekseyKorshuk/dummy-text
Viewer
• Updated • 100 • 3
AlekseyKorshuk/erotic-books
Viewer
• Updated • 646 • 157
• 29
AlekseyKorshuk/fairy-tale-books
Viewer
• Updated • 1.01k • 8
• 8
AlekseyKorshuk/fantasy-books
Viewer
• Updated • 3.51k • 14
• 11
AlekseyKorshuk/midjourney-prompts-text-dedup
Viewer
• Updated • 2.8M • 12
• 1
AlekseyKorshuk/mystery-crime-books
Viewer
• Updated • 359 • 78
• 4
AlekseyKorshuk/thriller-books
Viewer
• Updated • 366 • 4
• 3
Aratako/Magpie-Tanuki-Instruction-100k-Embeddings
Viewer
• Updated • 100k • 10
Arun63/query-domain-classification-sharegpt
Viewer
• Updated • 12.2k • 7
Arun63/query-domain-classification-sharegpt-v2
Viewer
• Updated • 12k • 6
Arun63/rag_domain_query_classification
Viewer
• Updated • 25k • 7
Arun63/rag_dsl_filter_classification
Viewer
• Updated • 10k • 5
Viewer
• Updated • 11.2k • 8
Arun63/text-to-opensearch-dsl-multi-turn-1
Viewer
• Updated • 1.43k • 8
Arun63/text-to-opensearch-dsl-multi-turn-2
Viewer
• Updated • 491 • 7
Arun63/text_to_dsl_opensearch
Viewer
• Updated • 7.33k • 5
Arun63/text_to_dsl_opensearch_new_2026
Viewer
• Updated • 9.25k • 11
Arun63/text_to_dsl_opensearch_v1_new
Viewer
• Updated • 951 • 8
Asap7772/contextual_attack_prompts
Viewer
• Updated • 8.42k • 4
Viewer
• Updated • 4.96k • 3
Ayushnangia/autotrain-data-qa_context
Preview
• Updated • 3
Ayushnangia/compactionbench-lme-text-subset
Viewer
• Updated • 20 • 22
Ayushnangia/moltbook-ayush-reanalysis-20260505
Ayushnangia/moltbook-base-model-experiment-test
Ayushnangia/moltbook-base-model-experiment-test-run3
Ayushnangia/moltbook-conspiracy-vs-factual
Ayushnangia/moltbook-ec-10m-base-model-experiments
Ayushnangia/moltbook-ec-1h-base-model-experiments
Ayushnangia/moltbook-entropy-collapse
Preview
• Updated • 54
Ayushnangia/moltbook-entropy-collapse-20agents
Ayushnangia/moltbook-entropy-collapse-30agents
Ayushnangia/moltbook-entropy-collapse-experiments
Preview
• Updated • 251
Ayushnangia/moltbook-entropy-collapse-gemini-flash-lite
Ayushnangia/moltbook-entropy-collapse-gemini-flash-lite-failures
Ayushnangia/moltbook-entropy-collapse-gemini-flash-lite-n10
Ayushnangia/moltbook-entropy-collapse-gemini-flash-lite-n20
Ayushnangia/moltbook-entropy-collapse-glm-5
Ayushnangia/moltbook-entropy-collapse-kimi-k2.5
Ayushnangia/moltbook-entropy-collapse-olmo-3-base
Ayushnangia/moltbook-entropy-collapse-olmo-3-instruct
Ayushnangia/moltbook-entropy-collapse-qwen-35b-base
Ayushnangia/moltbook-entropy-collapse-resumes
Viewer
• Updated • 163k • 21
Ayushnangia/moltbook-entropy-collapse-v2
Preview
• Updated • 66
Ayushnangia/moltbook-factcheck-conspiracy-grok
Preview
• Updated • 68
• 1
Ayushnangia/moltbook-factcheck-dose-response
Preview
• Updated • 60
Ayushnangia/moltbook-factual-threshold
Ayushnangia/moltbook-factual-threshold-v2
Ayushnangia/moltbook-frontier-mixed-1h
Ayushnangia/moltbook-frontier-mixed-mag25-1h
Ayushnangia/moltbook-obsession-gemini-flash-lite
Ayushnangia/moltbook-obsession-gpt5
Ayushnangia/moltbook-source-citation-gpt5-1h
Viewer
• Updated • 2.9k • 50
BEE-spoke-data/UltraTextbooks-2.1-fw_mix
Viewer
• Updated • 7.27M • 737
• 4
BEE-spoke-data/edgar-corpus
Viewer
• Updated • 517k • 21
BEE-spoke-data/financial-news-articles-filtered
Viewer
• Updated • 200k • 52
BEE-spoke-data/medium-articles-en
Viewer
• Updated • 180k • 36
• 2
BEE-spoke-data/rp_books-en
Viewer
• Updated • 120k • 200
• 1
BEE-spoke-data/wikipedia-20230901.en-deduped
Viewer
• Updated • 11.9M • 204
• 6
BEE-spoke-data/yahoo_answers_topics-long-text
Viewer
• Updated • 3.49k • 12
• 2
BUT-FIT/CzechSingleDocumentSummarization
Viewer
• Updated • 90k • 10
ChoudharyTAlhaArain/AdultClassificationdataset
Viewer
• Updated • 68k • 4
DCAgent/Kimi-2.5-exp-gfi-staqc-embedding-mean-filtered-10K-maxeps-32k
Viewer
• Updated • 11.9k • 15
DCAgent/e1_embedding_d1_original_sandboxes_glm_4.7_traces_jupiter
Viewer
• Updated • 12.1k • 42
DCAgent/exp-gfi-staqc-embedding-mean-filtered-10K_glm_4.7_traces_jupiter-10pct
Viewer
• Updated • 920 • 5
DCAgent/exp-gfi-staqc-embedding-mean-filtered-10K_glm_4.7_traces_jupiter-3pct
Viewer
• Updated • 276 • 6
DCAgent/nemotron-terminal-corpus-unified-100000-sandboxes
Viewer
• Updated • 100k • 7
Dahoas/full-single-context
Viewer
• Updated • 125k • 9
Viewer
• Updated • 89.5k • 53
• 1
Dahoas/sft-single-context
Viewer
• Updated • 35k • 69
• 1
Viewer
• Updated • 95.3k • 24
DamarJati/indocorpus-sastra
Viewer
• Updated • 28.8k • 26
Delta-Vector/Orion-Books-V2-ShareGPT
Viewer
• Updated • 3.99k • 4
• 2
Delta-Vector/Tauri-RL-Plaintext-System
Viewer
• Updated • 128 • 111
Delta-Vector/Ursa-Books-V2
Preview
• Updated • 3
• 2
Delta-Vector/Ursa-LN-Books-Catto
Viewer
• Updated • 2.01k • 9
Updated • 149
EMBO/sd-nlp-non-tokenized
Updated • 79
El-chapoo/Urdu-1M-news-text
Viewer
• Updated • 1.04M • 63
• 4
Emm9625/nsw_commonwealth_corpus
Viewer
• Updated • 223k • 2
Viewer
• Updated • 194k • 9
Emm9625/textwork-00-dedupe-0.5
Viewer
• Updated • 656 • 6
Emm9625/textwork-00-dedupe-0.75
Viewer
• Updated • 37k • 2
Emm9625/textwork-00-dedupe-0.8
Viewer
• Updated • 69.4k • 6
Emm9625/textwork-00-dedupe-0.85
Viewer
• Updated • 122k • 8
Emm9625/textwork-00-dedupe-optimal_threshold
Viewer
• Updated • 138k • 48
Emm9625/textwork-00-deduped
Viewer
• Updated • 71.2k • 2
Viewer
• Updated • 231k • 36
FreedomIntelligence/ApolloCorpus
Viewer
• Updated • 3.74M • 813
• 41
FreedomIntelligence/TCM-Text-Exams
Updated • 75
• 3
Granther/assorted-notebooks-bin
HiTZ/composite_corpus_es_v1.0
Viewer
• Updated • 526k • 58
HiTZ/composite_corpus_eseu_v1.0
Viewer
• Updated • 742k • 242
• 2
HiTZ/composite_corpus_eu_v2.1
Viewer
• Updated • 407k • 770
• 3
Viewer
• Updated • 4.13M • 405
• 2
Viewer
• Updated • 4.18M • 289
• 1
HuggingFaceTB/cosmopedia_web_textbooks
Viewer
• Updated • 9.97M • 10
• 1
HuggingFaceTB/cosmopedia_web_textbooks_all_2B
Updated • 2
• 1
HuggingFaceTB/issues-kaggle-notebooks
Viewer
• Updated • 16.1M • 1.29k
• 16
HuggingFaceTB/openstax_paragraphs
Viewer
• Updated • 77 • 253
• 6
HuggingFaceTB/smollm-corpus
Viewer
• Updated • 237M • 34.5k
• 469
HumynLabs/Arabic_Documents_Dataset_PDF
Viewer
• Updated • 127 • 447
HumynLabs/Chinese_Documents_Dataset_PDF
Viewer
• Updated • 30 • 335
HumynLabs/French_Documents_Dataset_PDF
Viewer
• Updated • 60 • 641
HumynLabs/German_Documents_Dataset_PDF
Viewer
• Updated • 54 • 497
HumynLabs/Italian_Documents_Dataset_PDF
Viewer
• Updated • 293 • 282
HumynLabs/Japanese_Documents_Dataset_PDF
Viewer
• Updated • 63 • 514
• 1
HumynLabs/Korean-Documents-Dataset
Viewer
• Updated • 3 • 31
HumynLabs/Russian_Documents_Dataset_PDF
Viewer
• Updated • 17 • 185
• 1
HumynLabs/Spanish_Documents_Dataset_PDF
Viewer
• Updated • 21 • 227
HumynLabs/Turkish_Documents_Datasets_PDF
Viewer
• Updated • 4 • 61
HumynLabs/flight-booking-screen-recording
Viewer
• Updated • 9 • 27
• 7
IDEA-CCNL/PretrainCorpusDemo
Viewer
• Updated • 969k • 680
• 17
Updated • 310
• 7
Intuit-GenSRF/hackathon-somos-nlp-2023-suicide-comments-es
Viewer
• Updated • 10.1k • 3
Intuit-GenSRF/hackathon-somos-nlp-2023-suicide-comments-es-en
Viewer
• Updated • 8.82k • 9
KevinZ/psycholinguistic_eval
Viewer
• Updated • 156 • 74
• 4
Viewer
• Updated • 125 • 40
• 10
Locutusque/UltraTextbooks
Viewer
• Updated • 5.52M • 2.16k
• 200
Locutusque/UltraTextbooks-2.0
Viewer
• Updated • 3.22M • 154
• 52
Viewer
• Updated • 8.4M • 268
• 4
Madras1/rag-qa-fulltext-ptbr
Viewer
• Updated • 1.43M • 66
Viewer
• Updated • 260k • 45
• 20
Viewer
• Updated • 61.7M • 13
MongoDB/airbnb_embeddings
Viewer
• Updated • 5.56k • 478
• 7
MongoDB/devcenter-articles
Viewer
• Updated • 619 • 26
MongoDB/devcenter-articles-embedded
Viewer
• Updated • 218 • 13
MongoDB/subset_arxiv_papers_with_embeddings
Viewer
• Updated • 50k • 6.95k
• 2
MongoDB/tech-news-embeddings
Viewer
• Updated • 1.58M • 929
• 6
MongoDB/wikipedia-22-12-en-annotation
Viewer
• Updated • 87.2k • 180
MongoDB/wikipedia-22-12-en-nomic-embedded
Viewer
• Updated • 951k • 52
MongoDB/wikipedia-22-12-en-voyage-embed
Viewer
• Updated • 342k • 542
NLPC-UOM/AnanyaSinhalaNERDataset
Preview
• Updated • 5
NLPC-UOM/English-Tamil-Parallel-Corpus
Viewer
• Updated • 62.9k • 15
• 3
NLPC-UOM/LLM-Eval-Sinhala
Preview
• Updated • 16
Viewer
• Updated • 22.1k • 32
• 3
Viewer
• Updated • 66.3k • 14
NLPC-UOM/Sentiment-tagger
Viewer
• Updated • 68.4k • 7
Viewer
• Updated • 1k • 7
NLPC-UOM/Sinhala-Neuspellcorrector
NLPC-UOM/Sinhala-News-Category-classification
Viewer
• Updated • 3.33k • 79
• 1
NLPC-UOM/Sinhala-News-Source-classification
Viewer
• Updated • 24.1k • 11
NLPC-UOM/Sinhala-POS-Data
NLPC-UOM/Sinhala-Stopword-list
Updated • 124
NLPC-UOM/Sinhala-Tamil-Aligned-Parallel-Corpus
Viewer
• Updated • 2.27k • 9
NLPC-UOM/Sinhala-news-clustering
NLPC-UOM/Sinhala-short-sentences
Updated • 8
• 1
NLPC-UOM/Student_feedback_analysis_dataset
Preview
• Updated • 28
• 6
NLPC-UOM/Tamil-Sinhala-short-sentence-similarity-deep-learning
Updated • 15
NLPC-UOM/Travel-Dataset-5000
Updated • 14
• 8
NLPC-UOM/ensi_enta_sita_curated_parallel_data
Preview
• Updated • 36
NLPC-UOM/nllb-top25k-ensi-cleaned
Viewer
• Updated • 25k • 8
• 2
NLPC-UOM/nllb-top25k-enta-cleaned
Viewer
• Updated • 25k • 4
NLPC-UOM/sinhala-sentiment-lexicon-generation
Viewer
• Updated • 1.84M • 134
• 18
OALL/details_princeton-nlp__Llama-3-8B-ProLong-512k-Instruct
Viewer
• Updated • 146k • 2.62k
Viewer
• Updated • 5.3k • 8
• 14
Viewer
• Updated • 2.45k • 54
OdiaGenAI/odia_context_10K_llama2_set
Viewer
• Updated • 10.5k • 6
• 1
OdiaGenAI/odia_context_qa_98k
Viewer
• Updated • 98k • 12
OdiaGenAI/odia_domain_context_train_v1
Viewer
• Updated • 10.5k • 8
OdiaGenAI/sentiment_analysis_hindi
Viewer
• Updated • 2.5k • 85
• 2
OmniAICreator/Japanese-Wikipedia-202506
Viewer
• Updated • 1.44M • 153
• 4
Open-Orca/gpt4-1m-orca-embeddings
Viewer
• Updated • 355k • 86
• 6
OusiaResearch/Aureth-Corpus-Hermes4.3-Generated
Viewer
• Updated • 654k • 64
• 14
Viewer
• Updated • 1.04k • 22
• 2
PJMixers/AP-News-2024-CGPT-Summarize-ShareGPT
Viewer
• Updated • 616 • 7
• 1
ResplendentAI/Luna_NSFW_Text
Viewer
• Updated • 2.9k • 22
• 11
SEACrowd/indolem_sentiment
Updated • 52
SEACrowd/indonesian_news_dataset
Updated • 36
SEACrowd/mtop_intent_classification
Updated • 18
Updated • 17
Salesforce/ContextualBench
Viewer
• Updated • 216k • 136
• 15
Salesforce/ContextualJudgeBench
Viewer
• Updated • 2k • 43
• 3
Viewer
• Updated • 3.71M • 1.32M
• 727
SeppeV/jokeTailor_embeddings
SeppeV/user_embeddings_jester
Viewer
• Updated • 45k • 15
SeppeV/user_embeddings_jester_bert
Viewer
• Updated • 45k • 6
agentlans/HuggingFaceFW-finetranslations-100-languages-sample
Viewer
• Updated • 200k • 245
agentlans/en-document-classification
Viewer
• Updated • 8.13M • 459
Viewer
• Updated • 30k • 33
agentlans/grammar-classification
Viewer
• Updated • 600k • 44
• 2
agentlans/grammar-correction
Viewer
• Updated • 125k • 163
• 10
agentlans/high-quality-english-sentences
Viewer
• Updated • 1.71M • 400
• 37
agentlans/high-quality-multilingual-sentences
Viewer
• Updated • 3.11M • 317
• 9
agentlans/high-quality-text
Viewer
• Updated • 888k • 61
agentlans/library-classification-systems
Viewer
• Updated • 26.5k • 254
• 2
agentlans/lime-nlp-difficulty
Viewer
• Updated • 118k • 21
agentlans/multilingual-document-classification
Viewer
• Updated • 700k • 122
agentlans/multilingual-sentences
Viewer
• Updated • 19.5M • 265
• 6
agentlans/multilingual-text
Viewer
• Updated • 5.03M • 251
• 5
Viewer
• Updated • 1.5M • 27
agentlans/sql-text-collection
Viewer
• Updated • 384k • 28
• 1
Viewer
• Updated • 100k • 47
• 2
agentlans/text-sft-questions-answers-only
Viewer
• Updated • 151k • 170
• 2
agentlans/wikipedia-first-paragraph
Viewer
• Updated • 23.2M • 31
agentlans/wikipedia-first-paragraph-ner
Viewer
• Updated • 7.79M • 66
• 2
agentlans/wikipedia-paragraph-keywords
Viewer
• Updated • 21.8k • 16
• 1
agentlans/wikipedia-paragraph-summaries
Viewer
• Updated • 21.8k • 10
agentlans/wikipedia-paragraphs
Viewer
• Updated • 21.8k • 1.1k
• 3
agentlans/wikipedia-paragraphs-complete
Viewer
• Updated • 3.32M • 86
• 1
allenai/olmoearth-paper-embeddings
Updated • 4.42k
• 8
Viewer
• Updated • 11.9k • 155k
• 133
Viewer
• Updated • 32.6k • 6
argilla/banking_sentiment_setfit
Viewer
• Updated • 144 • 46
• 2
argilla/end2end_textclassification
Viewer
• Updated • 1k • 42
• 2
argilla/end2end_textclassification_with_metadata
Viewer
• Updated • 1k • 128
• 1
argilla/end2end_textclassification_with_suggestions_and_responses
Viewer
• Updated • 1k • 33
• 3
argilla/end2end_textclassification_with_vectors
Viewer
• Updated • 1k • 44
• 1
Viewer
• Updated • 38.1k • 6
Viewer
• Updated • 44.9k • 10
• 1
Viewer
• Updated • 21.4k • 186
• 40
Viewer
• Updated • 114 • 6
• 1
argilla/rag-embeddings-relevance-similarity
Viewer
• Updated • 6.25k • 18
• 1
argilla/sharegpt-text-descriptives
Viewer
• Updated • 3.24k • 5
argilla/synthetic-domain-text-classification
Viewer
• Updated • 1k • 29
• 6
argilla/synthetic-text-classification-news
Viewer
• Updated • 100 • 94
• 10
argilla/synthetic-text-classification-news-multi-label
Viewer
• Updated • 100 • 23
• 5
argilla/text-descriptives-metadata
Viewer
• Updated • 1.03k • 45
argilla/textcat-tokencat-pii-per-domain
Viewer
• Updated • 2.1k • 11
astarostap/autonlp-data-antisemitism-2
Preview
• Updated • 125
• 1
autoevaluate/autoeval-staging-eval-project-kmfoda__booksum-636bebc2-11085484
Viewer
• Updated • 1.43k • 4
autoevaluate/autoeval-staging-eval-project-kmfoda__booksum-79c1c0d8-10905464
Viewer
• Updated • 1.43k • 5
autoevaluate/autoeval-staging-eval-project-kmfoda__booksum-e703e34d-10975474
Viewer
• Updated • 1.43k • 4
Viewer
• Updated • 138 • 13
Viewer
• Updated • 78.6k • 4.3k
• 501
behavior-in-the-wild/content-behavior-corpus
Viewer
• Updated • 24.6k • 144
• 5
beyoru/synthetic_text_to_sql_filter
Viewer
• Updated • 71.2k • 16
breadlicker45/gender-bluesky-classification
Viewer
• Updated • 63.1k • 9
breadlicker45/gender-bluesky-classification-v2
Viewer
• Updated • 8.11k • 2
breadlicker45/gender-bluesky-classification-v3
Viewer
• Updated • 975k • 21
breadlicker45/gender-bluesky-classification-v4
Viewer
• Updated • 8.24M • 16
breadlicker45/gender-classification-v4.5
Viewer
• Updated • 79.6M • 7
Preview
• Updated • 6
chillies/course-review-multilabel-sentiment-analysis
Viewer
• Updated • 8.21k • 26
Viewer
• Updated • 27.9M • 5
Viewer
• Updated • 20.7M • 5
communityai/apt_pretrain_textbook_16k
Viewer
• Updated • 116k • 5
• 2
communityai/apt_pretrain_textbook_16k-100
Viewer
• Updated • 100 • 9
communityai/apt_pretrain_textbook_16k-1k
Viewer
• Updated • 1k • 4
communityai/gretelai___synthetic_text_to_sql
Viewer
• Updated • 100k • 8
communityai/gretelai___synthetic_text_to_sql-10k
Viewer
• Updated • 10k • 10
communityai/gretelai___synthetic_text_to_sql-15k
Viewer
• Updated • 15k • 8
communityai/gretelai___synthetic_text_to_sql-20k
Viewer
• Updated • 20k • 11
communityai/gretelai___synthetic_text_to_sql-25k
Viewer
• Updated • 25k • 10
communityai/gretelai___synthetic_text_to_sql-30k
Viewer
• Updated • 30k • 10
cyberlangke/whitesilkmarisa-corpus
Preview
• Updated • 51
darkknight25/Adversarial_Machine_Learning_TextFooler_Dataset
Updated • 20
darkknight25/Incident_Response_Playbook_Dataset
Updated • 804
• 2
davanstrien/newspaper_navigator
Viewer
• Updated • 48M • 177
davidquicast/wikipedia-txt-spanish
Viewer
• Updated • 13M • 75
• 3
derek-thomas/autotrain-data-i-bert-twitter-sentiment
Preview
• Updated • 4
derek-thomas/classification-ie-optimization
Viewer
• Updated • 246 • 25
derek-thomas/embedding-ie-optimization
Viewer
• Updated • 80 • 35
Viewer
• Updated • 82.5k • 12
• 1
Viewer
• Updated • 7.14k • 9
Viewer
• Updated • 3k • 4
dinushiTJ/nz_hansard_classification
Viewer
• Updated • 6.23k • 30
• 1
dinushiTJ/nz_hansard_classification_10k_tokens
Viewer
• Updated • 2.61k • 4
dinushiTJ/nz_hansard_classification_4096_tokens
Viewer
• Updated • 1.06k • 4
dinushiTJ/nz_research_commons_classification
Viewer
• Updated • 16.6k • 102
• 1
diwank/IBMDebaterEvidenceSentences
Viewer
• Updated • 5.78k • 6
diwank/imaginary-nlp-dataset
Viewer
• Updated • 1.04M • 28
• 1
diwank/llmlingua-compressed-text
Viewer
• Updated • 222k • 9
• 2
dmayhem93/random-walk-reddit-corpus-55-cleaned
Viewer
• Updated • 6.14M • 4
dmayhem93/random-walk-reddit-corpus-small
Viewer
• Updated • 8.29k • 6
dmayhem93/self-critiquing-base-topic-embeddings
Viewer
• Updated • 2.76k • 3
dmayhem93/top-2-reddit-corpus-small
Viewer
• Updated • 8.29k • 2
dmayhem93/top-n-reddit-corpus-55-cleaned
Viewer
• Updated • 6.14M • 5
emozilla/Long-Data-Collections-Pretrain-Without-Books
Viewer
• Updated • 9.38M • 712
• 2
emozilla/booksum-summary-analysis
Viewer
• Updated • 15.7k • 44
• 5
emozilla/booksum-summary-analysis_gptneox-8192
Viewer
• Updated • 14.1k • 24
• 6
emozilla/booksum-summary-analysis_llama-16384
Viewer
• Updated • 15.7k • 22
• 1
emozilla/booksum-summary-analysis_llama-2048
Viewer
• Updated • 2.27k • 22
• 3
emozilla/booksum-summary-analysis_llama-8192
Viewer
• Updated • 13.5k • 23
• 10
emozilla/dolma-v1_7-books
Viewer
• Updated • 56k • 37
• 2
emozilla/pg_books-tokenized-bos-eos-chunked-65536
Viewer
• Updated • 79.5k • 476
• 7
Viewer
• Updated • 59.1k • 459
• 13
Viewer
• Updated • 2.02k • 363
• 2
Updated • 332
• 6
Viewer
• Updated • 1.65k • 331
• 17
Viewer
• Updated • 1.19M • 334
• 9
Viewer
• Updated • 71.9k • 82
• 2
Viewer
• Updated • 23k • 5.17k
• 12
Preview
• Updated • 664
• 4
Viewer
• Updated • 3.99M • 392
• 14
Viewer
• Updated • 13.3k • 1.85k
• 47
Preview
• Updated • 146
• 7
Viewer
• Updated • 2.46M • 40
• 23
Viewer
• Updated • 1.07k • 858
• 14
Viewer
• Updated • 4.5k • 2.54k
• 38
Preview
• Updated • 339
• 73
Viewer
• Updated • 219k • 214
• 7
Viewer
• Updated • 43.6k • 103
• 1
Updated • 115
• 23
Viewer
• Updated • 21 • 9.91k
• 21
Viewer
• Updated • 9 • 37
• 6
Viewer
• Updated • 2 • 60
• 4
facebook/Self-taught-evaluator-DPO-data
Viewer
• Updated • 57.5k • 27
• 35
facebook/ShapeR-Evaluation
Updated • 1.18k
• 15
facebook/action100m-preview
Viewer
• Updated • 120k • 3.88k
• 146
Preview
• Updated • 802
• 9
Viewer
• Updated • 20 • 62
• 5
Viewer
• Updated • 169k • 18.5k
• 50
Viewer
• Updated • 6.86k • 1.86k
• 14
Viewer
• Updated • 10.4k • 908
• 13
Viewer
• Updated • 110k • 21.1k
• 128
facebook/beyond_the_lab_neurips_paper
Viewer
• Updated • 138k • 25
• 1
Updated • 2.12k
• 44
Preview
• Updated • 205
• 6
facebook/collaborative_agent_bench
Preview
• Updated • 107
• 60
facebook/content_rephrasing
Viewer
• Updated • 3.21k • 59
• 16
Updated • 532
• 46
Updated • 85
• 3
facebook/curiosity_dialogs
Updated • 161
• 14
Updated • 1.85k
• 4
Viewer
• Updated • 491k • 65.6k
• 15
facebook/emu_edit_test_set
Viewer
• Updated • 5.61k • 497
• 46
facebook/emu_edit_test_set_generations
Viewer
• Updated • 5.61k • 229
• 38
Viewer
• Updated • 1.23M • 6.81k
• 106
facebook/gelsight-force-estimation
Viewer
• Updated • 1 • 547
• 2
Viewer
• Updated • 615 • 331
• 1
Viewer
• Updated • 4.21M • 99
• 4
facebook/hand_tracking_challenge_umetrack
Viewer
• Updated • 934k • 756
• 1
Viewer
• Updated • 25.5k • 382
• 1
Viewer
• Updated • 524 • 332
• 19
Viewer
• Updated • 3.23M • 6.45k
• 68
Updated • 412
• 20
Updated • 993
• 19
Viewer
• Updated • 160 • 173
• 5
facebook/map-anything-benchmarking
Updated • 814
• 3
Viewer
• Updated • 6.42k • 161
• 12
facebook/meta-active-reading
Viewer
• Updated • 1.83B • 1.57k
• 36
Updated • 2.11k
• 44
Viewer
• Updated • 67.3k • 194
• 6
facebook/omnilingual-asr-corpus
Viewer
• Updated • 548k • 3.84k
• 207
facebook/optimal_thinking_bench
Viewer
• Updated • 1.88k • 51
• 1
Viewer
• Updated • 106k • 35
• 8
Updated • 35
• 41
Viewer
• Updated • 2.24k • 199
• 20
facebook/principia-collection
Viewer
• Updated • 554k • 299
• 45
facebook/recycling_the_web
Viewer
• Updated • 60.3M • 1.43k
• 68
facebook/research-plan-gen
Viewer
• Updated • 22.5k • 469
• 302
facebook/seamless-interaction
Updated • 44k
• 190
facebook/sparsh-x-dataset
Updated • 71
Updated • 1.34k
• 38
Viewer
• Updated • 24 • 207
Updated • 67.8k
• 45
Updated • 252
• 15
Viewer
• Updated • 400 • 1.62k
• 113
Viewer
• Updated • 6.4M • 21.8k
• 72
femboysLover/gemini_trader_embeddings_dataset
Viewer
• Updated • 60.1k • 8
flamesbob/Line_style-Embedding
Updated • 1
• 3
free-law/Caselaw_Access_Project_embeddings
Viewer
• Updated • 2M • 40
• 8
free-law/alaska_embeddings
Viewer
• Updated • 10.7k • 1
free-law/arizona_embeddings
Viewer
• Updated • 28.4k • 12
• 1
free-law/arkansas_embeddings
Viewer
• Updated • 60.5k • 1
free-law/california_embeddings
Viewer
• Updated • 144k • 1
free-law/colorado_embeddings
Viewer
• Updated • 40.9k • 8
Viewer
• Updated • 56.4k • 10
Viewer
• Updated • 172k • 5
Viewer
• Updated • 18.4k • 1
free-law/idaho_embeddings
Viewer
• Updated • 19.4k • 1
Viewer
• Updated • 184k • 10
Viewer
• Updated • 92.7k • 7
Viewer
• Updated • 57.7k • 6
Viewer
• Updated • 79.6k • 1
Viewer
• Updated • 313k • 10
Viewer
• Updated • 91.7k • 1
Viewer
• Updated • 43.9k • 1
Viewer
• Updated • 82.8k • 3
Viewer
• Updated • 56.1k • 1
Viewer
• Updated • 60.4k • 1
Viewer
• Updated • 140k • 1
free-law/n_mar_i_embeddings
Viewer
• Updated • 395 • 1
free-law/navajo_nation_embeddings
Viewer
• Updated • 966 • 1
Viewer
• Updated • 108k • 4
• 1
Viewer
• Updated • 21.5k • 1
Viewer
• Updated • 18.5k • 8
Viewer
• Updated • 683k • 1
Viewer
• Updated • 67.1k • 1
Viewer
• Updated • 56.8k • 1
Viewer
• Updated • 239k • 6
Viewer
• Updated • 45.7k • 1
Viewer
• Updated • 41.9k • 6
Viewer
• Updated • 16.6k • 1
Viewer
• Updated • 38.4k • 1
Viewer
• Updated • 251k • 1
free-law/tribal_embeddings
Viewer
• Updated • 1.4k • 1
Viewer
• Updated • 2 • 1
Viewer
• Updated • 3.47k • 1
Viewer
• Updated • 27.7k • 5
Viewer
• Updated • 106k • 9
free-law/wikitext-2-v1-with-embeddings
Viewer
• Updated • 36.7k • 5
Viewer
• Updated • 49.1k • 3
french-open-data/principaux-corpus-d-archives-numerises-et-mis-en-ligne-par-les-archives-departementales
Viewer
• Updated • 1.67M • 30.1k
• 237
hac541309/polyglot-ko-tokenizer-corpus
Viewer
• Updated • 11.8M • 486
• 1
hac541309/polyglot-ko-tokenizer-corpus-merge_ws
Viewer
• Updated • 11.8M • 199
harpreetsahota/CVPR_2024_Papers_with_Embeddings
Viewer
• Updated • 2.38k • 6
• 2
harpreetsahota/Instruction-Following-Evaluation-for-Large-Language-Models
Viewer
• Updated • 541 • 50
• 7
harpreetsahota/elicit-offensive-language-prompts
Viewer
• Updated • 73 • 21
• 3
harpreetsahota/eval_sentence_split_chunk_size_512_answer
Viewer
• Updated • 174 • 3
harpreetsahota/fiftyone-qa-with-qwen-embeddings
Viewer
• Updated • 28.1k • 8
harpreetsahota/testing_qwen3vl_embeddings
Viewer
• Updated • 412 • 492
haseong8012/Korean_Political-News_By_Media-Outlet
Updated • 74
ibm-research/Wikipedia_contradict_benchmark
Viewer
• Updated • 506 • 644
• 27
Viewer
• Updated • 6.28k • 355
• 3
Viewer
• Updated • 25.2k • 627
• 57
Updated • 4
• 1
irds/msmarco-document_trec-dl-hard
irds/nfcorpus_test_nontopic
irds/wapo_v2_trec-news-2018
irds/wapo_v2_trec-news-2019
irds/wapo_v3_trec-news-2020
jayavibhav/classification-gen-ai
Viewer
• Updated • 141k • 4
jayavibhav/new-updated-gen-classification
jayavibhav/text2sql-cleaned
Viewer
• Updated • 262k • 7
• 1
jondurbin/contextual-dpo-v0.1
Viewer
• Updated • 1.37k • 153
• 33
Viewer
• Updated • 4.34M • 4
jtatman/master_textbook_lines
Viewer
• Updated • 3.12M • 3
jtatman/movie_sentiment_reviews
Viewer
• Updated • 1k • 14
jtatman/myers_briggs_text_classify
Viewer
• Updated • 8.68k • 10
jtatman/textbooks-are-all-you-need-lite-instruct
Viewer
• Updated • 682k • 14
• 2
jtatman/textbooks-lite-100k-sharegpt
Viewer
• Updated • 114k • 5
jtatman/textbooks-lite-15k-sharegpt
Viewer
• Updated • 18.9k • 8
jtatman/textbooks-lite-700k-sharegpt
Viewer
• Updated • 682k • 62
• 2
Viewer
• Updated • 24 • 6
justinphan3110/textquests
Viewer
• Updated • 407 • 8
justinphan3110/wmdp-bio-forget-corpus
Viewer
• Updated • 24.5k • 5
Updated • 51
• 2
lamini/bird_spider_train_text_to_sql
Viewer
• Updated • 17.5k • 28
• 5
Viewer
• Updated • 11k • 114
• 7
lamini/lamini-wikipedia-page
Updated • 42
lamini/spider_text_to_sql
Viewer
• Updated • 8.03k • 76
• 9
lamini/text_to_sql_finetune
Viewer
• Updated • 17.5k • 62
• 15
lianghsun/bird-text2sql-bench
Viewer
• Updated • 9.43k • 78
• 1
lianghsun/free_english_news
Viewer
• Updated • 1.6M • 9
lianghsun/spider-text2sql-bench
Viewer
• Updated • 7k • 38
lianghsun/tw-gov-news-90M
Viewer
• Updated • 117k • 11
lianghsun/tw-hokkien-seed-text
Viewer
• Updated • 1.24M • 13
• 4
lianghsun/tw-law-article-evolution
Viewer
• Updated • 1.42M • 9
lianghsun/tw-law-article-num-convention
Viewer
• Updated • 2.61k • 43
lianghsun/tw-law-article-qa-DPO
Viewer
• Updated • 108 • 11
lianghsun/tw-legal-news-24M
Viewer
• Updated • 17.7k • 9
Viewer
• Updated • 171 • 62
• 4
Viewer
• Updated • 649k • 12
• 1
lianghsun/tw-processed-law-article
Viewer
• Updated • 231k • 91
• 3
lianghsun/tw-structured-law-article
lianghsun/tw-textbook-dpo
Preview
• Updated • 11
lianghsun/wikipedia-zh-742M
Viewer
• Updated • 5.92M • 102
• 4
lianghsun/wikipedia-zh-filtered
Viewer
• Updated • 26.3k • 16
lightonai/embeddings-fine-tuning
Viewer
• Updated • 53.7M • 2.85k
• 21
lightonai/embeddings-pre-training
Viewer
• Updated • 1.38B • 3.36k
• 48
lightonai/embeddings-pre-training-curated
Viewer
• Updated • 665M • 6.1k
• 12
lightonai/embeddings-pre-training-test
Viewer
• Updated • 323k • 8
lightonai/embeddings_supervised
Viewer
• Updated • 3.43M • 1.78k
• 10
lightonai/nfcorpus-decontaminated
Viewer
• Updated • 18.3k • 49
lionelchg/dolly_classification
Viewer
• Updated • 2.14k • 19
Viewer
• Updated • 16.4k • 106
• 24
litagin/jvnv_corpus_v1_no_nv
Viewer
• Updated • 1.62k • 649
• 4
Viewer
• Updated • 44.5k • 17
Viewer
• Updated • 509k • 452
• 11
manishiitg/en-embeddings-bge
Viewer
• Updated • 724k • 44
maxidl/FineNews-unfiltered
Viewer
• Updated • 31.4M • 2.33k
• 1
meandyou200175/word_embedding
Viewer
• Updated • 10.4k • 4
• 1
meandyou200175/word_embedding_200k
Viewer
• Updated • 207k • 19
mehuldamani/neurips-grammarly-eval-v1
Viewer
• Updated • 200 • 66
meoconxinhxan/text_books_gemini
Viewer
• Updated • 228k • 3
• 1
microsoft/msr_text_compression
Updated • 164
• 10
Viewer
• Updated • 1.41k • 136
• 8
mlabonne/synthetic_text_to_sql-ShareGPT
Viewer
• Updated • 106k • 14
• 4
Viewer
• Updated • 200k • 5
multi-train/ccnews_title_text_1107
Viewer
• Updated • 200k • 4
multi-train/downloaded_notebooks
multi-train/sentence-compression_1107
Viewer
• Updated • 180k • 10
nahiar/sentiment_30k_data_train_sentimen_id_post-processing
Viewer
• Updated • 30k • 7
nahiar/sentiment_3kdata-inggris
Viewer
• Updated • 2.9k • 9
nahiar/sentiment_clean_20k-60k_ham_only
Viewer
• Updated • 32.5k • 9
nahiar/sentiment_data-20-60k-labelling
Viewer
• Updated • 32.5k • 7
nahiar/sentiment_data-en-3k-labelling
Viewer
• Updated • 3.42k • 8
nahiar/sentiment_data-en-sentiment-3k
Viewer
• Updated • 3.42k • 7
nahiar/sentiment_data-inggris
Viewer
• Updated • 3.42k • 7
nahiar/sentiment_data-testing-300k-labelling
Viewer
• Updated • 300 • 6
nahiar/sentiment_data-testing-sentiment-300
Viewer
• Updated • 300 • 4
nahiar/sentiment_data-train-30k-id
Viewer
• Updated • 30k • 7
nahiar/sentiment_data-train-30k-sentimen-id-en
Viewer
• Updated • 47.5k • 6
nahiar/sentiment_data-train-bahasa-inggris
Viewer
• Updated • 31.2k • 7
nahiar/sentiment_data-train-sentiment-32k-up-id
Viewer
• Updated • 34.9k • 10
nahiar/sentiment_data-train-sentiment-40k-id-en
Viewer
• Updated • 32.5k • 5
nahiar/sentiment_data-train_db_sentimen_full
Viewer
• Updated • 67.4k • 12
nahiar/sentiment_data-train_db_sentimen_full-copy1
Viewer
• Updated • 67.4k • 9
nahiar/sentiment_data_train_id_en_sentiment_30k_post-processing
Viewer
• Updated • 30k • 17
nahiar/sentiment_data_train_sentimen_id_post-processing
Viewer
• Updated • 30k • 12
nahiar/sentiment_inggris_3k_csv
Viewer
• Updated • 3.42k • 10
nahiar/sentiment_tmp_20k-100k_sentimen
Viewer
• Updated • 67.4k • 13
Viewer
• Updated • 200 • 3
Viewer
• Updated • 12.3k • 4
nekofura/tooth_classification
Preview
• Updated • 41
• 1
Viewer
• Updated • 9.72k • 3
nlplabtdtu/Extract-QA-question-answer-with-context
Viewer
• Updated • 7.6k • 2
nlplabtdtu/Extractive-QA-type-2
Viewer
• Updated • 9.22k • 1
Viewer
• Updated • 393k • 8
• 1
nlplabtdtu/OpenOrca-2-fact-vi
Viewer
• Updated • 2.72k • 11
nlplabtdtu/OpenOrca-conclusion-condition-vi
Viewer
• Updated • 1.11k • 6
nlplabtdtu/OpenOrca-describe-vi
Viewer
• Updated • 3.22k • 5
nlplabtdtu/OpenOrca-movieplot-vi
Viewer
• Updated • 25.4k • 5
nlplabtdtu/OpenOrca-predict-people-action-vi
Viewer
• Updated • 2.15k • 7
nlplabtdtu/OpenOrca-processes-QA-vi
Viewer
• Updated • 32.9k • 6
nlplabtdtu/OpenOrca-solution-for-a-goal-vi
Viewer
• Updated • 1.25k • 11
nlplabtdtu/ai_la_trieu_phu
Viewer
• Updated • 13.6k • 1
nlplabtdtu/biosses-sts-vi
Viewer
• Updated • 100 • 3
Viewer
• Updated • 18.7k • 1
• 2
nlplabtdtu/classification_fqa
Viewer
• Updated • 1.95k • 3
nlplabtdtu/classification_fqa_cmc
Viewer
• Updated • 2.3k • 6
nlplabtdtu/classification_fqa_cmc_31
Viewer
• Updated • 341 • 8
Viewer
• Updated • 4.17k • 5
Viewer
• Updated • 6.38k • 5
Viewer
• Updated • 25.7k • 5
Viewer
• Updated • 1.13k • 3
nlplabtdtu/daily_dialog_gan
Viewer
• Updated • 13.1k • 5
nlplabtdtu/daily_dialog_gan_discriminator
Viewer
• Updated • 6.45k • 1
• 1
nlplabtdtu/data-synthetic-part-2
Viewer
• Updated • 467k • 33
• 1
Viewer
• Updated • 27 • 6
nlplabtdtu/diem_chuan_dai_hoc
Viewer
• Updated • 36.2k • 1
Viewer
• Updated • 2.39k • 7
nlplabtdtu/ds-synthetic-version-2
Viewer
• Updated • 416k • 46
nlplabtdtu/edu-crawl-with-date
Viewer
• Updated • 279k • 1
nlplabtdtu/edu_data_with_tag
Viewer
• Updated • 214k • 3
Viewer
• Updated • 251 • 3
nlplabtdtu/first_step_intent_data
Viewer
• Updated • 483 • 9
nlplabtdtu/general-multi-choices-ailatrieuphu-870
Viewer
• Updated • 870 • 3
nlplabtdtu/general-multi-choices-food-100-v2
Viewer
• Updated • 78 • 4
nlplabtdtu/general-multi-choices-geo
Viewer
• Updated • 62 • 2
nlplabtdtu/general-multi-choices-tech
nlplabtdtu/general-people-multichoices-vi
Viewer
• Updated • 100 • 2
Viewer
• Updated • 1.31k • 3
Viewer
• Updated • 4.25k • 3
Viewer
• Updated • 9.72k • 6
nlplabtdtu/legal-citation-choosen-qa
Viewer
• Updated • 775 • 2
nlplabtdtu/legal-multiple-choice
Viewer
• Updated • 1.78k • 3
nlplabtdtu/legal_qa_with_old_docs
Viewer
• Updated • 16.9k • 3
Viewer
• Updated • 19 • 2
Viewer
• Updated • 27.2k • 2
nlplabtdtu/multi-choices-food-100-v2
Viewer
• Updated • 78 • 1
nlplabtdtu/multi-choices-text
Viewer
• Updated • 58.3k • 1
Viewer
• Updated • 56.2k • 42
nlplabtdtu/people-wiki-vi
Viewer
• Updated • 10.1k • 3
Viewer
• Updated • 19.6k • 1
Viewer
• Updated • 20.6k • 5
Viewer
• Updated • 1k • 5
Viewer
• Updated • 48 • 7
Viewer
• Updated • 203 • 5
nlplabtdtu/review_edu_data
Viewer
• Updated • 684 • 7
• 1
Viewer
• Updated • 777k • 5
Viewer
• Updated • 1.12M • 5
Viewer
• Updated • 16.4k • 4
nlplabtdtu/sentiment-analysis-UIT
Viewer
• Updated • 16.4k • 3
nlplabtdtu/sentiment-analysis-se
Viewer
• Updated • 494 • 3
Viewer
• Updated • 9.93k • 3
Viewer
• Updated • 3.11k • 6
Viewer
• Updated • 1.5k • 3
Viewer
• Updated • 3.75k • 3
Viewer
• Updated • 3k • 3
Viewer
• Updated • 1.19k • 3
nlplabtdtu/summarization_sft
Viewer
• Updated • 1.2k • 3
nlplabtdtu/summarization_sft_65K_prompted
Viewer
• Updated • 66.4k • 1
nlplabtdtu/summarization_sft_prompted
Viewer
• Updated • 1.2k • 15
Viewer
• Updated • 66.4k • 4
Viewer
• Updated • 570 • 3
nlplabtdtu/tdtu_info_major
Viewer
• Updated • 44 • 3
Viewer
• Updated • 106 • 5
nlplabtdtu/train-tokenizor-ds-T5
Viewer
• Updated • 1.89M • 3
Viewer
• Updated • 14.3k • 1
nlplabtdtu/tvpl-chinh-sach-moi
Viewer
• Updated • 49.7k • 1
nlplabtdtu/tvpl-chinh-sach-moi-links
Viewer
• Updated • 49.7k • 1
nlplabtdtu/tvpl-qa-detail
Viewer
• Updated • 46.4k • 5
• 1
Viewer
• Updated • 329k • 145
nlplabtdtu/tvpl_split_error
Viewer
• Updated • 2.63k • 9
nlplabtdtu/uni_collection
Viewer
• Updated • 224k • 4
nlplabtdtu/uni_law_review_data
Viewer
• Updated • 10.4k • 3
nlplabtdtu/university-dataset
Viewer
• Updated • 214k • 1
nlplabtdtu/val-tokenizor-ds-T5
Viewer
• Updated • 210k • 2
Viewer
• Updated • 329k • 2
nlplabtdtu/vi-legal-docs-html
Viewer
• Updated • 329k • 1
Viewer
• Updated • 3.69k • 2
nlplabtdtu/wikihow-processes-vi
Viewer
• Updated • 2.44k • 4
Viewer
• Updated • 330k • 5
Preview
• Updated • 20
nlplabtdtu/xquad_benchmark
Viewer
• Updated • 1.19k • 37
nvidia/Nemotron-Terminal-Corpus
Viewer
• Updated • 366k • 4.67k
• 134
Viewer
• Updated • 13.8M • 118
omarkamali/wikipedia-labels
Viewer
• Updated • 108 • 49
• 2
omarkamali/wikipedia-monthly
Viewer
• Updated • 195M • 5.81k
• 72
omarkamali/wikipedia-monthly-next
Viewer
• Updated • 163k • 120
open-llm-leaderboard-old/details_pe-nlp__llama-2-13b-platypus-vicuna-wizard
Updated • 27
openai/BrowseCompLongContext
Viewer
• Updated • 295 • 9.99k
• 53
Viewer
• Updated • 2.91k • 46
• 9
opensporks/hackernews-top
Viewer
• Updated • 503k • 9
opensporks/stocknewseventssentiment-snes-10
Viewer
• Updated • 218k • 5
• 1
Viewer
• Updated • 77.5k • 24
prayslaks/wikimedia_wikipedia_100K
Viewer
• Updated • 100k • 28
prayslaks/wikimedia_wikipedia_1M
Viewer
• Updated • 1M • 44
projectlosangeles/midisim-embeddings
rcds/wikipedia-for-mask-filling
Viewer
• Updated • 828k • 118
rcds/wikipedia-persons-masked
Viewer
• Updated • 68.7k • 25
• 3
rohanbalkondekar/rework-book-qna
Viewer
• Updated • 305 • 3
sam2ai/odia_cc_news_parallel
Viewer
• Updated • 6.15k • 6
Viewer
• Updated • 10k • 58
Viewer
• Updated • 736k • 165
semran1/cci4-extra-books-subset
Viewer
• Updated • 163k • 27
semran1/dclm-ohfwfwbookyulan
Viewer
• Updated • 1.62M • 75
Viewer
• Updated • 5.17M • 4
semran1/fineweb-edu-book-4
Viewer
• Updated • 1.4M • 4
semran1/opc-annealing-corpus-synth-qa
Viewer
• Updated • 5.32M • 20
Preview
• Updated • 5
semran1/textbooks_sample_ffw
Viewer
• Updated • 608k • 73
sert121/adult_dataset_balanced_text_shuffled
Viewer
• Updated • 15.7k • 4
sert121/spambase_dataset_balanced_text
Viewer
• Updated • 3.63k • 7
sert121/spambase_dataset_balanced_text_serialized
Viewer
• Updated • 3.26k • 4
sert121/synthetic_data_textual
Viewer
• Updated • 9.54k • 6
sert121/synthetic_data_textual_leavingT_Q_W_O_V_U_X
Viewer
• Updated • 9.54k • 4
sert121/synthetic_data_textual_leaving_T
Viewer
• Updated • 9.54k • 7
sert121/synthetic_data_textual_leaving_T_Q
Viewer
• Updated • 9.54k • 6
sert121/synthetic_data_textual_leaving_T_Q_W
Viewer
• Updated • 9.54k • 4
sert121/synthetic_data_textual_leaving_T_Q_W_L
Viewer
• Updated • 9.54k • 6
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2
Viewer
• Updated • 9.54k • 4
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O
Viewer
• Updated • 9.54k • 7
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O_V
Viewer
• Updated • 9.54k • 5
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O_V_U
Viewer
• Updated • 9.54k • 5
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O_V_U_X
Viewer
• Updated • 9.54k • 5
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O_V_U_X_A
Viewer
• Updated • 9.54k • 4
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O_V_U_X_A_Z
Viewer
• Updated • 9.54k • 6
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O_V_U_X_A_Z_R
Viewer
• Updated • 9.54k • 4
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O_V_U_X_A_Z_R_B
Viewer
• Updated • 9.54k • 4
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O_V_U_X_A_Z_R_B_S
Viewer
• Updated • 9.54k • 4
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O_V_U_X_A_Z_R_B_S_M
Viewer
• Updated • 9.54k • 4
sert121/synthetic_data_textual_leaving_T_Q_W_L_N2_O_V_U_X_A_Z_R_B_S_M_P
Viewer
• Updated • 9.54k • 5
sert121/synthetic_data_textual_leaving_T_Q_W_O_V_U_X
Viewer
• Updated • 9.54k • 6
shawhin/ai-job-embedding-finetuning
Viewer
• Updated • 1.01k • 38
• 4
shawhin/phishing-site-classification
Viewer
• Updated • 3k • 131
• 8
skbose/indian-english-nptel-v0-tags-gender-accent-text
Viewer
• Updated • 544k • 11
skbose/indian-english-nptel-v0-tags-gender-text
Viewer
• Updated • 544k • 14
skbose/indian-english-nptel-v0-tags-text
Viewer
• Updated • 544k • 8
swagat-panda/POS_language_detect_tagged
thangvip/cls-processes-embedding-0909
Viewer
• Updated • 71.9k • 4
thangvip/cls-processes-embedding-2608
Viewer
• Updated • 12.4k • 4
thangvip/cls-processes-embedding-400k
Viewer
• Updated • 446k • 3
thangvip/cls-processes-embedding-678
Viewer
• Updated • 119k • 4
thangvip/cls-processes-embedding-t7
Viewer
• Updated • 122k • 4
thangvip/cls-processes-embedding-t8
Viewer
• Updated • 69.2k • 3
thangvip/combined-vietnamese-legal-text
Viewer
• Updated • 215k • 20
• 1
Viewer
• Updated • 329k • 10
thangvip/legal-documents-splits-filtered
Viewer
• Updated • 207k • 6
thangvip/legal-documents-splitted
Viewer
• Updated • 2.93M • 30
thangvip/legaldocuments-nli-test
Viewer
• Updated • 1.92k • 3
thangvip/legaldocuments-nli-test-v2
Viewer
• Updated • 1.42k • 3
thangvip/legaldocuments-nli-test-v3
Viewer
• Updated • 1.42k • 15
tum-nlp/German4All-Corpus
Preview
• Updated • 110
• 2
Updated • 38
• 9
Viewer
• Updated • 77.4k • 29
tum-nlp/neural-news-benchmark
Viewer
• Updated • 27.2k • 502
• 4
tum-nlp/sexism-socialmedia-balanced
Viewer
• Updated • 20.1k • 27
• 2
tum-nlp/span-similarity-dataset
Viewer
• Updated • 1k • 50
Viewer
• Updated • 144k • 13
valurank/News_Articles_Categorization
Viewer
• Updated • 3.72k • 109
• 5
Viewer
• Updated • 13.4k • 7
• 1
valurank/Topic_Classification
Viewer
• Updated • 22.5k • 72
• 4
Viewer
• Updated • 81 • 24
voidful/earica_text_train
Viewer
• Updated • 497k • 2
waifu-research-department/embeddings
Updated • 26
• 3
Viewer
• Updated • 367 • 59
• 1
wandb/ragbench-sentence-relevance-balanced
Viewer
• Updated • 624k • 47
• 1
wandb/weave_cookbook_datasets
wikimedia/structured-wikipedia
Viewer
• Updated • 10.5M • 16.1k
• 387
Viewer
• Updated • 61.6M • 179k
• 1.26k
wow2000/japanese_fake_news
Viewer
• Updated • 6.85k • 6
xinzhang/wikipedia_summary
Preview
• Updated • 10
• 1
ywan111/macbook-dataset-b1
ywan111/macbook-dataset-b2
ywan111/macbook-dataset-b3
ywan111/macbook-dataset-b4
ywan111/macbook-dataset-b5