Automatic Metadata Generation and Extraction datasets Collection Datasets which can help train or evaluate various approaches to automatic metadata generation and extraction. • 4 items • Updated Oct 16, 2025 • 4
Historic Newsaper Datasets Collection Historic Newspaper Datasets on the Hub • 16 items • Updated May 8, 2025 • 6
ARK Annif Models Collection Contains 5 Annif models for the languages German, Latin, English, French and multilingual. • 5 items • Updated Oct 20, 2025 • 2
Format: CSV and TSV Collection 6 datasets showcase how to configure and load CSV and TSV files. • 6 items • Updated Nov 23, 2023 • 9
File names and splits Collection 8 datasets showcase the diversity of splits configuration on HuggingFace. See docs: https://huggingface.co/docs/hub/datasets-file-names-and-splits. • 8 items • Updated Nov 22, 2023 • 10
Manual Configuration Collection 5 datasets showcase YAML configuration on HuggingFace. See docs: https://huggingface.co/docs/hub/datasets-manual-configuration. • 5 items • Updated Nov 23, 2023 • 7
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated Jan 6, 2025 • 149