Finnish-NLP's Collections

Continued pretrain research datasets

updated Oct 29, 2025

  • Finnish-NLP/Culturax_Finnish_fineweb_edu_predicted

    Updated Nov 5, 2024 • 28.8M rows • 1.29k downloads

  • Finnish-NLP/Reddit_Finnish_fineweb_edu_predicted

    Updated Jan 9, 2025 • 3.96M rows • 1.12k downloads

  • Finnish-NLP/HPLT_Finnish_fineweb_edu_predicted

    Updated Jan 9, 2025 • 5.11M rows • 48 downloads

  • Finnish-NLP/Wikipedia_20231101_Finnish_cleaned_fineweb_edu_predicted

    Updated Jan 10, 2025 • 410k rows • 156 downloads

  • Finnish-NLP/Fineweb2_Finnish_fineweb_edu_predicted

    Updated Jun 5, 2025 • 33.2M rows • 33 downloads

  • Finnish-NLP/finepdf_fi_edu_score_topic_classified

    Updated Sep 14, 2025 • 1.98M rows • 37 downloads