
Michael Fromm

mfromm
https://fromm-m.github.io/fromm/
  • effi288
  • fromm-m
  • michael-fromm-a2069772

AI & ML interests

NLP, LLM, ConvAI

Recent Activity

published a dataset 4 days ago
openGPT-X/leaderboard_data
upvoted a collection 5 days ago
Nemotron-Pre-Training-Datasets
updated a dataset 12 days ago
Eurolingua/hplt3_edu_scores

Organizations

  • Fraunhofer Institute for Intelligent Analysis and Information Systems
  • OpenGPT-X
  • Lamarr
  • Modalities
  • EuroLingua-GPT
  • EuropeanLLM-Beta
  • EuropeanLLM-Eval
  • Lamarr LLM Development
  • TrustLLM EU
  • OpenGPT-X
  • JupiterAI
  • JQL-AI
  • JackalFactory
  • Jackal-Factory
  • Jackal-AI
  • Stealth Nomis

upvoted a collection 5 days ago

Nemotron-Pre-Training-Datasets

Collection
Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 6 days ago • 94
upvoted a paper 3 months ago

Tokenizer Choice For LLM Training: Negligible or Crucial?

Paper • 2310.08754 • Published Oct 12, 2023 • 3
upvoted an article 7 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Jul 8, 2025 • 751
upvoted a paper 8 months ago

Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models

Paper • 2505.22232 • Published May 28, 2025 • 18
upvoted a collection over 1 year ago

EU20-Benchmarks

Collection
Evaluation Benchmarks for 20 European languages. • 5 items • Updated Oct 11, 2024 • 9