ML Chair Internal Org
university
Recent Activity
PatrickHaller authored a paper 2 days ago: "Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling"
aynetdia authored a paper 3 days ago: "Pre-Training Curriculum for Multi-Token Prediction in Language Models"
aynetdia authored a paper 4 days ago: "Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling"
Team members: 8
HU-Berlin-ML-Internal's datasets: 2
HU-Berlin-ML-Internal/fineweb-2-edu-50k-annotated-llama-test • Viewer • Updated Jul 7, 2025 • 5k • 36
HU-Berlin-ML-Internal/toxicity-dataset • Viewer • Updated Jun 11, 2024 • 9.58k • 13