Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
German Tokenizer Benchmark
community
Activity Feed
Follow
2
AI & ML interests
German, Tokenizer, Benchmark
Recent Activity
stefan-it
submitted
a paper
about 21 hours ago
Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling
stefan-it
submitted
a paper
3 months ago
FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale
stefan-it
updated
a Space
6 months ago
german-tokenizer-benchmark/README
View all activity
Team members
1
german-tokenizer-benchmark
's models
None public yet