
Keaton Elvins PRO

keatone

AI & ML interests

None yet

Organizations

camfer, Hugging Face MCP Course, Scratch to Scale

upvoted a collection 3 months ago

Common Pile v0.1 Filtered Data

Collection
An LLM pre-training dataset produced by filtering and deduplicating the raw text collected in the Common Pile v0.1 • 31 items • Updated Jun 6, 2025 • 21
upvoted a collection 4 months ago

SmolLM3 pretraining datasets

Collection
datasets used in SmolLM3 pretraining • 15 items • Updated Aug 12, 2025 • 44
upvoted an article 5 months ago
Article

Gotchas in Tokenizer Behavior Every Developer Should Know

Apr 18, 2025 • 69
upvoted a paper 5 months ago

FLARE: Fast Low-rank Attention Routing Engine

Paper • 2508.12594 • Published Aug 18, 2025 • 7
upvoted an article 6 months ago
Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29, 2025 • 211
upvoted a collection 8 months ago

Deepseek Papers

Collection
Deepseek papers collection • 28 items • Updated about 12 hours ago • 317
upvoted a collection 11 months ago

olmOCR

Collection
olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 12 items • Updated Dec 23, 2025 • 145