Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jadon's picture
5 7

Jadon

jadodev
  • phase

AI & ML interests

Machine Learning, Programming Language Theory, Category Theory, Quantum Computing

Recent Activity

upvoted a paper about 1 month ago
Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset
liked a model 9 months ago
deepseek-ai/DeepSeek-V3-0324
updated a collection over 1 year ago
transformer
View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Nemotron-CC-Math: A 133 Billion-Token-Scale High Quality Math Pretraining Dataset

Paper • 2508.15096 • Published Aug 20 • 4
upvoted a paper over 1 year ago

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2, 2024 • 107
upvoted 2 papers almost 2 years ago

Teaching Large Language Models to Reason with Reinforcement Learning

Paper • 2403.04642 • Published Mar 7, 2024 • 50

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189
upvoted a collection almost 2 years ago

Frankenmodels

Collection
They're not supposed to be that size! Neat, right? • 8 items • Updated Dec 12, 2023 • 3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs