Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jadon's picture
5 13

Jadon

jadodev
  • phase

AI & ML interests

Machine Learning, Programming Language Theory, Category Theory, Quantum Computing

Recent Activity

liked a model about 5 hours ago
ByteDance/Ouro-1.4B
liked a Space about 6 hours ago
HuggingFaceTB/smol-training-playbook
liked a model 1 day ago
HuggingFaceTB/FineMath-Llama-3B
View all activity

Organizations

None yet

Collections 1

transformer
  • GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

    Paper • 2403.03507 • Published Mar 6, 2024 • 189
  • Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

    Paper • 2404.02258 • Published Apr 2, 2024 • 107
transformer
  • GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

    Paper • 2403.03507 • Published Mar 6, 2024 • 189
  • Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

    Paper • 2404.02258 • Published Apr 2, 2024 • 107

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs