Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deqing 's Collections
Fourier Language Model
Convergent Evolution
Convergent Evolution (Addition)
Convergent Evolution (Architecture and Optimizer)
Convergent Evolution (Data)

Convergent Evolution (Architecture and Optimizer)

updated 30 days ago
Upvote
-

  • deqing/convergent-llama-300M-muon-original

    Text Generation • 0.3B • Updated Mar 29 • 95

  • deqing/convergent-gdn-300M-muon-original

    Text Generation • 0.3B • Updated Mar 29 • 32

  • deqing/convergent-mamba2-300M-muon-original

    Text Generation • 0.3B • Updated Mar 29 • 30

  • deqing/convergent-lstm-4layer-muon-original

    Text Generation • 0.2B • Updated Mar 29 • 32

  • deqing/convergent-lstm-12layer-muon-original

    Text Generation • 0.2B • Updated Mar 29 • 31

  • deqing/convergent-llama-300M-adamw-original

    Text Generation • 0.3B • Updated Mar 29 • 80

  • deqing/convergent-gdn-300M-adamw-original

    Text Generation • 0.3B • Updated Mar 29 • 33

  • deqing/convergent-mamba2-300M-adamw-original

    Text Generation • 0.3B • Updated Mar 29 • 72
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs