Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Sssplendid
/
opentome_optimizer_comparisons

Safetensors
Model card Files Files and versions
xet
Community
opentome_optimizer_comparisons
Ctrl+K
Ctrl+K
  • 1 contributor
History: 79 commits
Sssplendid's picture
Sssplendid
Add 1b_archs_fwe/transformer_1b_fwe_soap_pdim2048_pfreq10_lr3e_3_b1_0_9_b2_0_95_eps_1e_15_20260515_195110
b12805a verified 1 day ago
  • 1b_archs_fwe
    Add 1b_archs_fwe/transformer_1b_fwe_soap_pdim2048_pfreq10_lr3e_3_b1_0_9_b2_0_95_eps_1e_15_20260515_195110 1 day ago
  • 340m_archs_lrtuning
    Add 340m_archs_lrtuning/deltanet_340m_fwe_marsadamw_lr3e_3_b1_0_95_b2_0_99_eps_1e_8_20260504_063226 3 days ago
  • gated_deltanet_1b_v3
    Add gated_deltanet_1b_v3/gated_deltanet_1b_adamw_lr1e_3_b1_0_9_b2_0_99_eps_1e_8_20260503_013622 2 days ago
  • gated_deltanet_340m_v3
    Add gated_deltanet_340m_v3/gated_deltanet_340m_mars_adamw_lr5e_3_b1_0_95_b2_0_99_eps_1e_15_20260503_225007 2 days ago
  • transformer_pp_1b_c4
    Add transformer_pp_1b_c4/transformer_pp_1b_c4_valc4_soap_pdim2048_pfreq10_lr3e_3_b1_0_9_b2_0_95_eps_1e_15_20260508_191338 2 days ago
  • transformer_pp_340m_c4
    Add transformer_pp_340m_c4/transformer_pp_340m_c4_valc4_lion_lr1e_4_b1_0_9_b2_0_99_eps_1e_15_20260427_054600 2 days ago
  • .gitattributes
    130 kB
    Add 1b_archs_fwe/transformer_1b_fwe_soap_pdim2048_pfreq10_lr3e_3_b1_0_9_b2_0_95_eps_1e_15_20260515_195110 1 day ago