Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Fsoft-AIC
/
backup_checkpoints_libmoe

Image-Text-to-Text
TensorBoard
Safetensors
English
Model card Files Files and versions
xet
Metrics Training metrics Community
6187
backup_checkpoints_libmoe / Pretrain_language_model /save_final
566 GB
  • 1 contributor
History: 21 commits
DavidNguyen's picture
DavidNguyen
b081038d5aa4ee61ce6c3909b628ce200e914ec9497f7bc96f9d084c45a3188d
930104f verified 6 months ago
  • slimpajama_moe_no_attmoe_154M_deepseek_highlb_shared_only_v2
    Upload folder using huggingface_hub (#276) 6 months ago
  • slimpajama_moe_no_attmoe_154M_deepseek_sigmoidonly_v2
    Upload folder using huggingface_hub (#265) 6 months ago
  • slimpajama_moe_no_attmoe_154M_sigmoid_standard_lb_v2
    Upload folder using huggingface_hub (#268) 6 months ago
  • slimpajama_moe_no_attmoe_154M_standard_lb_v2
    Upload folder using huggingface_hub (#274) 6 months ago
  • slimpajama_moe_no_attmoe_660M_sigmoid_standardlb
    b081038d5aa4ee61ce6c3909b628ce200e914ec9497f7bc96f9d084c45a3188d 6 months ago
  • slimpajama_moe_no_attmoe_660M_sigmoid_standardlb_v2
    Upload folder using huggingface_hub (#279) 6 months ago
  • slimpajama_moe_no_attmoe_660M_standardlb_deepseek_shared_only
    Upload folder using huggingface_hub (#263) 6 months ago
  • slimpajama_moe_no_attmoe_660M_standardlb_deepseek_sigmoidonly_v2
    Upload folder using huggingface_hub (#271) 6 months ago