unconst/souped_slerp_a20

Safetensors
deepseek_v3
  • 1 contributor
History: 3 commits
unconst
iter73: add num_local_experts/num_shared_experts aliases (validator MoE accounting)
e905e14 verified 10 days ago
  • .gitattributes (1.57 kB)
    iter68 souped_slerp_a20 SLERP alpha=0.20 (hope_king<->new_king) merged via merge_slerp.py; probe top1=0.5677 (10 days ago)
  • chat_template.jinja (4.02 kB)
    iter68 souped_slerp_a20 SLERP alpha=0.20 (hope_king<->new_king) merged via merge_slerp.py; probe top1=0.5677 (10 days ago)
  • config.json (1.37 kB)
    iter73: add num_local_experts/num_shared_experts aliases (validator MoE accounting) (10 days ago)
  • generation_config.json (146 Bytes)
    iter68 souped_slerp_a20 SLERP alpha=0.20 (hope_king<->new_king) merged via merge_slerp.py; probe top1=0.5677 (10 days ago)
  • model.safetensors (31.9 GB, xet)
    iter68 souped_slerp_a20 SLERP alpha=0.20 (hope_king<->new_king) merged via merge_slerp.py; probe top1=0.5677 (10 days ago)
  • tokenizer.json (19.5 MB, xet)
    iter68 souped_slerp_a20 SLERP alpha=0.20 (hope_king<->new_king) merged via merge_slerp.py; probe top1=0.5677 (10 days ago)
  • tokenizer_config.json (645 Bytes)
    iter68 souped_slerp_a20 SLERP alpha=0.20 (hope_king<->new_king) merged via merge_slerp.py; probe top1=0.5677 (10 days ago)
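
The commit messages above describe the checkpoint as a SLERP merge with alpha=0.20 between two parents (hope_king and new_king). The repository does not show merge_slerp.py itself, so the following is only a minimal sketch of textbook spherical linear interpolation applied to one flattened weight tensor. The function name `slerp`, the `eps` threshold, the fallback to linear interpolation, and the convention that alpha=0 returns the first parent are all assumptions for illustration, not the author's implementation.

```python
import math


def slerp(a, b, alpha, eps=1e-8):
    """Spherically interpolate between two flat weight vectors (hypothetical helper).

    alpha=0 returns `a`, alpha=1 returns `b` (assumed convention; the
    direction used by merge_slerp.py is not stated in the source).
    """
    def lerp():
        # Plain linear interpolation, used when the angle is degenerate.
        return [(1.0 - alpha) * x + alpha * y for x, y in zip(a, b)]

    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a < eps or norm_b < eps:
        return lerp()

    # Angle between the two vectors, with the cosine clamped for safety.
    cos_omega = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    cos_omega = max(-1.0, min(1.0, cos_omega))
    omega = math.acos(cos_omega)
    sin_omega = math.sin(omega)
    if abs(sin_omega) < eps:
        # Nearly parallel or antiparallel vectors: SLERP weights are ill-defined.
        return lerp()

    w_a = math.sin((1.0 - alpha) * omega) / sin_omega
    w_b = math.sin(alpha * omega) / sin_omega
    return [w_a * x + w_b * y for x, y in zip(a, b)]


# Toy usage with two orthogonal unit vectors and the alpha from the commit message.
merged = slerp([1.0, 0.0], [0.0, 1.0], alpha=0.20)
```

In a real merge this would typically be applied per tensor across both checkpoints, with each tensor flattened before interpolation and reshaped afterwards; unlike plain averaging, the result for two unit-norm inputs stays on the unit sphere.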