Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

J10Official
/
sparse-moe-Token-Choice-GQA

Model card Files Files and versions
xet
Community
sparse-moe-Token-Choice-GQA
783 MB
  • 1 contributor
History: 7 commits
J10Official's picture
J10Official
Update README.md
da5983e verified 2 months ago
  • .gitattributes
    1.52 kB
    initial commit 2 months ago
  • README.md
    168 Bytes
    Update README.md 2 months ago
  • Token-Choice-GQA_retrain_results.json
    657 Bytes
    Upload Token-Choice-GQA_retrain_results.json with huggingface_hub 2 months ago
  • Token-Choice-GQA_retrained.pt
    391 MB
    xet
    Upload Token-Choice-GQA_retrained.pt with huggingface_hub 2 months ago
  • Token-Choice-GQA_train_results.json
    655 Bytes
    Upload Token-Choice-GQA_train_results.json with huggingface_hub 2 months ago
  • Token-Choice-GQA_trained.pt
    391 MB
    xet
    Upload Token-Choice-GQA_trained.pt with huggingface_hub 2 months ago