Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
tzervas
/
bwsk-switch-base-8
like
0
Summarization
Transformers
wikitext
bwsk
combinator-analysis
Mixture of Experts
reversible-backprop
convergence-training
Eval Results (legacy)
License:
mit
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
bwsk-switch-base-8
2.17 MB
1 contributor
History:
4 commits
tzervas
Add BWSK model card
791a08b
verified
11 days ago
.gitattributes
Safe
1.52 kB
initial commit
11 days ago
README.md
6.3 kB
Add BWSK model card
11 days ago
results.json
2.16 MB
Add aggregated training results
11 days ago