Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
theapemachine
/
sparse-transformer-experiments
like
0
Model card
Files
Files and versions
xet
Community
main
sparse-transformer-experiments
/
paper
20.6 kB
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
theapemachine
Major revision: add phantom momentum ablation, compute-matched baselines, multi-seed predictor accuracy
96bc237
verified
7 days ago
main.tex
Safe
20.6 kB
Major revision: add phantom momentum ablation, compute-matched baselines, multi-seed predictor accuracy
7 days ago