Anon
Sparsity-Moves-Computation
AI & ML interests
Checkpoints for the paper: Sparsity Moves Computation: How FFN Architecture Reshapes Attention in Small Transformers
Recent Activity
updated a model 21 days ago
Sparsity-Moves-Computation/moe-redistribution-checkpoints published a model 21 days ago
Sparsity-Moves-Computation/moe-redistribution-checkpointsOrganizations
None yet