perf: remove CPU-GPU sync bottleneck in SharedMoE routing loop a6e6ea4 verified anthonym21 committed on Feb 22
Restore full README with training history, swarm table, and specialist status cf6f81c verified anthonym21 committed on Feb 17
Add instruction-tuned weights (3 epochs on alpaca-cleaned) d46c8c6 verified anthonym21 committed on Feb 17
Fix: move super().__init__() before attribute assignments to prevent PretrainedConfig clobbering MoE top_k 09b1451 verified anthonym21 committed on Feb 15
Eve-2-MoE-IT-272M: heavy IT patch (open-perfectblend, LoRA r=128, merged) 5e78b3d verified anthonym21 committed on Feb 7