Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published Feb 11 • 193
The Smol Training Playbook 📚 • The secrets to building world-class LLMs • Featured • Running on CPU Upgrade • 3.07k
Farseer-Scaling-Law/step2v2_0726_2049_sc_h1920_ffnh5184_numh30_numl30_lr2.11e-03_bs1280_ti164237 Updated Jun 19, 2025
Farseer-Scaling-Law/step2v2_0726_2049_sc_h3328_ffnh9136_numh52_numl47_lr4.29e-04_bs512_ti86316 Updated Jun 19, 2025
Farseer-Scaling-Law/step2v2_0726_2049_sc_h2176_ffnh5712_numh34_numl34_lr1.48e-03_bs1024_ti145166 Updated Jun 19, 2025