Resa-Yi
/

DeepScaleR-Trained-from-Scratch-SAE-R1-Distill-Qwen-1.5B-65k

Model card Files Files and versions

DeepScaleR-Trained-from-Scratch-SAE-R1-Distill-Qwen-1.5B-65k

1.61 GB

2 contributors

History: 3 commits

farukakgul

add trained-from-scratch saes around ckpt 1000

d4b4907 9 months ago