Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding Paper โข 2606.21906 โข Published 6 days ago โข 20 โข 13
Iwaku-Real/Qwen3-0.6B-Base-heretic-test Text Generation โข 0.6B โข Updated 6 days ago โข 164 โข 1
view post Post 10925 THIS IS CRAZY! THE MODEL ON THE IMAGE(Supra-50M-Reasoning) answered correctly and its QUANTIZED IN 2BIT! THE RESPONSE IS CORRECT, IN A 15MB SIZE FILE! See translation 14 replies ยท ๐ฅ 31 31 ๐ 10 10 ๐ง 4 4 ๐ 2 2 + Reply
Iwaku-Real/Qwen3-0.6B-Base-heretic-test Text Generation โข 0.6B โข Updated 6 days ago โข 164 โข 1