Post
1940
We compressed SmolLMs to make 135 variations of them (see
PrunaAI
) with different quantization configurations with
We made a blog to summarize our findings (see https://www.pruna.ai/blog/smollm2-smaller-faster) and small LM can be made smaller! :)
pruna (https://docs.pruna.ai/en/latest/). We made a blog to summarize our findings (see https://www.pruna.ai/blog/smollm2-smaller-faster) and small LM can be made smaller! :)