@sharpenb on Hugging Face: "We compressed SmolLMs to make 135 variations of them (see…"

sharpenb

posted an update Jan 22, 2025

We compressed SmolLMs into 135 variants (see PrunaAI) using different quantization configurations with pruna (https://docs.pruna.ai/en/latest/).

We wrote a blog post summarizing our findings (see https://www.pruna.ai/blog/smollm2-smaller-faster): small LMs can be made even smaller! :)

AtAndDev

Jan 23, 2025

That non-centered emoji...
But cool blog

sugatoray

Jan 24, 2025 (edited Jan 24, 2025)

I tried using pruna in Google Colab from one of their Colab-friendly notebooks. For now it does not support any OS other than Linux. Generating a token from Colab failed multiple times (trying with token=None) while using the smash() function.

It would certainly help to have a web UI where users can generate tokens and revoke them if need be.

cc: @prunaaitools

sharpenb

Jan 24, 2025 (edited Jan 24, 2025)

Thanks for the notification! Indeed, we support only Linux for now. We would be happy to work on the token problem. If you could share your Colab config and the trace here or on our Discord, it would help us :)
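For anyone hitting the same issue, here is a minimal sketch of how the environment details and trace requested above could be gathered in a notebook cell. It uses only the Python standard library; the helper name `collect_env_report` is hypothetical and is not part of pruna.

```python
import platform
import sys
import traceback


def collect_env_report(exc=None):
    """Return a plain-text summary of the runtime, plus a traceback if one is given.

    Hypothetical helper for bug reports; not part of pruna.
    """
    lines = [
        f"os: {platform.system()} {platform.release()}",
        f"machine: {platform.machine()}",
        f"python: {sys.version.split()[0]}",
        # The thread states that only Linux is supported for now.
        f"is_linux: {platform.system() == 'Linux'}",
    ]
    if exc is not None:
        lines.append("traceback:")
        lines.append(
            "".join(traceback.format_exception(type(exc), exc, exc.__traceback__))
        )
    return "\n".join(lines)
```

One way to use it is to wrap the failing call in `try`/`except Exception as e:` and paste the output of `print(collect_env_report(e))` into the reply, after checking that it contains no tokens or other secrets.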
