Efficient Large Language Models Collection Code is available at: https://github.com/c2d-usp/Efficient-LLMs-with-AMP/tree/main • 24 items • Updated 30 days ago
c2d-usp/AMP_llama_3.3_70B_quant_40_percent_compression_without_finetuning 44B • Updated 30 days ago • 298
c2d-usp/AMP_llama_3.3_70B_quant_40_percent_compression_without_finetuning 44B • Updated 30 days ago • 298
Compressing LLMs with MoP: Mixture of Pruners Collection Code is available at: https://github.com/c2d-usp/Efficient-LLMs-with-MoP • 11 items • Updated Feb 12