Sara Han Díaz
sdiazlor
AI & ML interests
Data curation and generation, RLHF, RAG, Prompt Engineering
Recent Activity
posted an
update
3 days ago
More OSS than ever with the latest pruna 0.3.2 release. It extends existing algorithm families, such as compilers, kernels, and pruners, and adds new ones, including decoders, distillers, enhancers, and recoverers. But it's not only a collection of algorithms; instead, you can easily combine them to get the biggest efficiency win.
Read the full blog here: https://huggingface.co/blog/PrunaAI/pruna-0-3-2-open-source-optimization-algorithms upvoted an article 3 days ago
KV Caching Explained: Optimizing Transformer Inference Efficiency published an
article
3 days ago
Pruna 0.3.2: More OSS Algos, More Ways to Optimize