somosnlp (SomosNLP)

posted an update about 2 months ago

Post

448

Open agents on AWS SageMaker AI with open models from the Hugging Face Hub!

> Deploy an open model from the Hugging Face Hub on SageMaker AI
> Connect the deployed model to Strands Agents
> Add built-in and custom tools for tool calling
> Expose external capabilities through MCP integration
> Bonus: talk to your agent and visualize traces with Gradio

https://alvarobartt.com/agents-on-aws-sagemaker

alvarobartt

posted an update about 2 months ago

Post

3347

Latest hf-mem release added a breakdown of Mixture-of-Experts (MoE) memory usage!

TL; DR MoEs can be misleading to reason about from active parameters alone, since each token only activates a subset of experts, while the serving setup still needs to account for the full resident memory footprint.

🧠 hf-mem now splits MoE memory into base model weights, routed experts, and KV cache
🏗️ Dense models usually load and use most weights every forward pass, while MoEs load many experts but only route each token to a few of them
⚡ Active params isn't the same as memory footprint, especially for sparse architectures
📦 Runtime memory is about what is used per request/token, while loading memory also includes the expert weights that need to be resident
📚 KV cache can still dominate depending on context length, batch size, and concurrency
🔀 Expert Parallelism (EP) helps shard experts across accelerators when expert weights dominate
🚀 Data Parallelism (DP) + EP is often a good fit for throughput-oriented MoE serving

Check the repository at https://github.com/alvarobartt/hf-mem

alvarobartt

posted an update 4 months ago

Post

3755

Learn how to deploy Microsoft Research VibeVoice ASR on Microsoft Azure Foundry with Hugging Face to generate rich audio transcriptions with Who, When, and What! 💥

> 🕒 60-minute single-pass processing, no chunking or stitching
> 👤 Customized hotwords to guide recognition on domain-specific content
> 📝 Rich transcription: joint ASR + diarization + timestamping in one pass
> 🌍 50+ languages with automatic detection and code-switching support
> 🤗 Deployed on Microsoft Foundry via an OpenAI-compatible Chat Completions API

https://huggingface.co/docs/microsoft-azure/foundry/examples/deploy-vibevoice-asr

alvarobartt

posted an update 5 months ago

Post

3297

💥 hf-mem v0.4.1 now also estimates KV cache memory requirements for any context length and batch size with the --experimental flag!

uvx hf-mem --model-id ... --experimental will automatically pull the required information from the Hugging Face Hub to include the KV cache estimation, when applicable.

💡 Alternatively, you can also set the --max-model-len, --batch-size and --kv-cache-dtype arguments (à la vLLM) manually if preferred.

1 reply

·

reddrex

updated a dataset 6 months ago

somosnlp/LingComp_QA

Viewer • Updated Jan 15 • 1k • 58 • 1

davidquicast

posted an update 7 months ago

Post

4306

Check out your 2025 Hugging Face Wrapped, a small experimental recap
hf-wrapped/2025

3 replies

·

mariagrandury

updated a dataset 10 months ago

somosnlp/recursos-pln-es

Viewer • Updated Sep 18, 2025 • 183 • 82 • 1

mariagrandury

published a dataset 10 months ago

somosnlp/recursos-pln-es

Viewer • Updated Sep 18, 2025 • 183 • 82 • 1

mariagrandury

updated a dataset 10 months ago

somosnlp/recursos-pln-es-models

Viewer • Updated Sep 16, 2025 • 22 • 9

mariagrandury

published a dataset 10 months ago

somosnlp/recursos-pln-es-models

Viewer • Updated Sep 16, 2025 • 22 • 9

davidquicast

posted an update 11 months ago

Post

2899

Just applied for HF Community Grant for “Hugging Research” — a lightweight CodeAgent‑based research assistant built on Hugging Face’s Open Deep Research project for the Hugging Face Hub (models, datasets, Spaces, users, collections, papers). It gathers links via dedicated tools and organizes them for easy review.

As this is for the community, comments and suggestions are appreciated: https://huggingface.co/spaces/daqc/hugging-research/discussions/1#68a94d9bcb035c54bc671119

mariagrandury

updated a Space 11 months ago

Leaderboard Retos Hackathon SomosNLP 2025

🏆

1

Leaderboard Retos Hackathon SomosNLP 2025

frascuchon

posted an update about 1 year ago

Post

2854

Extended Dataset with Sheets 🚀

I used Sheets to extend the fka/awesome-chatgpt-prompts dataset with a single prompt 💡. Check out the result: https://huggingface.co/datasets/frascuchon/extended_fka_awesome_chatgpt_prompts

Try Sheets to expand your datasets: aisheets/sheets 🛠️

frascuchon

posted an update about 1 year ago

Post

839

🚀 Are you ready to take control of your data? 📊 Follow the step-by-step guide to setup and run Sheets locally on your own machine 🖥️!

💻 Click the link to get started and become a Sheets master 🎯!
👉 https://huggingface.co/blog/frascuchon/running-sheets-locally

Try Sheet 👉

aisheets

mariagrandury

published a dataset about 1 year ago

somosnlp/babylm-es

Updated Jun 19, 2025 • 3

frascuchon

posted an update about 1 year ago

Post

2889

Extending datasets just got a whole lot easier! 🚀 With Sheets, I was able to create a Spanish version of the popular fka/awesome-chatgpt-prompts dataset in just a few minutes ⏱️.

Check out the resulting dataset: frascuchon/fka_awesome_chatgpt_es 📊

Want to try it out for yourself? Head over to the Sheets space and see how easy it is to extend and modify existing datasets 🤯. The possibilities are endless! 🌐

frascuchon

posted an update about 1 year ago

Post

1352

Unlock the full potential of your datasets with SHEETS! It's incredibly easy to extend existing datasets and unlock new insights.

Leverage open-source models to translate, summarize, classify, and more - all directly within your existing columns.

Ready to give it a try? Explore the possibilities here: aisheets/sheets

2 replies

·

frascuchon

posted an update about 1 year ago

Post

3020

Hey! I built RAG MCP Server Space, a simple Gradio MCP server for RAG systems that allows you to search relevant results without passing huge contexts to your LLM.

You can use this space to integrate with your agents and improve the efficiency of your search results. Feel free to try it out and let me know if you have any feedback or questions!

frascuchon/rag-mcp-server

Thanks for checking it out!