YaPO: Learnable Sparse Activation Steering Vectors for Domain Adaptation Paper • 2601.08441 • Published Jan 13 • 8
YaPO: Learnable Sparse Activation Steering Vectors for Domain Adaptation Paper • 2601.08441 • Published Jan 13 • 8
Wasm: A Pipeline for Constructing Structured Arabic Interleaved Multimodal Corpora Paper • 2511.07080 • Published Nov 10, 2025 • 33
view post Post 1346 The #1 trending AI/ML dataset today 🏆Massive scale, diversity and end-to-end potential from nvidia ! nvidia/PhysicalAI-Autonomous-Vehicles See translation 🔥 1 1 + Reply
view post Post 809 The new King 👑has arrived! Moonshot AI now the top model on Hugging Face 🔥 moonshotai/Kimi-K2-Thinking See translation 🔥 1 1 🤗 1 1 + Reply
view post Post 2870 💸🤑You don’t need 100 GPUs to train something amazing!Our Smol Training Playbook teaches you a better path to world-class LLMs, for free! Check out the #1 trending space on 🤗 : HuggingFaceTB/smol-training-playbook See translation 🤗 7 7 🚀 3 3 🔥 2 2 + Reply
Shorter but not Worse: Frugal Reasoning via Easy Samples as Length Regularizers in Math RLVR Paper • 2511.01937 • Published Nov 2, 2025 • 16
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 24
MeXtract: Light-Weight Metadata Extraction from Scientific Papers Paper • 2510.06889 • Published Oct 8, 2025 • 1
view post Post 2349 Cool stuff these past weeks on huggingface! 🤗 🚀 !• 📈Trackio, local-first W&B alternativehttps://github.com/gradio-app/trackio/issues• 🌍EmbeddingGemma, 300M-param, multilingual embeddings, on-devicehttps://huggingface.co/blog/embeddinggemma• 💻Open LLMs in VS Code (Inference Providers)https://x.com/reach_vb/status/1966185427582497171• 🤖Smol2Operator GUI agentshttps://huggingface.co/blog/smol2operator• 🖼️Gradio visible watermarkinghttps://huggingface.co/blog/watermarking-with-gradio See translation 🔥 4 4 🤗 3 3 + Reply
Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR Paper • 2509.18174 • Published Sep 17, 2025 • 134
Saudi-Dialect-ALLaM: LoRA Fine-Tuning for Dialectal Arabic Generation Paper • 2508.13525 • Published Aug 19, 2025 • 1
Nile-Chat: Egyptian Language Models for Arabic and Latin Scripts Paper • 2507.04569 • Published Jul 6, 2025 • 23
Gazal-R1: Achieving State-of-the-Art Medical Reasoning with Parameter-Efficient Two-Stage Training Paper • 2506.21594 • Published Jun 18, 2025 • 8
Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi Paper • 2504.06011 • Published Apr 8, 2025 • 2
Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model Paper • 2505.17894 • Published May 23, 2025 • 220
Masader: Metadata Sourcing for Arabic Text and Speech Data Resources Paper • 2110.06744 • Published Oct 13, 2021