Running Agents Featured 262 Qwen3 ASR Demo 👀 262 Convert uploaded audio to text with language detection
view article Article LeMaterial: an open source initiative to accelerate materials discovery and research +8 AlexDuvalinho, lritchie, msiron, inelgnu, etiennedufayet, amandinerossello, Ramlaoui, IAMJB, lvwerra, thomwolf • Dec 10, 2024 • 56
view article Article Finally, a Replacement for BERT: Introducing ModernBERT +13 bwarner, NohTow, bclavie, orionweller, ohallstrom, staghado, alexisgallagher, rbiswasfc, fladhak, tomaarsen, ncoop57, griffin, jph00, johnowhitaker, iacolippo • Dec 19, 2024 • 747
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Paper • 2501.07171 • Published Jan 13, 2025 • 55
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities Paper • 2407.14482 • Published Jul 19, 2024 • 26
swap-uniba/LLaMAntino-3-ANITA-8B-Inst-DPO-ITA Text Generation • 8B • Updated Sep 1, 2025 • 2.91k • • 30
Building on CPU Upgrade Agents 84 Open Ita Llm Leaderboard 🏆 84 Track, rank and evaluate open LLMs in the italian language!
view post Post 9696 Working on a concept GPT-2 (small) that uses KANs instead of MLPs.The ckpt and training code will be soon on the hub. 6 replies · 🚀 31 31 👍 14 14 🔥 11 11 🤯 4 4 ➕ 4 4 + Reply
Rethinking Interpretability in the Era of Large Language Models Paper • 2402.01761 • Published Jan 30, 2024 • 23
RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture Paper • 2401.08406 • Published Jan 16, 2024 • 38