Tahoe-x1 Collection Resources related to the Tx1 family of single-cell foundation models from Tahoe. • 2 items • Updated Oct 22, 2025 • 4
Article Swift Transformers Reaches 1.0 – and Looks to the Future pcuenq, FL33TW00D-HF, mattt, reach-vb • Sep 26, 2025 • 43
Article "Anemll-style" Root-Mean-Square (RMS) Normalization on the Apple Neural Engine: A Simple Hack anemll • Sep 16, 2025 • 15
Article Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 188
Article Accelerating AI for Drug Discovery: Ginkgo’s GDPx Functional Genomics and GDPa Antibody Developability Dataset Series cgeorgiaw • Jun 24, 2025 • 20
Enhancing Training Efficiency Using Packing with Flash Attention Paper • 2407.09105 • Published Jul 12, 2024 • 17
Article Predicting the Effects of Mutations on Protein Function with ESM-2 AmelieSchreiber • Dec 13, 2023 • 28
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 222
Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels drbh, danieldk • Aug 18, 2025 • 100
Article Introducing Command A Vision: Multimodal AI built for Business CohereLabs • Jul 31, 2025 • 64
Article Arc Virtual Cell Challenge: A Primer FL33TW00D-HF, abhinadduri • Jul 18, 2025 • 66
Article SmolLM3: smol, multilingual, long-context reasoner eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 777
Article cocogold: training Marigold for text-grounded segmentation pcuenq • Jul 8, 2025 • 31
Article Gemma 3n fully available in the open-source ecosystem! ariG23498, pcuenq, sergiopaniego, reach-vb, FL33TW00D-HF, Xenova, Steveeeeeeen, kashif • Jun 26, 2025 • 121
Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper • 2505.14669 • Published May 20, 2025 • 78