Article Party is over: regularizing ColBERT models to fix efficient ANN methods lightonai • 10 days ago • 23
Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models lightonai • Apr 21 • 42
Tmax Collection Data and models associated with "Tmax: A simple recipe for terminal agents". paper: https://arxiv.org/abs/2606.23321 • 23 items • Updated 3 days ago • 12
Cosmos3 Collection Omnimodal World Models for Physical AI • 16 items • Updated about 17 hours ago • 131
XTR Replicability Collection All the models used in experiments from "A Replicability Study of XTR" • 16 items • Updated May 5 • 7
Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 909
Article TRL v1.0: Post-Training Library Built to Move with the Field +2 qgallouedec, stevhliu, pcuenq, sergiopaniego • Mar 31 • 57
Article Training and Finetuning Reranker Models with Sentence Transformers tomaarsen • Mar 26, 2025 • 195
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 • 24 items • Updated Apr 16 • 65
The Y-Combinator for LLMs: Solving Long-Context Rot with λ-Calculus Paper • 2603.20105 • Published Mar 20 • 37
CodeScout Collection RL-trained code search agents (1.7B, 4B, 14B) that outperform 2–18× larger models using only a Unix terminal. 📄 arxiv.org/abs/2603.17829 • 12 items • Updated Mar 19 • 8
PyLate 🐕 Collection State-of-the-art late interaction models trained using PyLate • 7 items • Updated 2 days ago • 6
ColBERT-Zero 🐶 Collection First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT • 10 items • Updated 2 days ago • 23