view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 24 days ago • 69
propella-1: Multi-Property Document Annotation for LLM Data Curation at Scale Paper • 2602.12414 • Published Feb 12 • 2
Lost in Backpropagation: The LM Head is a Gradient Bottleneck Paper • 2603.10145 • Published Mar 10 • 13