view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 505
view article Article DeepSeek-V4: a million-token context that agents can actually use 13 days ago • 42
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 56
view article Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models 16 days ago • 37
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 21 days ago • 69
view article Article Multimodal Embedding & Reranker Models with Sentence Transformers 28 days ago • 57
view article Article AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality Jan 21 • 33
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 Mar 10 • 142
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16, 2025 • 124
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 • 180