Apertus LLM Collection Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages β’ 4 items β’ Updated Oct 1, 2025 β’ 336
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization Paper β’ 2508.14460 β’ Published Aug 20, 2025 β’ 85
Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters Paper β’ 2507.13618 β’ Published Jul 18, 2025 β’ 16
How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks Paper β’ 2507.01955 β’ Published Jul 2, 2025 β’ 36
Scaling Laws of Decoder-Only Models on the Multilingual Machine Translation Task Paper β’ 2409.15051 β’ Published Sep 23, 2024 β’ 2
Babel: Open Multilingual Large Language Models Serving Over 90% of Global Speakers Paper β’ 2503.00865 β’ Published Mar 2, 2025 β’ 64
Optimizing Large Language Model Training Using FP4 Quantization Paper β’ 2501.17116 β’ Published Jan 28, 2025 β’ 36
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations Paper β’ 2412.07626 β’ Published Dec 10, 2024 β’ 29
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published Dec 4, 2024 β’ 133
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs Dec 4, 2024 β’ 80
X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale Paper β’ 2410.03115 β’ Published Oct 4, 2024 β’ 1
Is Preference Alignment Always the Best Option to Enhance LLM-Based Translation? An Empirical Analysis Paper β’ 2409.20059 β’ Published Sep 30, 2024 β’ 16
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning Paper β’ 2409.12568 β’ Published Sep 19, 2024 β’ 50
Training Language Models to Self-Correct via Reinforcement Learning Paper β’ 2409.12917 β’ Published Sep 19, 2024 β’ 140