Better Models, Faster Training: Sigmoid Attention for single-cell Foundation Models Paper β’ 2604.27124 β’ Published 7 days ago β’ 5
T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning Paper β’ 2605.02178 β’ Published 2 days ago β’ 4
PhysicianBench: Evaluating LLM Agents in Real-World EHR Environments Paper β’ 2605.02240 β’ Published 2 days ago β’ 6
AcademiClaw: When Students Set Challenges for AI Agents Paper β’ 2605.02661 β’ Published 2 days ago β’ 8
Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling Paper β’ 2604.28075 β’ Published 6 days ago β’ 14
HUMAN-WRITTEN & LEGALLY-SOURCED* Collection Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly. β’ 169 items β’ Updated about 20 hours ago β’ 3
Large Language Models Explore by Latent Distilling Paper β’ 2604.24927 β’ Published 9 days ago β’ 71 β’ 7
Large Language Models Explore by Latent Distilling Paper β’ 2604.24927 β’ Published 9 days ago β’ 71
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 117 items β’ Updated 5 days ago β’ 17
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 117 items β’ Updated 5 days ago β’ 17
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 117 items β’ Updated 5 days ago β’ 17
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning Paper β’ 2509.24372 β’ Published Sep 29, 2025 β’ 14
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper β’ 2502.05171 β’ Published Feb 7, 2025 β’ 155
SelfCodeAlign: Self-Alignment for Code Generation Paper β’ 2410.24198 β’ Published Oct 31, 2024 β’ 25
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 117 items β’ Updated 5 days ago β’ 17