Walkie Cyrile/dataset-the-stack-v2-dedup-sub Viewer • Updated Apr 1, 2025 • 82.8M • 1.92k • 6 zaydzuhri/stack-edu-python Viewer • Updated Aug 29, 2025 • 25.3M • 1.06k • 1 OpenCoder-LLM/opc-annealing-corpus Viewer • Updated May 29, 2025 • 15.6M • 1.35k • 43 OpenCoder-LLM/opc-fineweb-math-corpus Viewer • Updated Nov 24, 2024 • 5.24M • 467 • 30
Overall Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 23 days ago • 32 Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 25 days ago • 7
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 23 days ago • 32
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 25 days ago • 7
Walkie Cyrile/dataset-the-stack-v2-dedup-sub Viewer • Updated Apr 1, 2025 • 82.8M • 1.92k • 6 zaydzuhri/stack-edu-python Viewer • Updated Aug 29, 2025 • 25.3M • 1.06k • 1 OpenCoder-LLM/opc-annealing-corpus Viewer • Updated May 29, 2025 • 15.6M • 1.35k • 43 OpenCoder-LLM/opc-fineweb-math-corpus Viewer • Updated Nov 24, 2024 • 5.24M • 467 • 30
Overall Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 23 days ago • 32 Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 25 days ago • 7
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published 23 days ago • 32
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper • 2604.11610 • Published 25 days ago • 7