Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published Dec 19, 2024 • 13
Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem Paper • 2509.06809 • Published Sep 8, 2025 • 3
Reasoning Core: A Scalable RL Environment for LLM Symbolic Reasoning Paper • 2509.18083 • Published Sep 22, 2025 • 5
MortalMATH: Evaluating the Conflict Between Reasoning Objectives and Emergency Contexts Paper • 2601.18790 • Published Jan 26 • 2
Adaptive Text Anonymization: Learning Privacy-Utility Trade-offs via Prompt Optimization Paper • 2602.20743 • Published Feb 24 • 2
Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training Paper • 2603.02208 • Published Mar 2 • 4
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods Paper • 2407.21630 • Published Jul 31, 2024 • 8
Consent in Crisis: The Rapid Decline of the AI Data Commons Paper • 2407.14933 • Published Jul 20, 2024 • 15
Generating multiple-choice questions for medical question answering with distractors and cue-masking Paper • 2303.07069 • Published Mar 13, 2023
Attention Overflow: Language Model Input Blur during Long-Context Missing Items Recommendation Paper • 2407.13481 • Published Jul 18, 2024 • 10