Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published Dec 19, 2024 • 12
Saturation-Driven Dataset Generation for LLM Mathematical Reasoning in the TPTP Ecosystem Paper • 2509.06809 • Published Sep 8, 2025 • 3
Reasoning Core: A Scalable RL Environment for LLM Symbolic Reasoning Paper • 2509.18083 • Published Sep 22, 2025 • 5
MortalMATH: Evaluating the Conflict Between Reasoning Objectives and Emergency Contexts Paper • 2601.18790 • Published Jan 26, 2026 • 2
Adaptive Text Anonymization: Learning Privacy-Utility Trade-offs via Prompt Optimization Paper • 2602.20743 • Published Feb 24, 2026 • 2
Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training Paper • 2603.02208 • Published Mar 2, 2026 • 4
DeDisCo Collection Models for the DeDisCo discourse relation classification system from the DISRPT 2025 shared task • 2 items • Updated Sep 18, 2025
A Second Wave of UD Hebrew Treebanking and Cross-Domain Parsing Paper • 2210.07873 • Published Oct 14, 2022
TAROT: Task-Oriented Authorship Obfuscation Using Policy Optimization Methods Paper • 2407.21630 • Published Jul 31, 2024 • 8
Consent in Crisis: The Rapid Decline of the AI Data Commons Paper • 2407.14933 • Published Jul 20, 2024 • 15
Generating multiple-choice questions for medical question answering with distractors and cue-masking Paper • 2303.07069 • Published Mar 13, 2023