Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations Paper • 2511.05613 • Published Nov 6, 2025
Position: The Complexity of Perfect AI Alignment -- Formalizing the RLHF Trilemma Paper • 2511.19504 • Published Nov 23, 2025 • 2
Catch Me If You Can: How Smaller Reasoning Models Pretend to Reason with Mathematical Fidelity Paper • 2512.00552 • Published Nov 29, 2025
When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning Paper • 2603.03475 • Published Mar 3
I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift Paper • 2603.01297 • Published Mar 1
Dial E for Ethical Enforcement: institutional VETO power as a governance primitive Paper • 2603.00617 • Published Feb 28
SAHOO: Safeguarded Alignment for High-Order Optimization Objectives in Recursive Self-Improvement Paper • 2603.06333 • Published Mar 6 • 1
The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness Paper • 2603.09200 • Published Mar 10 • 5
High Performance of Gradient Boosting in Binding Affinity Prediction Paper • 2205.07023 • Published May 14, 2022
34 Examples of LLM Applications in Materials Science and Chemistry: Towards Automation, Assistants, Agents, and Accelerated Scientific Discovery Paper • 2505.03049 • Published May 5, 2025
ComProScanner: A multi-agent based framework for composition-property structured data extraction from scientific literature Paper • 2510.20362 • Published Oct 23, 2025 • 3
Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry Paper • 2411.15221 • Published Nov 20, 2024 • 30
CIDAR: Culturally Relevant Instruction Dataset For Arabic Paper • 2402.03177 • Published Feb 5, 2024 • 8