Linguistics Theory Meets LLM: Code-Switched Text Generation via Equivalence Constrained Large Language Models Paper • 2410.22660 • Published Oct 30, 2024
Evaluating Deep Research Agents on Expert Consulting Work: A Benchmark with Verifiers, Rubrics, and Cognitive Traps Paper • 2605.17554 • Published 4 days ago