Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors Paper • 2505.24523 • Published May 30 • 9
ITA-Bench: Italian Benchmarks for LLMs Collection A collection of Italian benchmarks for Large Language Models. See also our Github repo: https://github.com/SapienzaNLP/ita-bench • 23 items • Updated 20 days ago • 8
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 134 items • Updated Oct 20 • 116