QueryAttack: Jailbreaking Aligned Large Language Models Using Structured Non-natural Query Language Paper • 2502.09723 • Published Feb 13, 2025
From Runnable to Shippable: Multi-Agent Test-Driven Development for Generating Full-Stack Web Applications from Requirements Paper • 2605.17242 • Published 3 days ago • 9
DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation Paper • 2506.06251 • Published Jun 6, 2025 • 2