Human Decision-making is Susceptible to AI-driven Manipulation Paper • 2502.07663 • Published Feb 11, 2025
SocialEval: Evaluating Social Intelligence of Large Language Models Paper • 2506.00900 • Published Jun 1, 2025
JumpStarter: Human-AI Planning with Task-Structured Context Curation Paper • 2410.03882 • Published Oct 4, 2024
Generalization or Memorization: Dynamic Decoding for Mode Steering Paper • 2510.22099 • Published Oct 25, 2025 • 4
Cognition-of-Thought Elicits Social-Aligned Reasoning in Large Language Models Paper • 2509.23441 • Published Sep 27, 2025
Simulating and Understanding Deceptive Behaviors in Long-Horizon Interactions Paper • 2510.03999 • Published Oct 5, 2025
Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding Paper • 2606.21906 • Published 16 days ago • 24
Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding Paper • 2606.21906 • Published 16 days ago • 24 • 13
Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding Paper • 2606.21906 • Published 16 days ago • 24 • 13
Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding Paper • 2606.21906 • Published 16 days ago • 24
Deeper is Not Always Better: Mitigating the Alignment Tax via Confident Layer Decoding Paper • 2606.21906 • Published 16 days ago • 24
Generalization or Memorization: Dynamic Decoding for Mode Steering Paper • 2510.22099 • Published Oct 25, 2025 • 4
Generalization or Memorization: Dynamic Decoding for Mode Steering Paper • 2510.22099 • Published Oct 25, 2025 • 4 • 1
ReST-RL: Achieving Accurate Code Reasoning of LLMs with Optimized Self-Training and Decoding Paper • 2508.19576 • Published Aug 27, 2025 • 2
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published May 25, 2025 • 25 • 4
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published May 25, 2025 • 25
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published May 25, 2025 • 25 • 4
Agent-SafetyBench: Evaluating the Safety of LLM Agents Paper • 2412.14470 • Published Dec 19, 2024 • 12
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published May 25, 2025 • 25
MetaMind: Modeling Human Social Thoughts with Metacognitive Multi-Agent Systems Paper • 2505.18943 • Published May 25, 2025 • 25 • 4