view article Article Announcing LiteCoder-Terminal: Lightweight Terminal Agents with <1k Synthesized Trajectories 8 days ago • 9
When Models Outthink Their Safety: Mitigating Self-Jailbreak in Large Reasoning Models with Chain-of-Guardrails Paper • 2510.21285 • Published Oct 24 • 4
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering Paper • 2411.11504 • Published Nov 18, 2024 • 24
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering Paper • 2411.11504 • Published Nov 18, 2024 • 24
StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation Paper • 2408.03281 • Published Aug 6, 2024 • 10
StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation Paper • 2408.03281 • Published Aug 6, 2024 • 10
StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation Paper • 2408.03281 • Published Aug 6, 2024 • 10 • 2
Towards Scalable Automated Alignment of LLMs: A Survey Paper • 2406.01252 • Published Jun 3, 2024 • 2