AgentDoG Collection A Diagnostic Guardrail Framework for AI Agent Safety and Security • 12 items • Updated 10 days ago • 112
Kronos: A Foundation Model for the Language of Financial Markets Paper • 2508.02739 • Published Aug 2, 2025 • 44
LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming? Paper • 2506.11928 • Published Jun 13, 2025 • 25