Qihan Ren

jasonrqh

https://nebularaid2000.github.io/

AI & ML interests

explainable AI, LLM

Recent Activity

upvoted a paper 2 days ago

OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories

upvoted a paper 2 days ago

ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration

upvoted a paper 22 days ago

Seedance 2.0: Advancing Video Generation for World Complexity

authored 3 papers 25 days ago

Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?

Paper • 2603.03202 • Published Mar 3 • 17

ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety

Paper • 2604.02022 • Published Apr 2 • 15

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published about 1 month ago • 324

submitted a paper to Daily Papers 28 days ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published about 1 month ago • 324

submitted a paper to Daily Papers 3 months ago

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report v1.5

Paper • 2602.14457 • Published Feb 16 • 29

authored a paper 3 months ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published Jan 26 • 125

submitted a paper to Daily Papers 3 months ago

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Paper • 2601.18491 • Published Jan 26 • 125

authored 2 papers 7 months ago

Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models

Paper • 2509.23962 • Published Sep 28, 2025 • 5

Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents

Paper • 2509.26354 • Published Sep 30, 2025 • 18

authored 3 papers 9 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28, 2025 • 85

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45° Law

Paper • 2507.18576 • Published Jul 24, 2025 • 10

Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models

Paper • 2505.19509 • Published May 26, 2025 • 7

authored a paper 12 months ago

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Paper • 2505.20286 • Published May 26, 2025 • 8