-
Towards General Agentic Intelligence via Environment Scaling
Paper • 2509.13311 • Published • 71 -
Establishing Best Practices for Building Rigorous Agentic Benchmarks
Paper • 2507.02825 • Published • 1 -
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper • 2510.19363 • Published • 62 -
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
Paper • 2510.18941 • Published • 8
Shang Hong Sim
shanghong
AI & ML interests
Neural decoding, neuroengineering, signal processing
Organizations
RAG
-
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation
Paper • 2501.13726 • Published • 1 -
RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement
Paper • 2412.12881 • Published • 2 -
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Paper • 2502.01142 • Published • 24
to read
-
Towards General Agentic Intelligence via Environment Scaling
Paper • 2509.13311 • Published • 71 -
Establishing Best Practices for Building Rigorous Agentic Benchmarks
Paper • 2507.02825 • Published • 1 -
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper • 2510.19363 • Published • 62 -
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
Paper • 2510.18941 • Published • 8
gold_datasets
RAG
-
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation
Paper • 2501.13726 • Published • 1 -
RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement
Paper • 2412.12881 • Published • 2 -
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
Paper • 2502.01142 • Published • 24