Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Shang Hong Sim's picture

Shang Hong Sim

shanghong

21world's profile picture

·

https://shanghongsim.github.io/

shanghong_sim
shanghongsim
shanghongsim

AI & ML interests

Neural decoding, neuroengineering, signal processing

Organizations

shanghong 's collections 5

agentic training datasets

interstellarninja/hermes_reasoning_tool_use

Viewer • Updated Dec 26, 2025 • 51k • 2.79k • 171
open-r1/codeforces

Viewer • Updated May 19, 2025 • 34.8k • 8.17k • 101
lmarena-ai/webdev-arena-preference-10k

Viewer • Updated Mar 10, 2025 • 10.5k • 51.2k • 17
SWE-bench/SWE-smith-trajectories

Viewer • Updated Jul 19, 2025 • 76k • 4.3k • 64

Towards General Agentic Intelligence via Environment Scaling

Paper • 2509.13311 • Published Sep 16, 2025 • 73
Establishing Best Practices for Building Rigorous Agentic Benchmarks

Paper • 2507.02825 • Published Jul 3, 2025 • 1
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published Oct 22, 2025 • 63
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

Paper • 2510.18941 • Published Oct 21, 2025 • 13

RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation

Paper • 2501.13726 • Published Jan 23, 2025 • 1
RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement

Paper • 2412.12881 • Published Dec 17, 2024 • 2
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published Feb 3, 2025 • 25

Strong OSS models

moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated Dec 16, 2025 • 72.3k • • 564
PleIAs/Baguettotron

Text Generation • 0.3B • Updated Apr 27 • 982 • 260
baidu/ERNIE-4.5-21B-A3B-Thinking

Text Generation • 22B • Updated Nov 26, 2025 • 14.9k • 786

UCSC-VLAA/MedReason

Viewer • Updated May 27, 2025 • 32.7k • 670 • 87
interstellarninja/hermes_reasoning_tool_use

Viewer • Updated Dec 26, 2025 • 51k • 2.79k • 171
HuggingFaceM4/DoclingMatix

Viewer • Updated Jul 31, 2025 • 1.27M • 2.07k • 52
Amod/mental_health_counseling_conversations

Viewer • Updated Nov 25, 2025 • 3.51k • 3.33k • 486

agentic training datasets

interstellarninja/hermes_reasoning_tool_use

Viewer • Updated Dec 26, 2025 • 51k • 2.79k • 171
open-r1/codeforces

Viewer • Updated May 19, 2025 • 34.8k • 8.17k • 101
lmarena-ai/webdev-arena-preference-10k

Viewer • Updated Mar 10, 2025 • 10.5k • 51.2k • 17
SWE-bench/SWE-smith-trajectories

Viewer • Updated Jul 19, 2025 • 76k • 4.3k • 64

Strong OSS models

moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated Dec 16, 2025 • 72.3k • • 564
PleIAs/Baguettotron

Text Generation • 0.3B • Updated Apr 27 • 982 • 260
baidu/ERNIE-4.5-21B-A3B-Thinking

Text Generation • 22B • Updated Nov 26, 2025 • 14.9k • 786

Towards General Agentic Intelligence via Environment Scaling

Paper • 2509.13311 • Published Sep 16, 2025 • 73
Establishing Best Practices for Building Rigorous Agentic Benchmarks

Paper • 2507.02825 • Published Jul 3, 2025 • 1
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published Oct 22, 2025 • 63
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

Paper • 2510.18941 • Published Oct 21, 2025 • 13

UCSC-VLAA/MedReason

Viewer • Updated May 27, 2025 • 32.7k • 670 • 87
interstellarninja/hermes_reasoning_tool_use

Viewer • Updated Dec 26, 2025 • 51k • 2.79k • 171
HuggingFaceM4/DoclingMatix

Viewer • Updated Jul 31, 2025 • 1.27M • 2.07k • 52
Amod/mental_health_counseling_conversations

Viewer • Updated Nov 25, 2025 • 3.51k • 3.33k • 486

RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation

Paper • 2501.13726 • Published Jan 23, 2025 • 1
RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement

Paper • 2412.12881 • Published Dec 17, 2024 • 2
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models

Paper • 2502.01142 • Published Feb 3, 2025 • 25

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs