5 49 7

Weida Wang

weidawang

https://davidweidawang.github.io/

davidweida

AI & ML interests

None yet

Recent Activity

upvoted a paper about 16 hours ago

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

upvoted an article 11 days ago

PhysicsIntern: from an Autonomous Benchmark-runner to a Research Sidekick

liked a Space 11 days ago

huggingface/physics-intern

View all activity

Organizations

upvoted a paper about 16 hours ago

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

Paper • 2606.24530 • Published 3 days ago • 55

upvoted an article 11 days ago

Article

PhysicsIntern: from an Autonomous Benchmark-runner to a Research Sidekick

dlouapre

•

14 days ago

• 6

liked a Space 11 days ago

physics-intern: an Autonomous Agent for Physics Research

📝

Explore an autonomous AI workflow for physics research

upvoted a paper 16 days ago

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

Paper • 2606.07591 • Published 29 days ago • 95

upvoted a paper 23 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 28 days ago • 118

upvoted 2 papers about 1 month ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published May 19 • 190

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Paper • 2605.06326 • Published May 7 • 26

authored 4 papers about 1 month ago

Plan Then Action:High-Level Planning Guidance Reinforcement Learning for LLM Reasoning

Paper • 2510.01833 • Published Oct 2, 2025

QCBench: Evaluating Large Language Models on Domain-Specific Quantitative Chemistry

Paper • 2508.01670 • Published Aug 3, 2025

PolyReal: A Benchmark for Real-World Polymer Science Workflows

Paper • 2604.02934 • Published Apr 3

$δ$-mem: Efficient Online Memory for Large Language Models

Paper • 2605.12357 • Published May 12 • 131

upvoted a paper about 1 month ago

δ-mem: Efficient Online Memory for Large Language Models

Paper • 2605.12357 • Published May 12 • 131

liked a dataset about 2 months ago

AdrianMiao/PRL_Bench

Viewer • Updated Apr 7 • 101 • 35 • 3

liked a dataset 2 months ago

stellalisy/cognitive_foundations

Preview • Updated Dec 16, 2025 • 185 • 2

updated a dataset 3 months ago

weidawang/PolyReal

Viewer • Updated Apr 5 • 545 • 14 • 1

published a dataset 3 months ago

weidawang/PolyReal

Viewer • Updated Apr 5 • 545 • 14 • 1

upvoted 4 papers 3 months ago

PRBench: End-to-end Paper Reproduction in Physics Research

Paper • 2603.27646 • Published Mar 29 • 29

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 134

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published Mar 22 • 78

Memento-Skills: Let Agents Design Agents

Paper • 2603.18743 • Published Mar 19 • 58

Weida Wang

AI & ML interests

Recent Activity

Organizations

weidawang's activity

PhysicsIntern: from an Autonomous Benchmark-runner to a Research Sidekick

physics-intern: an Autonomous Agent for Physics Research