1 18

Songtao Huang

huangst

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

upvoted a paper 18 days ago

Self-Distilled Policy Gradient

upvoted a paper 20 days ago

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

View all activity

Organizations

None yet

upvoted a paper 16 days ago

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

Paper • 2606.07591 • Published 28 days ago • 95

upvoted a paper 18 days ago

Self-Distilled Policy Gradient

Paper • 2606.04036 • Published 23 days ago • 27

upvoted a paper 20 days ago

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

Paper • 2606.06473 • Published 21 days ago • 19

upvoted a paper 24 days ago

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 27 days ago • 118

upvoted a paper 3 months ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 431

authored 2 papers 4 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 239

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Paper • 2602.08990 • Published Feb 9 • 79

upvoted a paper 4 months ago

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

Paper • 2602.08990 • Published Feb 9 • 79

upvoted a paper 8 months ago

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 104

upvoted a paper 9 months ago

OmniCaptioner: One Captioner to Rule Them All

Paper • 2504.07089 • Published Apr 9, 2025 • 21

upvoted a paper 10 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 239

upvoted 2 papers 12 months ago

BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset

Paper • 2507.03483 • Published Jul 4, 2025 • 24

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 343

upvoted 4 papers about 1 year ago

VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning

Paper • 2506.09049 • Published Jun 10, 2025 • 37

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Paper • 2506.10521 • Published Jun 12, 2025 • 75

Reward Reasoning Model

Paper • 2505.14674 • Published May 20, 2025 • 38

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

Paper • 2505.21327 • Published May 27, 2025 • 83

authored a paper about 1 year ago

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Paper • 2505.16938 • Published May 22, 2025 • 121

upvoted a paper about 1 year ago

NovelSeek: When Agent Becomes the Scientist -- Building Closed-Loop System from Hypothesis to Verification

Paper • 2505.16938 • Published May 22, 2025 • 121

upvoted a paper over 1 year ago

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Paper • 2503.16408 • Published Mar 20, 2025 • 42

Songtao Huang

AI & ML interests

Recent Activity

Organizations

huangst's activity