Song Dingjie's picture

Song Dingjie

songdj

·

bbsngg

AI & ML interests

None yet

Recent Activity

authored a paper 19 days ago

OpenSkill: Open-World Self-Evolution for LLM Agents

upvoted a paper 20 days ago

OpenSkill: Open-World Self-Evolution for LLM Agents

updated a collection 23 days ago

View all activity

Organizations

None yet

upvoted a paper 20 days ago

OpenSkill: Open-World Self-Evolution for LLM Agents

Paper • 2606.06741 • Published 24 days ago • 29

upvoted a paper about 1 month ago

AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery

Paper • 2605.23204 • Published May 22 • 29

upvoted 5 papers 3 months ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 265

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills

Paper • 2604.05333 • Published Apr 7 • 23

Towards a Medical AI Scientist

Paper • 2603.28589 • Published Mar 30 • 91

Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math

Paper • 2603.24961 • Published Mar 26 • 4

Expert Threshold Routing for Autoregressive Language Modeling with Dynamic Computation Allocation and Load Balancing

Paper • 2603.11535 • Published Mar 12 • 10

upvoted 2 papers 4 months ago

Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception

Paper • 2602.11858 • Published Feb 12 • 63

LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric Evaluation

Paper • 2602.10367 • Published Feb 10 • 13

upvoted 2 papers 6 months ago

Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models

Paper • 2601.01321 • Published Jan 4 • 20

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

Paper • 2512.11558 • Published Dec 12, 2025 • 45

upvoted a paper 12 months ago

SAMed-2: Selective Memory Enhanced Medical Segment Anything Model

Paper • 2507.03698 • Published Jul 4, 2025 • 12

upvoted 7 papers about 1 year ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Paper • 2506.18095 • Published Jun 22, 2025 • 67

Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents

Paper • 2505.23450 • Published May 29, 2025 • 9

CoRT: Code-integrated Reasoning within Thinking

Paper • 2506.09820 • Published Jun 11, 2025 • 18

A Survey on Post-training of Large Language Models

Paper • 2503.06072 • Published Mar 8, 2025 • 11

MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

Paper • 2505.16459 • Published May 22, 2025 • 45

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21, 2025 • 157

NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes

Paper • 2504.11544 • Published Apr 15, 2025 • 44

upvoted a paper over 1 year ago

Aligning Multimodal LLM with Human Preference: A Survey

Paper • 2503.14504 • Published Mar 18, 2025 • 26