Dongfu Jiang

DongfuJiang

·

https://jdf-prog.github.io/

AI & ML interests

Large Language Model, Modality Reasoning and their evaluation

Recent Activity

published a dataset 6 days ago

TIGER-Lab/GenAI-Arena-logs

upvoted a paper 13 days ago

Dr-DCI: Scaling Direct Corpus Interaction via Dynamic Workspace Expansion

liked a model 14 days ago

zai-org/GLM-5.2

View all activity

Organizations

upvoted a paper 13 days ago

Dr-DCI: Scaling Direct Corpus Interaction via Dynamic Workspace Expansion

Paper • 2606.14885 • Published 19 days ago • 11

upvoted an article 20 days ago

Article

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

nvidia

•

27 days ago

• 17

upvoted a paper 25 days ago

Code2LoRA: Hypernetwork-Generated Adapters for Code Language Models under Software Evolution

Paper • 2606.06492 • Published 27 days ago • 95

upvoted 2 papers 26 days ago

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published about 1 month ago • 138

AutoLab: Can Frontier Models Solve Long-Horizon Auto Research and Engineering Tasks?

Paper • 2606.05080 • Published 28 days ago • 30

upvoted 3 papers about 2 months ago

WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors

Paper • 2605.10434 • Published May 11 • 29

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Paper • 2605.08083 • Published May 8 • 70

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published May 3 • 126

upvoted 7 papers 3 months ago

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2604.12374 • Published Apr 14 • 37

FORGE:Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Paper • 2604.07413 • Published Apr 8 • 97

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 265

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published Apr 6 • 36

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published Mar 29 • 33

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published Mar 17 • 101

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published Mar 19 • 70

upvoted 2 papers 4 months ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 189

VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

Paper • 2602.13294 • Published Feb 9 • 13

upvoted 3 papers 7 months ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 78

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 55

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 45