Yuansheng Ni

yuanshengni

https://yuanshengni.github.io/

AI & ML interests

NLP

Recent Activity

authored a paper about 21 hours ago

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding

authored a paper about 21 hours ago

Towards Retrieving Interaction Spaces for Agentic Search

updated a dataset 1 day ago

MMMU/MMMU

Organizations

upvoted a paper 2 months ago

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published May 3 • 126

upvoted 3 papers 3 months ago

upvoted a paper 4 months ago

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Paper • 2603.24440 • Published Mar 25 • 99

upvoted a collection 4 months ago

OpenResearcher

Collection

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis • 8 items • Updated Mar 24 • 18

upvoted a paper 4 months ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published Mar 17 • 101

upvoted 3 papers 5 months ago

VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

Paper • 2602.13294 • Published Feb 9 • 13

InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem

Paper • 2602.14367 • Published Feb 16 • 17

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Paper • 2602.06028 • Published Feb 5 • 36

upvoted a paper 6 months ago

Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency

Paper • 2601.05905 • Published Jan 9 • 21

upvoted 2 papers 7 months ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 78

InnoGym: Benchmarking the Innovation Potential of AI Agents

Paper • 2512.01822 • Published Dec 1, 2025 • 36

upvoted a paper 9 months ago

VisCoder2: Building Multi-Language Visualization Coding Agents

Paper • 2510.23642 • Published Oct 24, 2025 • 22

upvoted a collection 9 months ago

VisCoder2

Collection

Building Multi-Language Visualization Coding Agents • 7 items • Updated Oct 29, 2025 • 4

upvoted 5 papers 9 months ago

LightMem: Lightweight and Efficient Memory-Augmented Generation

Paper • 2510.18866 • Published Oct 21, 2025 • 116

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Paper • 2510.10666 • Published Oct 12, 2025 • 29

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9, 2025 • 81

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9, 2025 • 110

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 276
