8 34 2

Qingcheng Zeng

qcz

qcznlp

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Autodata: An agentic data scientist to create high quality synthetic data

upvoted a paper 4 days ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

upvoted a paper 10 days ago

Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus

View all activity

Organizations

upvoted a paper 1 day ago

Autodata: An agentic data scientist to create high quality synthetic data

Paper • 2606.25996 • Published 3 days ago • 9

upvoted a paper 4 days ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 6 days ago • 92

upvoted a paper 10 days ago

Beyond Monolingual Deep Research: Evaluating Agents and Retrievers with Cross-Lingual BrowseComp-Plus

Paper • 2606.15345 • Published 14 days ago • 16

upvoted a paper 22 days ago

Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation

Paper • 2606.06428 • Published 23 days ago • 25

upvoted a paper about 1 month ago

ThoughtTrace: Understanding User Thoughts in Real-World LLM Interactions

Paper • 2605.20087 • Published May 19 • 18

upvoted 2 papers about 2 months ago

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published May 3 • 126

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 23

upvoted 2 papers 2 months ago

Code-Switching Information Retrieval: Benchmarks, Analysis, and the Limits of Current Retrievers

Paper • 2604.17632 • Published Apr 19 • 12

Dual-View Training for Instruction-Following Information Retrieval

Paper • 2604.18845 • Published Apr 20 • 12

upvoted a collection 4 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.69k

upvoted 2 papers 5 months ago

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

Paper • 2601.11004 • Published Jan 16 • 31

The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents

Paper • 2601.07264 • Published Jan 12 • 24

upvoted a paper 6 months ago

Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction

Paper • 2512.18880 • Published Dec 21, 2025 • 25

upvoted a paper 8 months ago

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Paper • 2510.24081 • Published Oct 28, 2025 • 24

upvoted 4 papers 9 months ago

Good Intentions Beyond ACL: Who Does NLP for Social Good, and Where?

Paper • 2510.04434 • Published Oct 6, 2025 • 6

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 147

Multiplayer Nash Preference Optimization

Paper • 2509.23102 • Published Sep 27, 2025 • 62

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 137

upvoted a paper 10 months ago

The Majority is not always right: RL training for solution aggregation

Paper • 2509.06870 • Published Sep 8, 2025 • 15

upvoted a paper 11 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 190

Qingcheng Zeng

AI & ML interests

Recent Activity

Organizations

qcz's activity