Ming Zhang

konglongge

·

konglonggeFDU

AI & ML interests

LLMs

Recent Activity

upvoted a paper 3 days ago

The Verification Horizon: No Silver Bullet for Coding Agent Rewards

liked a dataset about 1 month ago

llmeval-fdu/LLMEval-Logic

upvoted a paper about 1 month ago

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

View all activity

Organizations

Papers 23

arxiv:2603.14473

arxiv:2602.12984

arxiv:2602.05890

arxiv:2602.03587

models 0

None public yet

datasets 3

konglongge/TaxoBench

Updated May 12 • 24

konglongge/TransferTOD

Preview • Updated May 12 • 16

konglongge/PFDial

Preview • Updated May 12 • 37