arxiv:2603.14473
Ming Zhang
konglongge
ยท
AI & ML interests
LLMs
Recent Activity
authored a paper about 6 hours ago
DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training authored a paper about 6 hours ago
SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents authored a paper about 6 hours ago
AI Can Learn Scientific Taste