Ziheng Li
ChillingDream
AI & ML interests
Natural Language Processing
Recent Activity
authored a paper about 6 hours ago
To Mix or To Merge: Toward Multi-Domain Reinforcement Learning for Large Language Models
authored a paper about 6 hours ago
LiveClawBench: Benchmarking LLM Agents on Complex, Real-World Assistant Tasks
authored a paper about 6 hours ago
Trust Region On-Policy Distillation
Organizations
None yet