From Raw Experience to Skill Consumption: A Systematic Study of Model-Generated Agent Skills Paper • 2605.23899 • Published 4 days ago • 24 • 1
Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Paper • 2604.13602 • Published Apr 15 • 32
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published 4 days ago • 163
zisuh/cot-scale-checkpoint-990-2epoch-4lang-self-generated-13k-checkpoint-104 15B • Updated 13 days ago • 16
zisuh/cot-scale-checkpoint-990-2epoch-4lang-self-generated-13k-checkpoint-68 15B • Updated 13 days ago • 18
zisuh/cot-scale-checkpoint-990-2epoch-4lang-self-generated-13k-checkpoint-34 15B • Updated 13 days ago • 18
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 29 days ago • 118
BizGenEval: A Systematic Benchmark for Commercial Visual Content Generation Paper • 2603.25732 • Published Mar 26 • 11
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation Paper • 2602.03619 • Published Feb 3 • 28
BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-Translation Paper • 2602.02554 • Published Jan 30 • 8