trl-internal-testing/tiny-DeepseekV3ForCausalLM • Text Generation • 5.52M • Updated Dec 19, 2025 • 162k • 3
SkillOpt: Executive Strategy for Self-Evolving Agent Skills • Paper • 2605.23904 • Published 21 days ago • 227
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement • Paper • 2606.11926 • Published 2 days ago • 84