mini-swe-agent-plus Collection A collection of mini-swe-agent-plus and corresponding rollout traces that drive Qwen3-8B to a 39% solve rate on SWE-bench Verified. Enjoy! • 2 items • Updated Nov 12
Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval Paper • 2505.19650 • Published May 26 • 5
Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval Paper • 2505.19650 • Published May 26 • 5
view article Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models Jul 18 • 50
RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning Paper • 2507.07451 • Published Jul 10 • 5
RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning Paper • 2507.07451 • Published Jul 10 • 5 • 1