Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning Paper • 2512.15687 • Published 9 days ago • 17
MotionEdit: Benchmarking and Learning Motion-Centric Image Editing Paper • 2512.10284 • Published 15 days ago • 25
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-nstd-0.5-0.8-start-relu 8B • Updated 27 days ago • 18
lhl616/Qwen3-8B-Base-axon-error-aware-128-8-dense-nstd-0.5-0.8-start-relu 8B • Updated 27 days ago • 18