Saved - a morginalium Collection

Saved

updated May 19

I found these articles on Hugging Face and saved them to read later.

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

Paper • 2605.02290 • Published May 4 • 42

Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex

Paper • 2605.06139 • Published May 7 • 69

ZAYA1-8B Technical Report

Paper • 2605.05365 • Published May 6 • 5

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 141