Learning to Reason in 13 Parameters
Paper
•
2602.04118
•
Published
•
5
List of research papers, architectures, and techniques I re implemented in LLM-quest or Hugging Face's TRL. Missing papers: Qwen3-Next, GPT-2