rlvr-weak-supervision Collection Models from "When Can LLMs Learn to Reason with Weak Supervision?" — Llama-3.2-3B with continual pre-training and Thinking SFT. • 3 items • Updated 18 days ago • 2
rlvr-weak-supervision Collection Models from "When Can LLMs Learn to Reason with Weak Supervision?" — Llama-3.2-3B with continual pre-training and Thinking SFT. • 3 items • Updated 18 days ago • 2
CoDaS: AI Co-Data-Scientist for Biomarker Discovery via Wearable Sensors Paper • 2604.14615 • Published 23 days ago • 7