arxiv:2601.21590
xiaotong
xtongji
AI & ML interests
None yet
Recent Activity
authored a paper about 10 hours ago
Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving
authored a paper about 10 hours ago
Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective
authored a paper about 10 hours ago
Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening
Organizations
None yet