DocTron 's Collections

MSRL

Breaking the SFT Plateau: Multimodal Structured Reinforcement Learning for Chart-to-Code Generation