AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning Paper โข 2605.00425 โข Published 4 days ago โข 15
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence Paper โข 2603.13398 โข Published Mar 11 โข 154
view article Article Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips Sep 24, 2025 โข 9
Qianfan-VL Collection Qianfan-vl model series. The models are mainly domain enhanced vision language model, targeting enterprise level multi modal understanding scenarios. โข 5 items โข Updated Mar 18 โข 28
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper โข 2508.05629 โข Published Aug 7, 2025 โข 189