StoneShi
SSS
AI & ML interests
None yet
Recent Activity
upvoted a paper about 9 hours ago
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL liked a model 6 days ago
nvidia/Lyra-2.0 liked a model 8 days ago
XiaomiMiMo/MiMo-V2.5-Pro