周 昊天
zhqi6m
AI & ML interests
None yet
Recent Activity
upvoted a paper about 5 hours ago
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards liked a model about 12 hours ago
tencent/Hy-MT2-1.8BOrganizations
None yet