Datasets and model checkpoints for our Group Relative Reward Model (GRRM) framework
Sen Yang PRO
double7
AI & ML interests
None yet
Recent Activity
updated a dataset about 15 hours ago: double7/TowerBlocks-MT-CoT-ZhEn
updated a dataset about 15 hours ago: double7/MT_Ranking_Metric_Test
updated a dataset about 15 hours ago: double7/TowerBlocks-MT-Ranking
Organizations
None yet