🔄 In a Training Loop

10 58

Sijia Cui

cuisijia

https://github.com/SijiaCui

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

GD^2PO: Mitigating Multi-Reward Conflicts via Group-Dynamic reward-Decoupled Policy Optimization

liked a dataset 2 months ago

phiyodr/coco2017

liked a dataset 2 months ago

jonathan-roberts1/zerobench

View all activity

Organizations

upvoted a paper 9 days ago

GD^2PO: Mitigating Multi-Reward Conflicts via Group-Dynamic reward-Decoupled Policy Optimization

Paper • 2606.16771 • Published 10 days ago • 13

liked 3 datasets 2 months ago

liked 3 datasets 3 months ago

nlphuji/flickr30k

Viewer • Updated Jan 19, 2023 • 31k • 10.7k • 105

HuggingFaceM4/A-OKVQA

Viewer • Updated Feb 8, 2024 • 24.9k • 4.47k • 17

LMMs-Lab-Turtle/Vision-SR1-47K

Viewer • Updated Aug 22, 2025 • 47.6k • 1.95k • 6

authored a paper 3 months ago

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

Paper • 2603.10101 • Published Mar 10 • 6

upvoted a collection 3 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 746

liked 2 datasets 4 months ago

rafaelpadilla/coco2017

Viewer • Updated Aug 11, 2023 • 123k • 3.2k • 31

takara-ai/image_captions

Viewer • Updated Feb 11, 2025 • 1.07M • 841 • 24

upvoted an article 4 months ago

Article

arXiv实用技巧，如何让你的paper关注度变高？

JessyTsu1

•

Jul 8, 2024

• 20

liked a model 4 months ago

allenai/Olmo-3-1025-7B

Text Generation • 7B • Updated Apr 21 • 86k • 71

liked 4 datasets 6 months ago

DigitalLearningGmbH/MATH-lighteval

Viewer • Updated Jan 15, 2025 • 25k • 26.4k • 65

edinburgh-dawg/mmlu-redux-2.0

Viewer • Updated Feb 25, 2025 • 5.7k • 14.8k • 37

cais/mmlu

Viewer • Updated Mar 8, 2024 • 231k • 453k • 775

livecodebench/code_generation_lite

Updated Jun 5, 2025 • 64k • 93

liked a model 6 months ago

nvidia/DLER-R1-1.5B-Research

2B • Updated Oct 25, 2025 • 123 • 19

liked a model 7 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • 2B • Updated Feb 24, 2025 • 608k • 1.53k

liked a dataset 7 months ago

agentica-org/DeepScaleR-Preview-Dataset

Viewer • Updated Feb 10, 2025 • 40.3k • 20.3k • 200

Sijia Cui

AI & ML interests

Recent Activity

Organizations

cuisijia's activity

arXiv实用技巧，如何让你的paper关注度变高？