zuijiang
3 followers · 4 following
AI & ML interests: None yet
Recent Activity
Upvoted a paper about 1 hour ago: CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs
Upvoted a paper about 16 hours ago: Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
Upvoted a paper 1 day ago: Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
zuijiang's datasets (3)
zuijiang/alpaca-alpaca-clean • Viewer • Updated Aug 26, 2024 • 51.8k • 3
zuijiang/mistral-alpaca-clean • Viewer • Updated Aug 25, 2024 • 51.8k • 54
zuijiang/ocr_vqa • Viewer • Updated May 30, 2024 • 208k • 38