Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
16
4
zuijiang
zuijiang
Follow
BlackflashJKL's profile picture
xinyan233333's profile picture
Diluner's profile picture
3 followers
·
4 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 hour ago
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs
upvoted
a
paper
about 16 hours ago
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
upvoted
a
paper
1 day ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
View all activity
Organizations
zuijiang
's models
1
Sort: Recently updated
zuijiang/llava-qwen1.5-14B-chat
Text Generation
•
15B
•
Updated
Jul 1, 2024