Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Open to Collab
4
1
Kai Ye
Kyleyee
Follow
Eehan's profile picture
ValentinaZangirolami's profile picture
lucky-tomato1122's profile picture
8 followers
·
10 following
https://noncollapse.github.io/
AI & ML interests
None yet
Recent Activity
authored
a paper
10 days ago
Demystifying Group Relative Policy Optimization: Its Policy Gradient is a U-Statistic
upvoted
a
paper
13 days ago
Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text
authored
a paper
about 1 month ago
Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text
View all activity
Organizations
Kyleyee
's models
144
Sort: Recently updated
Kyleyee/Qwen-GRPO-1.5B-1024bs-MATH-G16
Updated
Dec 12, 2025
Kyleyee/Qwen-GRPO-1.5B-1024bs-MATH-G128
Updated
Dec 12, 2025
Kyleyee/Qwen-GRPO-7B-4096bs-MATH-G8
Updated
Dec 12, 2025
Kyleyee/Qwen-GRPO-7B-4096bs-MATH-G64
Updated
Dec 12, 2025
Kyleyee/Qwen-GRPO-7B-4096bs-MATH-G4
Updated
Dec 12, 2025
Kyleyee/Qwen-GRPO-7B-4096bs-MATH-G32
Updated
Dec 6, 2025
Kyleyee/Qwen-GRPO-7B-4096bs-MATH-G16
Updated
Dec 6, 2025
Kyleyee/Qwen-GRPO-7B-4096bs-MATH-G128
Updated
Dec 6, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-old-1epoch-MATH-G8
Updated
Dec 4, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-old-1epoch-MATH-G64
Updated
Dec 4, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-old-1epoch-MATH-G4
Updated
Dec 4, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-old-1epoch-MATH-G32
Updated
Dec 4, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-old-1epoch-MATH-G2
Updated
Dec 4, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-old-1epoch-MATH-G16
Updated
Dec 4, 2025
Kyleyee/Qwen-GRPO-7B-2048bs-MATH-G8
Updated
Dec 3, 2025
Kyleyee/Qwen-GRPO-7B-2048bs-MATH-G64
Updated
Dec 3, 2025
Kyleyee/Qwen-GRPO-7B-2048bs-MATH-G4
Updated
Dec 3, 2025
Kyleyee/Qwen-GRPO-7B-2048bs-MATH-G32
Updated
Dec 3, 2025
Kyleyee/Qwen-GRPO-7B-2048bs-MATH-G16
Updated
Dec 3, 2025
Kyleyee/Qwen-GRPO-7B-2048bs-MATH-G128
Updated
Dec 3, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-MATH-G64
Updated
Dec 1, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-MATH-G32
Updated
Dec 1, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-MATH-G8
Updated
Dec 1, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-MATH-G4
Updated
Dec 1, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-MATH-G16
Updated
Nov 30, 2025
Kyleyee/Qwen-GRPO-7B-1024bs-MATH-G128
Updated
Nov 30, 2025
Kyleyee/qwen2_5-1.5b-grpo-riaid_v4
Text Generation
•
2B
•
Updated
Oct 21, 2025
Kyleyee/qwen2_5-1.5b-grpo-riaid_v3
Text Generation
•
2B
•
Updated
Oct 20, 2025
Kyleyee/qwen2_5-1.5b-grpo-riaid_v2
Text Generation
•
2B
•
Updated
Oct 19, 2025
Kyleyee/qwen2_5-1.5b-grpo-riaid_v1
Text Generation
•
2B
•
Updated
Oct 19, 2025
Previous
1
2
3
4
5
Next