Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Anna
ajjyy
Follow
maharshpatelx's profile picture
1 follower
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a model
6 days ago
ajjyy/school_of_reward_hacks_control_sft
published
a model
6 days ago
ajjyy/school_of_reward_hacks_control_sft
updated
a model
18 days ago
ajjyy/school_of_reward_hacks_sft
View all activity
Organizations
ajjyy
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
6 days ago
ajjyy/school_of_reward_hacks_control_sft
Updated
6 days ago
published
a model
6 days ago
ajjyy/school_of_reward_hacks_control_sft
Updated
6 days ago
updated
a model
18 days ago
ajjyy/school_of_reward_hacks_sft
Updated
18 days ago
published
a model
18 days ago
ajjyy/school_of_reward_hacks_sft
Updated
18 days ago
updated
2 models
9 months ago
ajjyy/Qwen2-0.5B-GRPO-Curiosity-attempt2-cp2380
Text Generation
•
0.5B
•
Updated
Aug 13, 2025
•
6
ajjyy/Qwen2-0.5B-PPO-gsm8k-attempt5
Updated
Aug 13, 2025
published
a model
9 months ago
ajjyy/Qwen2-0.5B-GRPO-Curiosity-attempt2-cp2380
Text Generation
•
0.5B
•
Updated
Aug 13, 2025
•
6
updated
a model
9 months ago
ajjyy/Qwen2-0.5B-PPO-Curiosity-gsm8k-attempt4
Updated
Aug 13, 2025
published
a model
9 months ago
ajjyy/Qwen2-0.5B-PPO-gsm8k-attempt5
Updated
Aug 13, 2025
updated
a model
9 months ago
ajjyy/Qwen2-0.5B-GRPO-Curiosity-attempt2
Text Generation
•
0.5B
•
Updated
Aug 13, 2025
•
4
published
2 models
9 months ago
ajjyy/Qwen2-0.5B-PPO-Curiosity-gsm8k-attempt4
Updated
Aug 13, 2025
ajjyy/Qwen2-0.5B-GRPO-Curiosity-attempt2
Text Generation
•
0.5B
•
Updated
Aug 13, 2025
•
4
updated
2 models
9 months ago
ajjyy/Qwen2-0.5B-GRPO-Curiosity-attempt1-cp2330
Text Generation
•
0.5B
•
Updated
Aug 11, 2025
•
3
ajjyy/Qwen2-0.5B-GRPO-attempt1-cp3180
Text Generation
•
0.5B
•
Updated
Aug 11, 2025
•
3
published
2 models
9 months ago
ajjyy/Qwen2-0.5B-GRPO-Curiosity-attempt1-cp2330
Text Generation
•
0.5B
•
Updated
Aug 11, 2025
•
3
ajjyy/Qwen2-0.5B-GRPO-attempt1-cp3180
Text Generation
•
0.5B
•
Updated
Aug 11, 2025
•
3
updated
2 models
9 months ago
ajjyy/Qwen2-0.5B-GRPO-Curiosity-attempt1
Text Generation
•
0.5B
•
Updated
Aug 11, 2025
•
4
ajjyy/Qwen2-0.5B-GRPO-attempt1
Text Generation
•
0.5B
•
Updated
Aug 11, 2025
•
8
published
2 models
9 months ago
ajjyy/Qwen2-0.5B-GRPO-Curiosity-attempt1
Text Generation
•
0.5B
•
Updated
Aug 11, 2025
•
4
ajjyy/Qwen2-0.5B-GRPO-attempt1
Text Generation
•
0.5B
•
Updated
Aug 11, 2025
•
8
Load more