Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
3
prathyusha guduru
Prathyusha101
Follow
AI & ML interests
None yet
Recent Activity
updated
a model
13 days ago
Prathyusha101/high_ent_critic_training_qwen7b_global_step_340
published
a model
13 days ago
Prathyusha101/high_ent_critic_training_qwen7b_global_step_340
updated
a model
13 days ago
Prathyusha101/low_ent_critic_training_qwen7b_global_step_340
View all activity
Organizations
None yet
Prathyusha101
's models
210
Sort: Recently updated
Prathyusha101/Qwen2.5-0.5B-PPO_WITHOUT_KL_global_step_200_ACTOR
0.6B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITHOUT_KL_global_step_100_CRITIC
0.5B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITHOUT_KL_global_step_100_ACTOR
0.6B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_585_CRITIC
0.5B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_585_ACTOR
0.6B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_500_CRITIC
0.5B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_500_ACTOR
0.6B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_400_CRITIC
0.5B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_400_ACTOR
0.6B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_300_CRITIC
0.5B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_300_ACTOR
0.6B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_200_CRITIC
0.5B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_200_ACTOR
0.6B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_100_CRITIC
0.5B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-0.5B-PPO_WITH_KL_global_step_100_ACTOR
0.6B
•
Updated
Sep 17, 2025
Prathyusha101/Qwen2.5-1.5B-RLOO_WITH_KL_global_step_500_ACTOR
2B
•
Updated
Sep 15, 2025
Prathyusha101/Qwen2.5-1.5B-GRPO_WITHOUT_KL_global_step_600_ACTOR
2B
•
Updated
Sep 13, 2025
•
1
Prathyusha101/Qwen2.5-1.5B-GRPO_WITH_KL_global_step_600_ACTOR
2B
•
Updated
Sep 13, 2025
•
1
Prathyusha101/Qwen2.5-1.5B-GRPO_WITHOUT_KL_global_step_500_ACTOR
2B
•
Updated
Sep 13, 2025
•
1
Prathyusha101/Qwen2.5-1.5B-GRPO_WITH_KL_global_step_500_ACTOR
2B
•
Updated
Sep 13, 2025
•
1
Prathyusha101/Qwen2.5-1.5B-PPO-with-KL_global_step_400_CRITIC
2B
•
Updated
Sep 13, 2025
Prathyusha101/Qwen2.5-1.5B-PPO-with-KL_global_step_400_ACTOR
2B
•
Updated
Sep 13, 2025
Prathyusha101/Qwen2.5-1.5B-PPO-with-KL_global_step_300_CRITIC
2B
•
Updated
Sep 13, 2025
Prathyusha101/Qwen2.5-1.5B-PPO-with-KL_global_step_300_ACTOR
2B
•
Updated
Sep 13, 2025
Prathyusha101/Qwen2.5-1.5B-PPO-with-KL_global_step_200_CRITIC
2B
•
Updated
Sep 13, 2025
Prathyusha101/Qwen2.5-1.5B-PPO-with-KL_global_step_200_ACTOR
2B
•
Updated
Sep 13, 2025
Prathyusha101/Qwen2.5-1.5B-PPO-with-KL_global_step_100_CRITIC
2B
•
Updated
Sep 13, 2025
Prathyusha101/Qwen2.5-1.5B-PPO-with-KL_global_step_100_ACTOR
2B
•
Updated
Sep 13, 2025
Prathyusha101/Qwen2.5-1.5B-PPO_global_step_900_CRITIC
2B
•
Updated
Sep 12, 2025
Prathyusha101/Qwen2.5-1.5B-PPO_global_step_900_ACTOR
2B
•
Updated
Sep 12, 2025
•
1
Previous
1
2
3
4
5
6
7
Next