Hugging Face
3
prathyusha guduru
Prathyusha101
AI & ML interests
None yet
Recent Activity
updated a model 14 days ago: Prathyusha101/high_ent_critic_training_qwen7b_global_step_340
published a model 14 days ago: Prathyusha101/high_ent_critic_training_qwen7b_global_step_340
updated a model 14 days ago: Prathyusha101/low_ent_critic_training_qwen7b_global_step_340
Organizations
None yet
Prathyusha101's models: 210
Prathyusha101/Qwen2.5-7B-low_lr_ppo_with_kl_global_step_400_ACTOR • 8B • Updated Oct 14, 2025 • 1
Prathyusha101/Qwen2.5-7B-low_lr_ppo_with_kl_global_step_300_CRITIC • 8B • Updated Oct 14, 2025 • 1
Prathyusha101/Qwen2.5-7B-low_lr_ppo_with_kl_global_step_300_ACTOR • 8B • Updated Oct 14, 2025
Prathyusha101/Qwen2.5-7B-low_lr_ppo_with_kl_global_step_200_CRITIC • 8B • Updated Oct 14, 2025
Prathyusha101/Qwen2.5-7B-low_lr_ppo_with_kl_global_step_200_ACTOR • 8B • Updated Oct 14, 2025
Prathyusha101/Qwen2.5-7B-low_lr_ppo_with_kl_global_step_100_CRITIC • 8B • Updated Oct 14, 2025
Prathyusha101/Qwen2.5-7B-low_lr_ppo_with_kl_global_step_100_ACTOR • 8B • Updated Oct 14, 2025 • 1
Prathyusha101/Qwen2.5-7B-PPO_WITHOUT_KL_global_step_400_CRITIC • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITHOUT_KL_global_step_400_ACTOR • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITHOUT_KL_global_step_300_CRITIC • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITHOUT_KL_global_step_300_ACTOR • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITHOUT_KL_global_step_200_CRITIC • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITHOUT_KL_global_step_200_ACTOR • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITHOUT_KL_global_step_100_CRITIC • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITHOUT_KL_global_step_100_ACTOR • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITH_KL_global_step_300_CRITIC • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITH_KL_global_step_300_ACTOR • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITH_KL_global_step_200_CRITIC • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITH_KL_global_step_200_ACTOR • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITH_KL_global_step_100_CRITIC • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-7B-PPO_WITH_KL_global_step_100_ACTOR • 8B • Updated Oct 5, 2025
Prathyusha101/Qwen2.5-1.5B-RLOO_KL_0.1 • Updated Sep 18, 2025
Prathyusha101/Qwen2.5-0.5B-RLOO_WITH_KL_global_step_400_ACTOR • 0.6B • Updated Sep 18, 2025
Prathyusha101/Qwen2.5-0.5B-RLOO_WITH_KL_global_step_300_ACTOR • 0.6B • Updated Sep 18, 2025
Prathyusha101/Qwen2.5-0.5B-RLOO_WITH_KL_global_step_200_ACTOR • 0.6B • Updated Sep 18, 2025
Prathyusha101/Qwen2.5-0.5B-RLOO_WITH_KL_global_step_100_ACTOR • 0.6B • Updated Sep 18, 2025
Prathyusha101/Qwen2.5-0.5B-GRPO_WITH_KL_global_step_400_ACTOR • 0.6B • Updated Sep 18, 2025
Prathyusha101/Qwen2.5-0.5B-GRPO_WITH_KL_global_step_300_ACTOR • 0.6B • Updated Sep 18, 2025
Prathyusha101/Qwen2.5-0.5B-GRPO_WITH_KL_global_step_200_ACTOR • 0.6B • Updated Sep 18, 2025
Prathyusha101/Qwen2.5-0.5B-GRPO_WITH_KL_global_step_100_ACTOR • 0.6B • Updated Sep 18, 2025