Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
samhitha harish
samhitha2601
Follow
samhitha harish
AI & ML interests
computer vision,mistralai,rag
Organizations
None yet
samhitha2601
's models
58
Sort: Recently updated
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step32
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step142
Text Generation
•
3B
•
Updated
Oct 21
•
7
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step30
Text Generation
•
3B
•
Updated
Oct 21
•
4
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step122
Text Generation
•
3B
•
Updated
Oct 21
•
7
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step28
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step102
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step26
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step82
Text Generation
•
3B
•
Updated
Oct 21
•
5
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step24
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step22
Text Generation
•
3B
•
Updated
Oct 21
•
9
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step62
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step20
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step42
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step18
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step22
Text Generation
•
3B
•
Updated
Oct 21
•
7
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step16
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step2
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step14
Text Generation
•
3B
•
Updated
Oct 21
•
5
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step4
Text Generation
•
3B
•
Updated
Oct 21
•
7
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step2
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step10
Text Generation
•
3B
•
Updated
Oct 21
•
5
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step8
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step6
Text Generation
•
3B
•
Updated
Oct 21
•
7
samhitha2601/llama3.2-3b-gsm8k-ppo-fsdp
Updated
Oct 20
samhitha2601/llama3.2-3b-rl-all-checkpoints1
Updated
Oct 17
samhitha2601/llama3.2-3b-rl-all-checkpoints
Updated
Oct 17
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl
Text Generation
•
3B
•
Updated
Oct 17
•
9
samhitha2601/llama-3.2-3b-gsm8k-ppo
Text Generation
•
3B
•
Updated
Oct 12
•
8
Previous
1
2
Next