Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
samhitha harish
samhitha2601
Follow
samhitha harish
AI & ML interests
computer vision,mistralai,rag
Organizations
None yet
samhitha2601
's models
58
Sort: Recently updated
samhitha2601/llama3-gsm8k-critic
3B
•
Updated
Oct 24
•
4
samhitha2601/llama3.2-3b-ppo-critic
Reinforcement Learning
•
Updated
Oct 23
•
6
samhitha2601/llama3.2-3b-ppo
Reinforcement Learning
•
Updated
Oct 23
•
4
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step12
3B
•
Updated
Oct 21
•
22
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step462
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step442
Text Generation
•
3B
•
Updated
Oct 21
•
7
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step422
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step402
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step382
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step362
Text Generation
•
3B
•
Updated
Oct 21
•
4
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step52
Updated
Oct 21
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step342
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step50
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step48
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step322
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step46
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step302
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step44
Text Generation
•
3B
•
Updated
Oct 21
•
5
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step282
Text Generation
•
3B
•
Updated
Oct 21
•
7
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step262
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step42
Text Generation
•
3B
•
Updated
Oct 21
•
7
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step40
Text Generation
•
3B
•
Updated
Oct 21
•
8
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step242
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step222
Text Generation
•
3B
•
Updated
Oct 21
•
6
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step38
Text Generation
•
3B
•
Updated
Oct 21
•
5
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step202
Text Generation
•
3B
•
Updated
Oct 21
•
5
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step36
Text Generation
•
3B
•
Updated
Oct 21
•
7
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step182
Text Generation
•
3B
•
Updated
Oct 21
•
7
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-step34
Text Generation
•
3B
•
Updated
Oct 21
•
5
samhitha2601/llama-3.2-3b-gsm8k-ppo-verl-2-step162
Text Generation
•
3B
•
Updated
Oct 21
•
6
Previous
1
2
Next