AI & ML interests
None yet
Organizations
None yet
MMattaparthy/ppo_test_final
2B • Updated • 1
2B • Updated • 1
MMattaparthy/reward_model_test_final
2B • Updated • 1
2B • Updated • 1
MMattaparthy/reward_model_test
2B • Updated • 1
MMattaparthy/Qlora-gguf-model
3B • Updated • 1
MMattaparthy/lora-gguf-model
3B • Updated • 3
3B • Updated • 2
3B • Updated • 1
MMattaparthy/ppo_model_final
Text Generation
• 2B • Updated • 6
MMattaparthy/sft_reward_model_final
Text Classification
• 2B • Updated • 1
MMattaparthy/sft_rewardmodel_final
Text Classification
• 2B • Updated • 2
MMattaparthy/sft_finetined_final
Text Generation
• 2B • Updated • 4
2B • Updated • 2
MMattaparthy/qwen_reward_model_updated_sft
0.5B • Updated • 2
MMattaparthy/reward_model_qwen_65
0.5B • Updated • 1
MMattaparthy/reward_model_qwen_100
0.5B • Updated • 1
MMattaparthy/sft-qwen-chat-jsonl
Text Generation
• 2B • Updated • 1