AI & ML interests
None yet
Organizations
stefandi/ultrafeedback_dpo_v2 • Text Generation • 0.5B • Updated • 1
stefandi/cog_behavior_synthetic_sft_v2_step_850 • Text Generation • 0.5B • Updated • 1
stefandi/cog_behavior_synthetic_sft_v1_step_580 • Text Generation • 0.5B • Updated • 1
stefandi/smol_talk_sft_v2 • Text Generation • 0.5B • Updated • 1
stefandi/countdown_rloo_v3 • Text Generation • 0.5B • Updated • 5
stefandi/countdown_rloo_v2 • Text Generation • 0.5B • Updated • 1
stefandi/ultrafeedback_dpo_v1 • Text Generation • 0.5B • Updated • 2
stefandi/countdown_rloo_v1 • Text Generation • 0.5B • Updated • 3
stefandi/cog_behavior_sft_v2 • Text Generation • 0.5B • Updated • 1
stefandi/smol_talk_sft_v1 • Text Generation • 0.5B • Updated • 3
stefandi/cog_behavior_sft_v1 • Text Generation • 0.5B • Updated • 4
stefandi/ppo-LunarLander-v2 • Reinforcement Learning • Updated
stefandi/bert-base-uncased-finetuned-medqa-2 • Multiple Choice • 0.1B • Updated
stefandi/bert-base-uncased-finetuned-medqa-finetuned-medqa • Multiple Choice • 0.1B • Updated • 1
stefandi/bert-base-uncased-finetuned-swag • Multiple Choice • 0.1B • Updated
stefandi/Medical-NER-finetuned-medqa • Updated
stefandi/bert-base-uncased-finetuned-medqa • Multiple Choice • 0.1B • Updated
stefandi/bert-finetuned-squad • Question Answering • 0.1B • Updated
Question Answering • 66.4M • Updated • 1