·
AI & ML interests
None yet
Organizations
stefandi/ultrafeedback_dpo_v2
Text Generation
• 0.5B • Updated • 1
stefandi/cog_behavior_synthetic_sft_v2_step_850
Text Generation
• 0.5B • Updated • 7
stefandi/cog_behavior_synthetic_sft_v1_step_580
Text Generation
• 0.5B • Updated • 1
stefandi/smol_talk_sft_v2
Text Generation
• 0.5B • Updated • 1
stefandi/countdown_rloo_v3
Text Generation
• 0.5B • Updated • 2
• stefandi/countdown_rloo_v2
Text Generation
• 0.5B • Updated • 1
stefandi/ultrafeedback_dpo_v1
Text Generation
• 0.5B • Updated • 2
stefandi/countdown_rloo_v1
Text Generation
• 0.5B • Updated • 1
stefandi/cog_behavior_sft_v2
Text Generation
• 0.5B • Updated • 1
stefandi/smol_talk_sft_v1
Text Generation
• 0.5B • Updated • 2
stefandi/cog_behavior_sft_v1
Text Generation
• 0.5B • Updated • 3
• stefandi/ppo-LunarLander-v2
Reinforcement Learning
• Updated • 2
stefandi/bert-base-uncased-finetuned-medqa-2
Multiple Choice
• 0.1B • Updated • 2
stefandi/bert-base-uncased-finetuned-medqa-finetuned-medqa
Multiple Choice
• 0.1B • Updated stefandi/bert-base-uncased-finetuned-swag
Multiple Choice
• 0.1B • Updated • 3
stefandi/Medical-NER-finetuned-medqa
Updated
stefandi/bert-base-uncased-finetuned-medqa
Multiple Choice
• 0.1B • Updated stefandi/bert-finetuned-squad
Question Answering
• 0.1B • Updated • 2
Question Answering
• 66.4M • Updated