Stefan D

stefandi

AI & ML interests

None yet

stefandi's models (19)

stefandi/ultrafeedback_dpo_v2

Text Generation • 0.5B • Updated Jun 9, 2025 • 3

stefandi/cog_behavior_synthetic_sft_v2_step_850

Text Generation • 0.5B • Updated Jun 8, 2025 • 3

stefandi/cog_behavior_synthetic_sft_v1_step_580

Text Generation • 0.5B • Updated Jun 8, 2025 • 3

stefandi/smol_talk_sft_v2

Text Generation • 0.5B • Updated Jun 8, 2025 • 3

stefandi/countdown_rloo_v3

Text Generation • 0.5B • Updated Jun 2, 2025 • 3

stefandi/countdown_rloo_v2

Text Generation • 0.5B • Updated Jun 2, 2025 • 3

stefandi/ultrafeedback_dpo_v1

Text Generation • 0.5B • Updated May 29, 2025 • 2

stefandi/countdown_rloo_v1

Text Generation • 0.5B • Updated May 29, 2025 • 3

stefandi/cog_behavior_sft_v2

Text Generation • 0.5B • Updated May 29, 2025 • 5

stefandi/smol_talk_sft_v1

Text Generation • 0.5B • Updated May 29, 2025 • 2

stefandi/cog_behavior_sft_v1

Text Generation • 0.5B • Updated May 28, 2025 • 1

stefandi/ppo-LunarLander-v2

Reinforcement Learning • Updated Mar 27, 2025

stefandi/bert-base-uncased-finetuned-medqa-2

Multiple Choice • 0.1B • Updated May 7, 2024 • 6

stefandi/bert-base-uncased-finetuned-medqa-finetuned-medqa

Multiple Choice • 0.1B • Updated May 4, 2024 • 5

stefandi/bert-base-uncased-finetuned-swag

Multiple Choice • 0.1B • Updated May 4, 2024 • 5

stefandi/Medical-NER-finetuned-medqa

Updated May 4, 2024

stefandi/bert-base-uncased-finetuned-medqa

Multiple Choice • 0.1B • Updated May 4, 2024 • 6

stefandi/bert-finetuned-squad

Question Answering • 0.1B • Updated Apr 30, 2024 • 6

stefandi/test_qa_model

Question Answering • 66.4M • Updated Apr 30, 2024 • 3