ILIA MIKHEEV

mikheevshow

·

AI & ML interests

None yet

Organizations

mikheevshow 's models 51

mikheevshow/DPO-reverse_kl_beta_5_0

Text Generation • 0.1B • Updated Mar 5, 2025 • 2

mikheevshow/DPO-reverse_kl_beta_1_0

Text Generation • 0.1B • Updated Mar 5, 2025 • 2

mikheevshow/DPO-reverse_kl_beta_0_05

Text Generation • 0.1B • Updated Mar 5, 2025 • 2

mikheevshow/DPO-reverse_kl_beta_0_1

Text Generation • 0.1B • Updated Mar 5, 2025 • 2

mikheevshow/DPO-js_divergence_beta_0_1

Text Generation • 0.1B • Updated Mar 5, 2025 • 2

mikheevshow/DPO-forward_kl_beta_0_1

Text Generation • 0.1B • Updated Mar 5, 2025 • 2

mikheevshow/DPO-alpha-divergence-alpha_0_5_beta_0_1

Text Generation • 0.1B • Updated Mar 5, 2025 • 2

mikheevshow/SFT-LOR-checkpoint-15532

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/SFT-LOR-checkpoint-12000

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/SFT-LOR-checkpoint-7500

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/SFT-LOR-checkpoint-3500

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/SFT-LOR-checkpoint-500

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/PRPO-checkpoint-15532

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/PRPO-checkpoint-12000

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/PRPO-checkpoint-7500

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/PRPO-checkpoint-4500

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 5

mikheevshow/SFT-checkpoint-1000

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/ORPO-checkpoint-15532

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/ORPO-checkpoint-11649

Feature Extraction • 0.1B • Updated Mar 4, 2025 • 3

mikheevshow/yolo8-fashion-4-ft

Updated Oct 27, 2024

mikheevshow/warp-reward-model

Text Classification • 65.8M • Updated Aug 6, 2024 • 2