·
AI & ML interests
None yet
Organizations
mikheevshow/DPO-reverse_kl_beta_5_0
Text Generation
• 0.1B • Updated mikheevshow/DPO-reverse_kl_beta_1_0
Text Generation
• 0.1B • Updated mikheevshow/DPO-reverse_kl_beta_0_05
Text Generation
• 0.1B • Updated mikheevshow/DPO-reverse_kl_beta_0_1
Text Generation
• 0.1B • Updated mikheevshow/DPO-js_divergence_beta_0_1
Text Generation
• 0.1B • Updated mikheevshow/DPO-forward_kl_beta_0_1
Text Generation
• 0.1B • Updated mikheevshow/DPO-alpha-divergence-alpha_0_5_beta_0_1
Text Generation
• 0.1B • Updated mikheevshow/SFT-LOR-checkpoint-15532
Feature Extraction
• 0.1B • Updated mikheevshow/SFT-LOR-checkpoint-12000
Feature Extraction
• 0.1B • Updated • 1
mikheevshow/SFT-LOR-checkpoint-7500
Feature Extraction
• 0.1B • Updated mikheevshow/SFT-LOR-checkpoint-3500
Feature Extraction
• 0.1B • Updated mikheevshow/SFT-LOR-checkpoint-500
Feature Extraction
• 0.1B • Updated • 1
mikheevshow/PRPO-checkpoint-15532
Feature Extraction
• 0.1B • Updated • 1
mikheevshow/PRPO-checkpoint-12000
Feature Extraction
• 0.1B • Updated • 1
mikheevshow/PRPO-checkpoint-7500
Feature Extraction
• 0.1B • Updated • 1
mikheevshow/PRPO-checkpoint-4500
Feature Extraction
• 0.1B • Updated mikheevshow/SFT-checkpoint-1000
Feature Extraction
• 0.1B • Updated • 1
mikheevshow/ORPO-checkpoint-15532
Feature Extraction
• 0.1B • Updated • 1
mikheevshow/ORPO-checkpoint-11649
Feature Extraction
• 0.1B • Updated • 1
mikheevshow/yolo8-fashion-4-ft
Updated
mikheevshow/warp-reward-model
Text Classification
• 65.8M • Updated