mikeogezi/data_wp_output_gpt_4o_mini_llama-3.2-3b-instruct_lora_16_sample_950_bsz_8 Updated Mar 29, 2025
mikeogezi/data_wp_output_gpt_4o_mini_llama-3.2-3b-instruct_lora_16_sample_500_bsz_8 Updated Mar 29, 2025
mikeogezi/data_wp_output_gpt_4o_mini_llama-3.3-70b-instruct-bnb-4bit_lora_16_sample_100 Updated Mar 29, 2025
mikeogezi/data_wp_output_gpt_4o_mini_llama-3.2-3b-instruct_lora_16_sample_100_bsz_8 Updated Mar 28, 2025
mikeogezi/Qwen2-VL-7B-GRPO-MMR-TrainedRationaleVerifier Image-to-Text • 8B • Updated Mar 28, 2025 • 9
mikeogezi/data_wp_output_gpt_4o_mini_llama-3.3-70b-instruct-bnb-4bit_lora_4_sample_100_siz_100 Updated Mar 27, 2025