RichardErkhov/GitBag_-_reasoning_rebel_rere_eta_1e4_lr_3e-7_1732288722-gguf 8B • Updated Jun 10, 2025 • 3
RichardErkhov/Jimmy19991222_-_llama-3-8b-instruct-gapo-v2-rouge2-beta10-1minus-gamma0.3-rerun-gguf 8B • Updated Jun 10, 2025 • 6
RichardErkhov/kaiwenw_-_nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_4-gguf 8B • Updated Jun 9, 2025 • 10
RichardErkhov/kaiwenw_-_nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_3-gguf 8B • Updated Jun 9, 2025 • 13
RichardErkhov/kaiwenw_-_nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_1-gguf 8B • Updated Jun 9, 2025 • 9
RichardErkhov/kaiwenw_-_nov22_lr_3e-6_lora_32_dropout_0.1_all_reject_first_ep_2-gguf 8B • Updated Jun 9, 2025 • 8
RichardErkhov/CompassioninMachineLearning_-_llama3.1_8b_tenK_unclean_pretrained-gguf 8B • Updated Jun 9, 2025 • 2.34k