kuririrn/qwen3-4b-agent-trajectory-SFT_alfadm-prmcons_alformat2 Text Generation • 4B • Updated 15 minutes ago
kuririrn/qwen3-4b-agent-trajectory_2stageSFT_alfadm_dbweek Text Generation • 4B • Updated about 19 hours ago
kuririrn/qwen25-7b-agent-trajectory_alf_admissible-lora-constraint_gen-dist_allign_base Text Generation • 8B • Updated about 20 hours ago
kuririrn/qwen25-7b-agent-trajectory_alf_admissible-lora-constraint_gen-dist_allign_a Text Generation • 8B • Updated 1 day ago
kuririrn/qwen25-7b-agent-trajectory_alf_admissible-lora-constraint_gen-dist_allign_b Text Generation • 8B • Updated 1 day ago
kuririrn/qwen25-7b-agent-trajectory_alf_admissible-lora-constraint_gen-dist_allign Text Generation • 8B • Updated 1 day ago
kuririrn/qwen3-4b-agent-trajectory_alf_admissible-lora-constraint_gen-dist_allign_c Text Generation • 4B • Updated 1 day ago
kuririrn/qwen3-4b-agent-trajectory_alf_admissible-lora-constraint_gen-dist_allign_b Text Generation • 4B • Updated 2 days ago
kuririrn/qwen3-4b-agent-trajectory_alf_admissible-lora-constraint_gen-dist_allign_a Text Generation • 4B • Updated 2 days ago
kuririrn/qwen3-4b-agent-trajectory_alfadm_dbweek-lora-constraint_gen-dist_allign_v3 Text Generation • 4B • Updated 2 days ago