liamoon-ai-team/unsloth-llama-3.3-70b-4bit-packing-grpo_with_dpo_mt_and_foundational_stages_20250912
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-4bit-packing-dpo_multi_turn_with_foundation_stage_20250910
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-4bit-packing-dpo-single-turn-with-sft-psychology-and-chats-20250903
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-packing-sft-books-and-chats-20250829-v2
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-packing-sft-books-and-chats-20250829
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-packing-sft-psychology-20250826_v4
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-packing-sft-psychology-20250826_v3
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-packing-sft-psychology-20250826_v2
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-packing-sft-psychology-20250826
Updated
liamoon-ai-team/Qwen2.5-14B-llm-judge-20250820
15B
•
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-4bit-dpo-grpo-august18-v6
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-4bit-grpo-only-august17-v1
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-4bit-dpo-grpo-august11
Updated
liamoon-ai-team/ray-serve-prod
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-4bit-packing-dpo-july17-v3
Updated
liamoon-ai-team/unsloth-llama-3.3-70b-4bit-packing-dpo-july17-v2
Updated
liamoon-ai-team/16-07-2025-stage-4
Text Generation
•
71B
•
Updated
•
1
liamoon-ai-team/Llama-70b-4bit-bantutan-quantized-bold-instruct-packing-multiturn-3epochs-fulldataset-2025-07-07
Updated
liamoon-ai-team/7_phases_dataset
Updated