Add assemble_corpus.py with combined text+audio multimodal support for Unsloth" 750e870 verified s23deepak commited on 20 days ago
Fix: Remove TeleAntiFraud from text pipeline β it's audio-only (Phase 2). Add clear Phase 1/2 notes." 633ce01 verified s23deepak commited on 20 days ago
Add assemble_corpus.py β merges 3 scam datasets into unified training corpus a3a75f6 verified s23deepak commited on 20 days ago
Fix: Unsloth requires formatting_func β use pre-tokenized text column instead of messages" c2b772b verified s23deepak commited on 22 days ago
Fix: dtype must be None (not string 'auto'). Unsloth only accepts None/torch.float16/torch.bfloat16" 6a0af86 verified s23deepak commited on 22 days ago
Fix: dtype must be None or torch.dtype, not string. Kaggle T4 supports fp16 auto-detected. 8ee26d5 verified s23deepak commited on 22 days ago
Add 8GB VRAM version of training script (RTX 5060 8GB / Kaggle T4) 9ce2077 verified s23deepak commited on 23 days ago
Add updated Unsloth SFT script for Gemma 4 E2B scam fine-tuning 61f9777 verified s23deepak commited on 23 days ago