Add server deployment instructions and cross-platform training fixes 937d0e8 outlawmold commited on about 1 month ago
Add MPS training stability fixes and experiment logs 19655a1 outlawmold commited on about 1 month ago
Add bnb_optimizer and configurable batch params to training scripts e76ab40 outlawmold commited on May 2
Fix critical issues, migrate to IndicF5 fine-tuning, update pipeline dd75f48 outlawmold commited on May 1
Add auto next-batch launchers for Mac/Windows and cache-safe Mac training defaults 075e81a outlawmold commited on Apr 30
Fix: remove --finetune flag (vocab size mismatch with pretrained base), train from scratch with custom Sinhala vocab cb99d69 verified outlawmold commited on Apr 30
Update finetune script: epochs=20, batch_size=600, tuned for RTX 4050 6GB 0e8bc92 verified outlawmold commited on Apr 30
Add safe finetune launcher and latest training artifacts (HF-compatible) 5bed4d1 outlawmold commited on Apr 30
Add local training launch script (Mac MPS + Linux CUDA) e6e5d50 verified outlawmold commited on Apr 30
Add F5-TTS data prep script (Sinhala Arrow builder, bypasses pinyin) 76d096a verified outlawmold commited on Apr 30
Add start-index support to HF CC runner for non-overlapping batches 3a7dac7 outlawmold commited on Apr 30
Add resilient CC batching, hybrid filtering, and 10-video training prep workflow 718d352 outlawmold commited on Apr 30
Add CC-based pipeline script (v5) — uses YouTube auto-generated captions instead of local ASR" 8fba19c verified outlawmold commited on Apr 29
Replace train_f5tts.py with complete fine-tuning script (vocab gen + data prep + training + inference) cb6d98d verified outlawmold commited on Apr 28
feat(macos): implement Apple Silicon optimizations and switch to wav2vec2 ASR 1a2a2b3 outlawmold commited on Apr 28
fix: switch to faster-whisper with local CT2 model for stable ASR 2a9a208 outlawmold commited on Apr 28
Pipeline v3: Multi-ASR backend (whisper-hf, MMS, faster-whisper), anti-hallucination, default to Lingalingeswaran/whisper-small-sinhala_v3 7f82ad6 verified outlawmold commited on Apr 28
Pipeline v3: Multi-ASR backend (whisper-hf, MMS, faster-whisper), anti-hallucination, default to Lingalingeswaran/whisper-small-sinhala_v3 0f417a9 verified outlawmold commited on Apr 28
Add ASR model comparison test script (MMS, Whisper fine-tunes, Whisper large-v3) 4ed7a6b verified outlawmold commited on Apr 28
Finalize optimized pipeline with fine-tuned Whisper and model files 6990c3a outlawmold commited on Apr 28
Add local_pipeline.py — laptop-optimized data processing (GTX 4050 6GB VRAM) 1e3cb7c verified outlawmold commited on Apr 27
Optimize download_and_upload.py: add batch cleanup and fix encoding bd33bfc outlawmold commited on Apr 26
Add Phase 1: local download & upload script (Option C hybrid approach) 1225d5f verified outlawmold commited on Apr 26