Commit History

fix: Tune dataset gen params and improve training checkpoint/resume
75eda59

marcos Claude Opus 4.5 commited on

feat: Add full fine-tuning (no LoRA) and dataset generation tools
cbe0918

marcos Claude Opus 4.5 commited on

feat: Refactor training with SOLID principles and add optimizations
e20f447

marcos Claude Opus 4.5 commited on

feat: Add multi-GPU support and lmdeploy for faster TTS
5cc96de

marcos Claude Opus 4.5 commited on

feat: Add H200/Hopper support and VRAM-adaptive batch sizing
9767b87

marcos Claude Opus 4.5 commited on

feat: Replace Matcha with Soprano TTS and add inference pipeline
32fdbcd

marcos Claude Opus 4.5 commited on

fix: Update dependencies and tokenizer handling for dataset generation
512101b

marcos Claude Opus 4.5 commited on

Add gradient checkpointing auto-detection to Stage 1 and Stage 2
d2911f1

marcos Claude Opus 4.5 commited on

feat: Add gradient checkpointing to Stage 2 training
37f91f0

marcos Claude Opus 4.5 commited on

perf: Increase TTS_BATCH_SIZE to 200 for better throughput
2a5a768

marcos Claude Opus 4.5 commited on

feat: Pre-compute text_tokens in dataset generation
9258ac3

marcos Claude Opus 4.5 commited on

chore: Remove test files from datasets directory
1dafe0a

marcos Claude Opus 4.5 commited on

feat: Async dataset pipeline with adaptive memory management
0f2f953

marcos Claude Opus 4.5 commited on

Initial commit: Speech-to-Speech dataset generation pipeline
41d51d1

marcos Claude Opus 4.5 commited on