Add phonemes_for_display and strip language tags for user-facing text ff5e826 notmax123 commited on Apr 27
Strip Hebrew gershayim/quote marks in abbreviations before synthesis 07b30a1 notmax123 commited on Apr 27
Hebrew+Latin split phonemization; email/%/ratio/number prep; per-chunk phonemize; mixed pace blend b0a5a88 notmax123 commited on Apr 27
Tuning: CFG 4.0; tighten steps (5-16) and speed (0.8-1.2) slider ranges 8960a88 notmax123 commited on Apr 27
Strip Hebrew nikud; normalize 'any more'; default speed 0.95, CFG 3.5 c329763 notmax123 commited on Apr 27
Download default female/male v2 voice JSONs from hub; restore style_ttl std filter; remove Refresh voices UI f448887 notmax123 commited on Apr 26
Voices: require style_ttl+style_dp; add Refresh voices button; select new clone in dropdown c53219d notmax123 commited on Apr 26
Gradio: drop CFG/pace sliders (use cfg 3, pace blend 0); fix bad indents in infer loop and helpers b062e61 notmax123 commited on Apr 26
Clone export: reference audio cleanup and left-aligned ref mask; PT weights bundle marker and force download fd7eb4d notmax123 commited on Apr 26
Strip XML-like tags with attributes; unify tag stripping for phonemize and char encode 822ce81 notmax123 commited on Apr 26
Strip lang_list helper markup; restrict inline language tags in regex 32ed20e notmax123 commited on Apr 26
download_models: ONNX bundle stamp to bust Hub file cache on rebuild 26ca59a notmax123 commited on Apr 26
Pace blend for duration model; long-text Gradio slider; strip inline lang tags in phonemize cef5a6f notmax123 commited on Apr 26
Align CFG slider range with model; clean torchaudio imports in export 62277f6 notmax123 commited on Apr 25
Prefer pt_models (.pt) checkpoints matching vendored architectures c1cb918 notmax123 commited on Apr 24
Clamp DP duration to text-length-based cap (fix runaway cloned voice duration) b6633d8 notmax123 commited on Apr 24
Clamp predicted latent length below vector_estimator's ~1000-frame cap 0a1845d notmax123 commited on Apr 24
Hard-split oversize chunks; lower max_len below vector_estimator's 1000-token cap fafbe00 notmax123 commited on Apr 24
Inline voice cloning: reference WAV directly on Synthesize tab (no JSON) c75032a notmax123 commited on Apr 24
Skip DPNetwork in export; fall back to default-voice style_dp at runtime 69a2351 notmax123 commited on Apr 24
Auto-download PyTorch checkpoints from notmax123/blue on first clone f731e57 notmax123 commited on Apr 24
Clone tab: search fonts/pt_models with filename aliases; vendor models/ 863d06f notmax123 commited on Apr 24