DP ref encoder uses compressed latent channels from config; README v2 onnx path; pytest smoke tests 8876cf1 notmax123 committed on 14 days ago
Hebrew dates: spell month with ordinal names (הראשון … השנים עשר) 920d18e notmax123 committed on 15 days ago
Expand DD/MM/YYYY-style dates before generic number expansion 9eebcc5 notmax123 committed on 15 days ago
Hebrew: expand a single-letter Hebrew prefix before Latin tokens (e.g. CPU-GPU) to a two-letter form 0ce399f notmax123 committed on 17 days ago
Add phonemes_for_display and strip language tags for user-facing text ff5e826 notmax123 committed on 17 days ago
Strip Hebrew gershayim/quote marks in abbreviations before synthesis 07b30a1 notmax123 committed on 17 days ago
Hebrew+Latin split phonemization; email/%/ratio/number prep; per-chunk phonemize; mixed pace blend b0a5a88 notmax123 committed on 17 days ago
Tuning: CFG 4.0; tighten steps (5-16) and speed (0.8-1.2) slider ranges 8960a88 notmax123 committed on 17 days ago
Strip Hebrew nikud; normalize 'any more'; default speed 0.95, CFG 3.5 c329763 notmax123 committed on 17 days ago
Download default female/male v2 voice JSONs from hub; restore style_ttl std filter; remove Refresh voices UI f448887 notmax123 committed on 18 days ago
Voices: require style_ttl+style_dp; add Refresh voices button; select new clone in dropdown c53219d notmax123 committed on 18 days ago
Gradio: drop CFG/pace sliders (use cfg 3, pace blend 0); fix bad indents in infer loop and helpers b062e61 notmax123 committed on 18 days ago
Clone export: reference audio cleanup and left-aligned ref mask; PT weights bundle marker and force download fd7eb4d notmax123 committed on 18 days ago
Strip XML-like tags with attributes; unify tag stripping for phonemize and char encode 822ce81 notmax123 committed on 18 days ago
Strip lang_list helper markup; restrict inline language tags in regex 32ed20e notmax123 committed on 18 days ago
download_models: ONNX bundle stamp to bust Hub file cache on rebuild 26ca59a notmax123 committed on 18 days ago
Pace blend for duration model; long-text Gradio slider; strip inline lang tags in phonemize cef5a6f notmax123 committed on 18 days ago
Centralize max synth chunk length in BLUE_SYNTH_MAX_CHUNK_LEN cdaa0f4 notmax123 committed on 19 days ago
Align CFG slider range with model; clean torchaudio imports in export 62277f6 notmax123 committed on 19 days ago
Prefer pt_models (.pt) checkpoints matching vendored architectures c1cb918 notmax123 committed on 20 days ago
Extract style_dp from reference WAV via DPNetwork.ref_encoder 7d51a93 notmax123 committed on 20 days ago
Clamp DP duration to text-length-based cap (fix runaway cloned voice duration) b6633d8 notmax123 committed on 20 days ago
Clamp predicted latent length below vector_estimator's ~1000-frame cap 0a1845d notmax123 committed on 20 days ago
Hard-split oversize chunks; lower max_len below vector_estimator's 1000-token cap fafbe00 notmax123 committed on 20 days ago
Voice clone: legacy state-dict remap + better error surfacing 11dd574 notmax123 committed on 20 days ago
Inline voice cloning: reference WAV directly on Synthesize tab (no JSON) c75032a notmax123 committed on 20 days ago
Skip DPNetwork in export; fall back to default-voice style_dp at runtime 69a2351 notmax123 committed on 20 days ago
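
Note on fafbe00 ("Hard-split oversize chunks"): the log does not show the repository's splitting code, so the following is only a minimal sketch of the general idea, assuming character-based lengths and whitespace-first splitting. The function name `split_chunks` and its `max_len` parameter are hypothetical and are not taken from the repository.

```python
def split_chunks(text: str, max_len: int) -> list[str]:
    """Split text into chunks of at most max_len characters.

    Hypothetical helper: words are packed greedily into chunks, and any
    single word longer than max_len is hard-split into fixed-size pieces.
    """
    if max_len < 1:
        raise ValueError("max_len must be positive")

    chunks: list[str] = []
    current = ""
    for word in text.split():
        # Hard-split a word that cannot fit in any chunk on its own.
        while len(word) > max_len:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:max_len])
            word = word[max_len:]

        # Greedy packing: extend the current chunk while it still fits.
        candidate = f"{current} {word}" if current else word
        if len(candidate) <= max_len:
            current = candidate
        else:
            chunks.append(current)
            current = word

    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    for chunk in split_chunks("aaa bbb ccc abcdefghij", 7):
        print(repr(chunk))
```

Keeping `max_len` strictly below the downstream model's limit, as the commit message describes, leaves headroom for anything appended to a chunk after splitting.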