Commit History

DP ref encoder uses compressed latent channels from config; README v2 onnx path; pytest smoke tests
8876cf1
Running

notmax123 commited on

Hebrew dates: spell month with ordinal names (ืœืจืืฉื•ืŸ โ€ฆ ืœืฉื ื™ื ืขืฉืจ)
920d18e

notmax123 commited on

Hebrew: format expanded dates as 'D ืœ-M Y'
4e7bf3c

notmax123 commited on

Expand DD/MM/YYYY-style dates before generic number expansion
9eebcc5

notmax123 commited on

Hebrew: expand 'ืœ' before Latin tokens (e.g. CPU-GPU) to 'ืืœ'
0ce399f

notmax123 commited on

Lower BLUE_SYNTH_MAX_CHUNK_LEN to 200
9010a30

notmax123 commited on

Add phonemes_for_display and strip language tags for user-facing text
ff5e826

notmax123 commited on

Strip Hebrew gershayim/quote marks in abbreviations before synthesis
07b30a1

notmax123 commited on

Hebrew+Latin split phonemization; email/%/ratio/number prep; per-chunk phonemize; mixed pace blend
b0a5a88

notmax123 commited on

Tuning: CFG 4.0; tighten steps (5-16) and speed (0.8-1.2) slider ranges
8960a88

notmax123 commited on

Strip Hebrew nikud; normalize 'any more'; default speed 0.95, CFG 3.5
c329763

notmax123 commited on

Download default female/male v2 voice JSONs from hub; restore style_ttl std filter; remove Refresh voices UI
f448887

notmax123 commited on

Voices: require style_ttl+style_dp; add Refresh voices button; select new clone in dropdown
c53219d

notmax123 commited on

Gradio: drop CFG/pace sliders (use cfg 3, pace blend 0); fix bad indents in infer loop and helpers
b062e61

notmax123 commited on

Clone export: reference audio cleanup and left-aligned ref mask; PT weights bundle marker and force download
fd7eb4d

notmax123 commited on

Strip XML-like tags with attributes; unify tag stripping for phonemize and char encode
822ce81

notmax123 commited on

Strip lang_list helper markup; restrict inline language tags in regex
32ed20e

notmax123 commited on

download_models: ONNX bundle stamp to bust Hub file cache on rebuild
26ca59a

notmax123 commited on

Restore vocoder edge trim after each chunk
c4afdc5

notmax123 commited on

Pace blend for duration model; long-text Gradio slider; strip inline lang tags in phonemize
cef5a6f

notmax123 commited on

Update README.md
da673b0
verified

notmax123 commited on

Update README.md
70795a6
verified

notmax123 commited on

Update README.md
0489362
verified

notmax123 commited on

Restore vocoder edge trim before waveform return
bb02d02

notmax123 commited on

Post-process TTS: gentle RMS boost; drop vocoder edge trim
12a2ba3

notmax123 commited on

Centralize max synth chunk length in BLUE_SYNTH_MAX_CHUNK_LEN
cdaa0f4

notmax123 commited on

Align CFG slider range with model; clean torchaudio imports in export
62277f6

notmax123 commited on

Resample reference WAVs with torchaudio for voice export
5f1d4a9

notmax123 commited on

Improve voice clone UX with cache and Gradio progress
fa68581

notmax123 commited on

Simplify Gradio voice cloning and auto-run synthesis
56e7960

notmax123 commited on

Use v2 model repositories for Space runtime
615a636

notmax123 commited on

Simplify uploaded voice style export
35eb04b

notmax123 commited on

Force uploaded voice extraction to use PT checkpoints
eea5c89

notmax123 commited on

Return uploaded WAV voice styles directly from exporter
e6dba6f

notmax123 commited on

Use root tts config for voice export normalizers
6d44c5f

notmax123 commited on

Make uploaded reference audio an explicit voice source
fe9f63a

notmax123 commited on

Align voice cloning path with reference exporter
e699db1

notmax123 commited on

Prefer pt_models (.pt) checkpoints matching vendored architectures
c1cb918

notmax123 commited on

Infer DPNetwork vocab_size from checkpoint embedding
604ce7b

notmax123 commited on

Extract style_dp from reference WAV via DPNetwork.ref_encoder
7d51a93

notmax123 commited on

Clamp DP duration to text-length-based cap (fix runaway cloned voice duration)
b6633d8

notmax123 commited on

Load vocab.json from repo root (fix noise output)
b0dee38

notmax123 commited on

Auto-detect vocoder input name (z_pred vs latent)
adc2859

notmax123 commited on

Clamp predicted latent length below vector_estimator's ~1000-frame cap
0a1845d

notmax123 commited on

Hard-split oversize chunks; lower max_len below vector_estimator's 1000-token cap
fafbe00

notmax123 commited on

Voice clone: legacy state-dict remap + better error surfacing
11dd574

notmax123 commited on

Install bluecodec package so LatentEncoder is importable
06f0962

notmax123 commited on

Inline voice cloning: reference WAV directly on Synthesize tab (no JSON)
c75032a

notmax123 commited on

Add torch/torchaudio/librosa/safetensors for voice cloning
24f1c53

notmax123 commited on

Skip DPNetwork in export; fall back to default-voice style_dp at runtime
69a2351

notmax123 commited on