Spaces:

Duplicated from notmax123/BlueV2

ivrit-ai
/

BlueV2

Sleeping

App Files Files Community

Commit History

Hebrew: expand 'ל' before Latin tokens (e.g. CPU-GPU) to 'אל'

0ce399f

notmax123 commited on Apr 27

Lower BLUE_SYNTH_MAX_CHUNK_LEN to 200

9010a30

notmax123 commited on Apr 27

Add phonemes_for_display and strip language tags for user-facing text

ff5e826

notmax123 commited on Apr 27

Strip Hebrew gershayim/quote marks in abbreviations before synthesis

07b30a1

notmax123 commited on Apr 27

Hebrew+Latin split phonemization; email/%/ratio/number prep; per-chunk phonemize; mixed pace blend

b0a5a88

notmax123 commited on Apr 27

Tuning: CFG 4.0; tighten steps (5-16) and speed (0.8-1.2) slider ranges

8960a88

notmax123 commited on Apr 27

Strip Hebrew nikud; normalize 'any more'; default speed 0.95, CFG 3.5

c329763

notmax123 commited on Apr 27

Download default female/male v2 voice JSONs from hub; restore style_ttl std filter; remove Refresh voices UI

f448887

notmax123 commited on Apr 26

Voices: require style_ttl+style_dp; add Refresh voices button; select new clone in dropdown

c53219d

notmax123 commited on Apr 26

Gradio: drop CFG/pace sliders (use cfg 3, pace blend 0); fix bad indents in infer loop and helpers

b062e61

notmax123 commited on Apr 26

Clone export: reference audio cleanup and left-aligned ref mask; PT weights bundle marker and force download

fd7eb4d

notmax123 commited on Apr 26

Strip XML-like tags with attributes; unify tag stripping for phonemize and char encode

822ce81

notmax123 commited on Apr 26

Strip lang_list helper markup; restrict inline language tags in regex

32ed20e

notmax123 commited on Apr 26

download_models: ONNX bundle stamp to bust Hub file cache on rebuild

26ca59a

notmax123 commited on Apr 26

Restore vocoder edge trim after each chunk

c4afdc5

notmax123 commited on Apr 26

Pace blend for duration model; long-text Gradio slider; strip inline lang tags in phonemize

cef5a6f

notmax123 commited on Apr 26

Update README.md

da673b0
verified

notmax123 commited on Apr 26

Update README.md

70795a6
verified

notmax123 commited on Apr 26

Update README.md

0489362
verified

notmax123 commited on Apr 26

Restore vocoder edge trim before waveform return

bb02d02

notmax123 commited on Apr 25

Post-process TTS: gentle RMS boost; drop vocoder edge trim

12a2ba3

notmax123 commited on Apr 25

Centralize max synth chunk length in BLUE_SYNTH_MAX_CHUNK_LEN

cdaa0f4

notmax123 commited on Apr 25

Align CFG slider range with model; clean torchaudio imports in export

62277f6

notmax123 commited on Apr 25

Resample reference WAVs with torchaudio for voice export

5f1d4a9

notmax123 commited on Apr 25

Improve voice clone UX with cache and Gradio progress

fa68581

notmax123 commited on Apr 25

Simplify Gradio voice cloning and auto-run synthesis

56e7960

notmax123 commited on Apr 25

Use v2 model repositories for Space runtime

615a636

notmax123 commited on Apr 25

Simplify uploaded voice style export

35eb04b

notmax123 commited on Apr 24

Force uploaded voice extraction to use PT checkpoints

eea5c89

notmax123 commited on Apr 24

Return uploaded WAV voice styles directly from exporter

e6dba6f

notmax123 commited on Apr 24

Use root tts config for voice export normalizers

6d44c5f

notmax123 commited on Apr 24

Make uploaded reference audio an explicit voice source

fe9f63a

notmax123 commited on Apr 24

Align voice cloning path with reference exporter

e699db1

notmax123 commited on Apr 24

Prefer pt_models (.pt) checkpoints matching vendored architectures

c1cb918

notmax123 commited on Apr 24

Infer DPNetwork vocab_size from checkpoint embedding

604ce7b

notmax123 commited on Apr 24

Extract style_dp from reference WAV via DPNetwork.ref_encoder

7d51a93

notmax123 commited on Apr 24

Clamp DP duration to text-length-based cap (fix runaway cloned voice duration)

b6633d8

notmax123 commited on Apr 24

Load vocab.json from repo root (fix noise output)

b0dee38

notmax123 commited on Apr 24

Auto-detect vocoder input name (z_pred vs latent)

adc2859

notmax123 commited on Apr 24

Clamp predicted latent length below vector_estimator's ~1000-frame cap

0a1845d

notmax123 commited on Apr 24

Hard-split oversize chunks; lower max_len below vector_estimator's 1000-token cap

fafbe00

notmax123 commited on Apr 24

Voice clone: legacy state-dict remap + better error surfacing

11dd574

notmax123 commited on Apr 24

Install bluecodec package so LatentEncoder is importable

06f0962

notmax123 commited on Apr 24

Inline voice cloning: reference WAV directly on Synthesize tab (no JSON)

c75032a

notmax123 commited on Apr 24

Add torch/torchaudio/librosa/safetensors for voice cloning

24f1c53

notmax123 commited on Apr 24

Skip DPNetwork in export; fall back to default-voice style_dp at runtime

69a2351

notmax123 commited on Apr 24

Auto-download PyTorch checkpoints from notmax123/blue on first clone

f731e57

notmax123 commited on Apr 24

Fix indentation errors in UI block

58aa51a

notmax123 commited on Apr 24

Clone tab: search fonts/pt_models with filename aliases; vendor models/

863d06f

notmax123 commited on Apr 24

Add requirements.txt for HF Space build

4818895

notmax123 commited on Apr 24