tortoise-base / src /utils.py

Commit History

modified logic to determine valid voice folders, also allows subdirs within the folder (for example: ./voices/SH/james/ will be named SH/james)
faa8da1

mrq commited on

should fix #203
02beb1d

mrq commited on

a bunch of shit i had uncommited over the past while pertaining to VALL-E
d8b9969

mrq commited on

added VALL-E inference support (very rudimentary, gimped, but it will load a model trained on a config generated through the web UI)
4744120

mrq commited on

only include auto in the list of models under setting, nothing else
9b01377

mrq commited on

added mixing models (shamelessly inspired from voldy's web ui)
f66281f

mrq commited on

fixes #176
c89c648

mrq commited on

for real this time show those new vall-e metrics
41d47c7

mrq commited on

added showing reported training accuracy and eval/validation metrics to graph
c4ca04c

mrq commited on

now there should be feature parity between trainers
8c647c8

mrq commited on

x_lim and y_lim for graph
fd9b2e0

mrq commited on

actually make parsing VALL-E metrics work
9856db5

mrq commited on

I forget
69d84bb

mrq commited on

my sanitizer actually did work, it was just batch sizes leading to problems when transcribing
444bcda

mrq commited on

when the sanitizer thingy works in testing but it doesn't outside of testing, and you have to retranscribe for the fourth time today
a6daf28

mrq commited on

why does this keep happening to me
86589ff

mrq commited on

more cleanup, use 24KHz for preparing for VALL-E (encodec will resample to 24Khz anyways, makes audio a little nicer), some other things
0ea93a7

mrq commited on

remove redundant phonemize for vall-e (oops), quantize all files and then phonemize all files for cope optimization, load alignment model once instead of for every transcription (speedup with whisperx)
d2a9ab9

mrq commited on

do not write current whisper.json if there's no changes
19c0854

mrq commited on

added whisper transcription 'sanitizing' (collapse very short transcriptions to the previous segment) (I really have to stop having several copies spanning several machines for AIVC, I keep reverting shit)
932eacc

mrq commited on

disable diarization for whisperx as it's just a useless performance hit (I don't have anything that's multispeaker within the same audio file at the moment)
736cdc8

mrq commited on

ugh
aa5bdaf

mrq commited on

now whisperx should output json that aligns with what's expected
13605f9

mrq commited on

fixes for whisperx batching
8877960

mrq commited on

begrudgingly added back whisperx integration (VAD/Diarization testing, I really, really need accurate timestamps before dumping mondo amounts of time on training a dataset)
4056a27

mrq commited on

Fixed #167
b8c3c4c

mrq commited on

cleanups, realigning vall-e training
f822c87

mrq commited on

ugh
909325b

mrq commited on

Added option to unsqueeze sample batches after sampling
5a5fd9c

mrq commited on

forgot to not require it to be relative
2e33bf0

mrq commited on

option to set results folder location
5cb8610

mrq commited on

tweaks
da9b4b5

mrq commited on

brain worms
f448959

mrq commited on

yammed
f34cc38

mrq commited on

yammed
96b7f9d

mrq commited on

cleanup, metrics are grabbed for vall-e trainer
249c601

mrq commited on

forgot to separate phonemes by spaces for [redacted]
1b72d0b

mrq commited on

cleaned up some prepare dataset code
d4c5096

mrq commited on

setup bnb on windows as needed
0b62ccc

mrq commited on

unk hunting
1a8c5de

mrq commited on

fixes v2
46ff3c4

mrq commited on

fixed reload tts being broken due to being as untouched as I am
0408d44

mrq commited on

yammed
aeb904a

mrq commited on

fixes
f9154c4

mrq commited on

ops
54f2fc7

mrq commited on

ops
0a7d6f0

mrq commited on

I forgot I undid the thing in DLAS
4ac43fa

mrq commited on

oops
da4f926

mrq commited on

preparations for training an IPA-based finetune
ee8270b

mrq commited on

fixed not cleaning up states while training (oops)
7b80f7a

mrq commited on