modified logic to determine valid voice folders, also allows subdirs within the folder (for example: ./voices/SH/james/ will be named SH/james)
faa8da1
mrqcommited on
should fix #203
02beb1d
mrqcommited on
disable diarize button
8f3e944
mrqcommited on
a bunch of shit i had uncommited over the past while pertaining to VALL-E
d8b9969
mrqcommited on
Merge pull request 'Make convenient to use with Docker' (#191) from psr/ai-voice-cloning:docker into master
b785192
mrqcommited on
docker: add training script
9afafc6
psrcommited on
docker: add ffmpeg for whisper and general cleanup
c018bfc
psrcommited on
docker support
d64cba6
psrcommited on
#185
0440eac
mrqcommited on
fixes #185
9f64153
mrqcommited on
added VALL-E inference support (very rudimentary, gimped, but it will load a model trained on a config generated through the web UI)
4744120
mrqcommited on
only include auto in the list of models under setting, nothing else
9b01377
mrqcommited on
added mixing models (shamelessly inspired from voldy's web ui)
f66281f
mrqcommited on
fixes #176
c89c648
mrqcommited on
for real this time show those new vall-e metrics
41d47c7
mrqcommited on
added showing reported training accuracy and eval/validation metrics to graph
c4ca04c
mrqcommited on
now there should be feature parity between trainers
8c647c8
mrqcommited on
x_lim and y_lim for graph
fd9b2e0
mrqcommited on
actually make parsing VALL-E metrics work
9856db5
mrqcommited on
I forget
69d84bb
mrqcommited on
my sanitizer actually did work, it was just batch sizes leading to problems when transcribing
444bcda
mrqcommited on
when the sanitizer thingy works in testing but it doesn't outside of testing, and you have to retranscribe for the fourth time today
a6daf28
mrqcommited on
why does this keep happening to me
86589ff
mrqcommited on
more cleanup, use 24KHz for preparing for VALL-E (encodec will resample to 24Khz anyways, makes audio a little nicer), some other things
0ea93a7
mrqcommited on
remove redundant phonemize for vall-e (oops), quantize all files and then phonemize all files for cope optimization, load alignment model once instead of for every transcription (speedup with whisperx)
d2a9ab9
mrqcommited on
do not write current whisper.json if there's no changes
19c0854
mrqcommited on
added whisper transcription 'sanitizing' (collapse very short transcriptions to the previous segment) (I really have to stop having several copies spanning several machines for AIVC, I keep reverting shit)
932eacc
mrqcommited on
disable diarization for whisperx as it's just a useless performance hit (I don't have anything that's multispeaker within the same audio file at the moment)
736cdc8
mrqcommited on
ugh
aa5bdaf
mrqcommited on
now whisperx should output json that aligns with what's expected
13605f9
mrqcommited on
fixes for whisperx batching
8877960
mrqcommited on
begrudgingly added back whisperx integration (VAD/Diarization testing, I really, really need accurate timestamps before dumping mondo amounts of time on training a dataset)
4056a27
mrqcommited on
Fixed #167
b8c3c4c
mrqcommited on
oops
da96161
mrqcommited on
cleanups, realigning vall-e training
f822c87
mrqcommited on
ugh
909325b
mrqcommited on
Added option to unsqueeze sample batches after sampling
5a5fd9c
mrqcommited on
oops
9657c1d
mrqcommited on
DLAS is PIPified (but I'm still cloning it as a submodule to make updating it easier)
0c2a916
mrqcommited on
VALL-E config edits
34ef046
mrqcommited on
forgot to not require it to be relative
2e33bf0
mrqcommited on
option to set results folder location
5cb8610
mrqcommited on
doing what I do best: sourcing other configs and banging until it works (it doesnt work)