Spaces:

jailen
/

tortoise-base

Build error

App Files Files Community

tortoise-base

Commit History

More changes.

c853a45
unverified

Iñaki Marin commited on Apr 17, 2023

New Changes.

88b8900
unverified

Iñaki Marin commited on Apr 17, 2023

Update README.md

e0fd6bc

jailen commited on Apr 17, 2023

More stuff.

1172157
unverified

Iñaki Marin commited on Apr 17, 2023

modified logic to determine valid voice folders, also allows subdirs within the folder (for example: ./voices/SH/james/ will be named SH/james)

faa8da1

mrq commited on Apr 13, 2023

should fix #203

02beb1d

mrq commited on Apr 13, 2023

disable diarize button

8f3e944

mrq commited on Apr 12, 2023

a bunch of shit i had uncommited over the past while pertaining to VALL-E

d8b9969

mrq commited on Apr 12, 2023

Merge pull request 'Make convenient to use with Docker' (#191) from psr/ai-voice-cloning:docker into master

b785192

mrq commited on Apr 8, 2023

docker: add training script

9afafc6

psr commited on Apr 7, 2023

docker: add ffmpeg for whisper and general cleanup

c018bfc

psr commited on Apr 7, 2023

docker support

d64cba6

psr commited on Apr 5, 2023

#185

0440eac

mrq commited on Mar 31, 2023

fixes #185

9f64153

mrq commited on Mar 31, 2023

added VALL-E inference support (very rudimentary, gimped, but it will load a model trained on a config generated through the web UI)

4744120

mrq commited on Mar 31, 2023

only include auto in the list of models under setting, nothing else

9b01377

mrq commited on Mar 29, 2023

added mixing models (shamelessly inspired from voldy's web ui)

f66281f

mrq commited on Mar 29, 2023

fixes #176

c89c648

mrq commited on Mar 26, 2023

for real this time show those new vall-e metrics

41d47c7

mrq commited on Mar 26, 2023

added showing reported training accuracy and eval/validation metrics to graph

c4ca04c

mrq commited on Mar 26, 2023

now there should be feature parity between trainers

8c647c8

mrq commited on Mar 25, 2023

x_lim and y_lim for graph

fd9b2e0

mrq commited on Mar 25, 2023

actually make parsing VALL-E metrics work

9856db5

mrq commited on Mar 23, 2023

I forget

69d84bb

mrq commited on Mar 23, 2023

my sanitizer actually did work, it was just batch sizes leading to problems when transcribing

444bcda

mrq commited on Mar 23, 2023

when the sanitizer thingy works in testing but it doesn't outside of testing, and you have to retranscribe for the fourth time today

a6daf28

mrq commited on Mar 23, 2023

why does this keep happening to me

86589ff

mrq commited on Mar 23, 2023

more cleanup, use 24KHz for preparing for VALL-E (encodec will resample to 24Khz anyways, makes audio a little nicer), some other things

0ea93a7

mrq commited on Mar 23, 2023

remove redundant phonemize for vall-e (oops), quantize all files and then phonemize all files for cope optimization, load alignment model once instead of for every transcription (speedup with whisperx)

d2a9ab9

mrq commited on Mar 23, 2023

do not write current whisper.json if there's no changes

19c0854

mrq commited on Mar 22, 2023

added whisper transcription 'sanitizing' (collapse very short transcriptions to the previous segment) (I really have to stop having several copies spanning several machines for AIVC, I keep reverting shit)

932eacc

mrq commited on Mar 22, 2023

disable diarization for whisperx as it's just a useless performance hit (I don't have anything that's multispeaker within the same audio file at the moment)

736cdc8

mrq commited on Mar 22, 2023

ugh

aa5bdaf

mrq commited on Mar 22, 2023

now whisperx should output json that aligns with what's expected

13605f9

mrq commited on Mar 22, 2023

fixes for whisperx batching

8877960

mrq commited on Mar 22, 2023

begrudgingly added back whisperx integration (VAD/Diarization testing, I really, really need accurate timestamps before dumping mondo amounts of time on training a dataset)

4056a27

mrq commited on Mar 22, 2023

Fixed #167

b8c3c4c

mrq commited on Mar 22, 2023

oops

da96161

mrq commited on Mar 22, 2023

cleanups, realigning vall-e training

f822c87

mrq commited on Mar 22, 2023

ugh

909325b

mrq commited on Mar 21, 2023

Added option to unsqueeze sample batches after sampling

5a5fd9c

mrq commited on Mar 21, 2023

oops

9657c1d

mrq commited on Mar 21, 2023

DLAS is PIPified (but I'm still cloning it as a submodule to make updating it easier)

0c2a916

mrq commited on Mar 21, 2023

VALL-E config edits

34ef046

mrq commited on Mar 20, 2023

forgot to not require it to be relative

2e33bf0

mrq commited on Mar 19, 2023

option to set results folder location

5cb8610

mrq commited on Mar 19, 2023

doing what I do best: sourcing other configs and banging until it works (it doesnt work)

74510e8

mrq commited on Mar 18, 2023

tweaks

da9b4b5

mrq commited on Mar 18, 2023

brain worms

f448959

mrq commited on Mar 17, 2023

added japanese tokenizer (experimental)

b17260c

mrq commited on Mar 17, 2023

Commit History

More changes. c853a45 unverified

New Changes. 88b8900 unverified

Update README.md e0fd6bc

More stuff. 1172157 unverified

modified logic to determine valid voice folders, also allows subdirs within the folder (for example: ./voices/SH/james/ will be named SH/james) faa8da1

should fix #203 02beb1d

disable diarize button 8f3e944

a bunch of shit i had uncommited over the past while pertaining to VALL-E d8b9969

Merge pull request 'Make convenient to use with Docker' (#191) from psr/ai-voice-cloning:docker into master b785192

docker: add training script 9afafc6

docker: add ffmpeg for whisper and general cleanup c018bfc

docker support d64cba6

#185 0440eac

fixes #185 9f64153

added VALL-E inference support (very rudimentary, gimped, but it will load a model trained on a config generated through the web UI) 4744120

only include auto in the list of models under setting, nothing else 9b01377

added mixing models (shamelessly inspired from voldy's web ui) f66281f

fixes #176 c89c648

for real this time show those new vall-e metrics 41d47c7

added showing reported training accuracy and eval/validation metrics to graph c4ca04c

now there should be feature parity between trainers 8c647c8

x_lim and y_lim for graph fd9b2e0

actually make parsing VALL-E metrics work 9856db5

I forget 69d84bb

my sanitizer actually did work, it was just batch sizes leading to problems when transcribing 444bcda

when the sanitizer thingy works in testing but it doesn't outside of testing, and you have to retranscribe for the fourth time today a6daf28

why does this keep happening to me 86589ff

more cleanup, use 24KHz for preparing for VALL-E (encodec will resample to 24Khz anyways, makes audio a little nicer), some other things 0ea93a7

remove redundant phonemize for vall-e (oops), quantize all files and then phonemize all files for cope optimization, load alignment model once instead of for every transcription (speedup with whisperx) d2a9ab9

do not write current whisper.json if there's no changes 19c0854

added whisper transcription 'sanitizing' (collapse very short transcriptions to the previous segment) (I really have to stop having several copies spanning several machines for AIVC, I keep reverting shit) 932eacc

disable diarization for whisperx as it's just a useless performance hit (I don't have anything that's multispeaker within the same audio file at the moment) 736cdc8

ugh aa5bdaf

now whisperx should output json that aligns with what's expected 13605f9

fixes for whisperx batching 8877960

begrudgingly added back whisperx integration (VAD/Diarization testing, I really, really need accurate timestamps before dumping mondo amounts of time on training a dataset) 4056a27

Fixed #167 b8c3c4c

oops da96161

cleanups, realigning vall-e training f822c87

ugh 909325b

Added option to unsqueeze sample batches after sampling 5a5fd9c

oops 9657c1d

DLAS is PIPified (but I'm still cloning it as a submodule to make updating it easier) 0c2a916

VALL-E config edits 34ef046

forgot to not require it to be relative 2e33bf0

option to set results folder location 5cb8610

doing what I do best: sourcing other configs and banging until it works (it doesnt work) 74510e8

tweaks da9b4b5

brain worms f448959

added japanese tokenizer (experimental) b17260c

More changes.

c853a45
unverified

New Changes.

88b8900
unverified

Update README.md

e0fd6bc

More stuff.

1172157
unverified

modified logic to determine valid voice folders, also allows subdirs within the folder (for example: ./voices/SH/james/ will be named SH/james)

faa8da1

should fix #203

02beb1d

disable diarize button

8f3e944

a bunch of shit i had uncommited over the past while pertaining to VALL-E

d8b9969

Merge pull request 'Make convenient to use with Docker' (#191) from psr/ai-voice-cloning:docker into master

b785192

docker: add training script

9afafc6

docker: add ffmpeg for whisper and general cleanup

c018bfc

docker support

d64cba6

#185

0440eac

fixes #185

9f64153

added VALL-E inference support (very rudimentary, gimped, but it will load a model trained on a config generated through the web UI)

4744120

only include auto in the list of models under setting, nothing else

9b01377

added mixing models (shamelessly inspired from voldy's web ui)

f66281f

fixes #176

c89c648

for real this time show those new vall-e metrics

41d47c7

added showing reported training accuracy and eval/validation metrics to graph

c4ca04c

now there should be feature parity between trainers

8c647c8

x_lim and y_lim for graph

fd9b2e0

actually make parsing VALL-E metrics work

9856db5

I forget

69d84bb

my sanitizer actually did work, it was just batch sizes leading to problems when transcribing

444bcda

when the sanitizer thingy works in testing but it doesn't outside of testing, and you have to retranscribe for the fourth time today

a6daf28

why does this keep happening to me

86589ff

more cleanup, use 24KHz for preparing for VALL-E (encodec will resample to 24Khz anyways, makes audio a little nicer), some other things

0ea93a7

remove redundant phonemize for vall-e (oops), quantize all files and then phonemize all files for cope optimization, load alignment model once instead of for every transcription (speedup with whisperx)

d2a9ab9

do not write current whisper.json if there's no changes

19c0854

added whisper transcription 'sanitizing' (collapse very short transcriptions to the previous segment) (I really have to stop having several copies spanning several machines for AIVC, I keep reverting shit)

932eacc

disable diarization for whisperx as it's just a useless performance hit (I don't have anything that's multispeaker within the same audio file at the moment)

736cdc8

ugh

aa5bdaf

now whisperx should output json that aligns with what's expected

13605f9

fixes for whisperx batching

8877960

begrudgingly added back whisperx integration (VAD/Diarization testing, I really, really need accurate timestamps before dumping mondo amounts of time on training a dataset)

4056a27

Fixed #167

b8c3c4c

oops

da96161

cleanups, realigning vall-e training

f822c87

ugh

909325b

Added option to unsqueeze sample batches after sampling

5a5fd9c

oops

9657c1d

DLAS is PIPified (but I'm still cloning it as a submodule to make updating it easier)

0c2a916

VALL-E config edits

34ef046

forgot to not require it to be relative

2e33bf0

option to set results folder location

5cb8610

doing what I do best: sourcing other configs and banging until it works (it doesnt work)

74510e8

tweaks

da9b4b5

brain worms

f448959

added japanese tokenizer (experimental)

b17260c