Uberduck #machine-learning

Join Discord

user

04/14/2021, 1:08 AM

user

04/14/2021, 1:08 AM

you can use 20

Toasty

04/14/2021, 1:08 AM

20 will work?

user

04/14/2021, 1:08 AM

yes

Toasty

04/14/2021, 1:08 AM

Alright

Toasty

04/14/2021, 1:08 AM

My theory is that I made the files too big

Toasty

04/14/2021, 1:09 AM

And the only way to counteract is to lower the batch size

Toasty

04/14/2021, 1:14 AM

Little did I realize, I made my dataset untrainable.

Toasty

04/14/2021, 1:16 AM

Three days of work gone.

Toasty

04/14/2021, 1:17 AM

fml.

user

04/14/2021, 1:17 AM

Fuck google

Toasty

04/14/2021, 1:18 AM

The only way to make my dataset from 24 minutes to 2 hours was raise the limit on ASSFAP from 12 seconds to 30.

Toasty

04/14/2021, 1:18 AM

(yes ik thats the actual name)

Toasty

04/14/2021, 1:18 AM

I feel like I made my dataset too big for Tacotron2.

user

04/14/2021, 1:18 AM

Rip

Toasty

04/14/2021, 1:19 AM

So... it means I'm stuck with a crap 24 minute tts of Yahtzee

Toasty

04/14/2021, 1:20 AM

created with an automated data set maker that's crap

user

04/14/2021, 1:20 AM

Wich one?

Toasty

04/14/2021, 1:21 AM

Automatic Super Speaker-Filtered Audio Processing V4

user

04/14/2021, 1:21 AM

And does it build wav files in?

Toasty

04/14/2021, 1:21 AM

What it does

Toasty

04/14/2021, 1:21 AM

it automatically makes a dataset

user

04/14/2021, 1:21 AM

For tacotron?

Toasty

04/14/2021, 1:21 AM

Yes

Toasty

04/14/2021, 1:22 AM

ARPABET, transcripts, wavs, you name it

Toasty

04/14/2021, 1:22 AM

One problem

Toasty

04/14/2021, 1:22 AM

So, you're suppose to dump like several 22050 hz mono audio files of your speaker with a sample wav of said speaker.

user

04/14/2021, 1:23 AM

Discombobulate the allies of the genetical chromosomian ectomycorrhizals.

Toasty

04/14/2021, 1:23 AM

ASSFAP would take the sample wav and find all possible voice thingies that match the 10 second sample wav.

Toasty

04/14/2021, 1:23 AM

The problem? It has a godawful time at extracting the audio to wav.