Uberduck #tacotron-2-support

Join Discord

tylerdurdenceketi

11/26/2022, 9:20 AM

Add a new code block and type this: !pip install --upgrade gdown

lil beby

11/26/2022, 10:03 AM

Guys, how to get in?

hecko

11/26/2022, 12:01 PM

did you run the previous steps

hecko

11/26/2022, 12:02 PM

are you using a school computer or school google account? if so then you can't

hecko

11/26/2022, 12:03 PM

you'd have to use your own device with your own personal account

hecko

11/26/2022, 12:03 PM

(or take it up with the computer person in your school but i doubt it'll work)

9 x x

11/26/2022, 5:03 PM

yeah make sure you are not on a work account

9 x x

11/26/2022, 5:03 PM

also

9 x x

11/26/2022, 5:03 PM

#841437191073955920

9 x x

11/26/2022, 5:04 PM

make sure the notebook is there

9 x x

11/26/2022, 5:04 PM

and when you make a copy you are not switching accounts

Sonic2022_mario

11/26/2022, 6:57 PM

I Just Uploaded Sonic (Roger Craig Smith, Frontiers) And Tested It And Sounds Like This

Daft

11/26/2022, 11:12 PM

how come my model is pronouncing certain words wrong

Lexi (delulu posts on the daily)

11/27/2022, 12:42 AM

because dataset could be silly or u could use arpabet

mynameisNegan

11/27/2022, 1:51 AM

@hecko What's the best epoch for AITCH using Uberduck Tacotron 2 Pipeline? I have over 547 wav files total of 32:19 mins.

hecko

11/27/2022, 1:52 AM

there's no hard number for things like that

hecko

11/27/2022, 1:52 AM

just train it for some time and test a few of the checkpoints in the synthesis notebook

mynameisNegan

11/27/2022, 1:52 AM

Okay.

tylerdurdenceketi

11/27/2022, 11:31 AM

I have preprocessed files with preprocess_audio.py. Graphs were good. But speeches cut off early. So i ran preprocess files with preprocess_audio.py again with 300ms padding. But mel spectogram shows a black region at the end. I have read something about adding noise to audio tracks in github issues section of the tacotron2 repo. What should I do?

(Dawn) Will Draw Fictional Women

11/27/2022, 11:32 AM

>But speeches cut off early.

(Dawn) Will Draw Fictional Women

11/27/2022, 11:32 AM

you overtrained

(Dawn) Will Draw Fictional Women

11/27/2022, 11:32 AM

even then a few tries would probably yield the full prompt

tylerdurdenceketi

11/27/2022, 11:33 AM

Well loss ratio says otherwise.

tylerdurdenceketi

11/27/2022, 11:33 AM

It was 0.30

(Dawn) Will Draw Fictional Women

11/27/2022, 11:33 AM

if you get it TOO low it could be overtrained

(Dawn) Will Draw Fictional Women

11/27/2022, 11:34 AM

if you would like tips on a third attempt i suggest multiple sentences in a single wav file

(Dawn) Will Draw Fictional Women

11/27/2022, 11:34 AM

make sure they dont exceed 12 seconds tho

tylerdurdenceketi

11/27/2022, 11:35 AM

Well I have 10000 wav files.

(Dawn) Will Draw Fictional Women

11/27/2022, 11:35 AM

@hecko has a wav merger cell that should alleviate all the labor

tylerdurdenceketi

11/27/2022, 11:36 AM

I set batch size 32 for 20 epoch. It stalled at 0.30 Then I set it to 128 Then it triggered lr_decay Learning rate got smaller and loss started to decrease