Uberduck #tacotron-2-support

Cris140

05/02/2023, 10:43 PM

Yes but the link may be outdated

Gosmokeless28

05/02/2023, 10:44 PM

https://colab.research.google.com/drive/1p0Vh_GFiYQ84ECCrjU2PDkmhKcN-GEfR

jalin28

05/02/2023, 10:52 PM

Oh That Might Be Why Ok

Cris140

05/02/2023, 10:56 PM

Are you using the one @Gosmokeless28 sent?

(".Lukus_bt.")

05/05/2023, 9:25 PM

https://cdn.discordapp.com/attachments/994486394049282058/1104157005427519588/Screenshot_2023-05-05_172527.png

Gosmokeless28

05/05/2023, 9:36 PM

Did you run the setup cell?

(".Lukus_bt.")

05/05/2023, 9:36 PM

yes

Gosmokeless28

05/05/2023, 9:37 PM

Try manually selecting the model using the "manual_select" function

Sonic2022_mario

05/06/2023, 12:24 AM

Can Someone Remove The Music And SFX From This, I Don't Feel Like It https://drive.google.com/file/d/1S04uHEMk5j6WdP1pid9OxwdVs4Iz9kaG/view?usp=share_link

Sonic2022_mario

05/06/2023, 12:24 AM

I Am Doing Phineas' Season 1 Voice

YTR76

05/06/2023, 4:17 AM

THAT'S A WHOLE AHH PHINEAS AND FERB EPISODE HOW IS THAT STILL UP💀

YTR76

05/06/2023, 4:25 AM

btw i'm doing it in the background

Sonic2022_mario

05/06/2023, 9:49 AM

I Thought You Were Removing The SFX And Music For This

mishutka154

05/06/2023, 10:24 AM

Am I correct in assuming that there should be no commas in the transcript?

YTR76

05/06/2023, 4:05 PM

I can't get it to be under 31 mb without it being .wav tho

Gosmokeless28

05/06/2023, 5:05 PM

So why not convert it to a .wav file?

YTR76

05/06/2023, 5:06 PM

it becomes 31 mb all the time

mishutka154

05/06/2023, 5:10 PM

Does it matter whether numbers in a transcript are written as words or as digits?

Gosmokeless28

05/06/2023, 5:24 PM

Writing numbers in word form is the best way to transcribe them
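A rough sketch of how digits in a transcript could be spelled out automatically, in case editing by hand is tedious. It assumes the usual `path|text` filelist layout and only touches the text after the `|`, so filenames like `wavs/12.wav` stay intact. The helper names are made up for illustration, it only handles whole numbers from 0 to 999, and decimals, years, and ordinals would still need manual attention.

```python
import re

ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty",
        "sixty", "seventy", "eighty", "ninety"]


def number_to_words(n: int) -> str:
    """Spell out an integer from 0 to 999 in English words."""
    if not 0 <= n <= 999:
        raise ValueError("only 0..999 is supported in this sketch")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + ("-" + ONES[ones] if ones else "")
    hundreds, rest = divmod(n, 100)
    words = ONES[hundreds] + " hundred"
    return words + (" " + number_to_words(rest) if rest else "")


def spell_out_numbers(text: str) -> str:
    """Replace standalone 1-3 digit numbers in text with words."""
    return re.sub(r"\b\d{1,3}\b",
                  lambda m: number_to_words(int(m.group())), text)


def spell_out_transcript_line(line: str) -> str:
    """Convert only the text part of a 'path|text' line (assumed format)."""
    if "|" in line:
        path, text = line.split("|", 1)
        return path + "|" + spell_out_numbers(text)
    return spell_out_numbers(line)


print(spell_out_transcript_line("wavs/12.wav|I have 2 cats and 15 dogs."))
```

Numbers with four or more digits are left untouched by the pattern, so they are easy to spot and fix manually afterwards.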

mishutka154

05/06/2023, 5:25 PM

Got it

YTR76

05/07/2023, 10:54 PM

i did it, it's not wav tho https://cdn.discordapp.com/attachments/994486394049282058/1104904105236893796/8mb.video-JzR-UAWl3zK8.m4a

Twilight Sparkle

05/09/2023, 2:01 PM

torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 2.00 MiB (GPU 0; 14.75 GiB total capacity; 12.67 GiB already allocated; 832.00 KiB free; 13.61 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
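For reference, a small sketch of the setting the error message itself points to, plus a helper (hypothetical name) that pulls the allocated and reserved figures out of the message so they can be compared. The `128` value is only an illustrative assumption, and the variable has to be set before PyTorch makes its first CUDA allocation, so it is safest to set it before importing torch.

```python
import os
import re

# Assumed example value; must be set before the first CUDA allocation.
os.environ.setdefault("PYTORCH_CUDA_ALLOC_CONF", "max_split_size_mb:128")


def allocated_and_reserved_gib(message: str):
    """Return (allocated, reserved) in GiB parsed from an OOM message, or None."""
    allocated = re.search(r"([\d.]+) GiB already allocated", message)
    reserved = re.search(r"([\d.]+) GiB reserved", message)
    if not (allocated and reserved):
        return None
    return float(allocated.group(1)), float(reserved.group(1))


msg = ("CUDA out of memory. Tried to allocate 2.00 MiB (GPU 0; 14.75 GiB total "
       "capacity; 12.67 GiB already allocated; 832.00 KiB free; 13.61 GiB "
       "reserved in total by PyTorch)")
allocated, reserved = allocated_and_reserved_gib(msg)
print(f"reserved minus allocated: {reserved - allocated:.2f} GiB")
```

In this particular message the reserved memory is under 1 GiB above the allocated memory, so fragmentation is probably not the main cause. Restarting the runtime to free leftover memory or lowering the batch size is more likely to help, assuming the notebook exposes a batch size setting.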

Sonic2022_mario

05/09/2023, 7:19 PM

What Site Did You Use?

BedlamGames

05/10/2023, 1:04 PM

is there a max amplitude to use to avoid auto clipping and the audio amplitude out of range warning on the files?
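On the amplitude question, a minimal sketch of peak normalization, assuming the audio is held as floating-point samples where full scale is 1.0 (anything beyond that clips when written out as 16-bit PCM). The `0.95` target and the function name are illustrative assumptions, not a value the notebook is known to require.

```python
def peak_normalize(samples, target_peak=0.95):
    """Scale float samples so the loudest one sits at target_peak."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        # Silent or empty clip: nothing to scale.
        return list(samples)
    gain = target_peak / peak
    return [s * gain for s in samples]


clip = [0.5, -1.2, 0.3]           # -1.2 is out of range and would clip
fixed = peak_normalize(clip)
print(max(abs(s) for s in fixed))
```

Leaving a little headroom below 1.0 keeps later processing steps from pushing samples back over full scale.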

PUMPKINEATER

05/11/2023, 1:15 AM

NameError Traceback (most recent call last)
in ()
      6 #@markdown #### Import the transcript .txt file into the /content/tacotron2/filelists folder on the left.
      7 Training_file = "filelists/transcription.txt" #@param {type: "string"}
----> 8 hparams.training_files = Training_file
      9 hparams.validation_files = Training_file
     10
NameError: name 'hparams' is not defined
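A `NameError` like this usually means the cell that creates `hparams` has not been run in the current runtime, for example after a restart or disconnect. A small guard along these lines (the helper name is hypothetical, and this is not part of the notebook) would turn that into a clearer message:

```python
def require_defined(name, namespace):
    """Return namespace[name], or raise a readable error if it is missing."""
    if name not in namespace:
        raise RuntimeError(
            f"'{name}' is not defined yet; run the earlier setup cell(s) "
            "first, then re-run this cell."
        )
    return namespace[name]


# In a notebook cell this would be called as: require_defined("hparams", globals())
try:
    require_defined("hparams", {})
except RuntimeError as err:
    print(err)
```

The practical fix is simply to re-run the earlier cells from the top so that `hparams` exists before this cell executes.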

Gosmokeless28

05/11/2023, 1:52 AM

Is that even a problem? That's just in regard to the TensorBoard preview outputs.

BedlamGames

05/11/2023, 7:43 AM

Good to know, cheers.

BedlamGames

05/11/2023, 1:09 PM

hmm, what's a good epoch number to aim for with the Multivoice one on Uberduck? The Gravel one (which imo ended up great) only needed around 200. Kroos, which I'm trying now, is okay at 300-ish with a bit of audio scratch. Amiya, though, I got up to over 500 and it was unlistenable with how scratchy it was, with a very similar setup: entirely clean audio split over 50 clips each. The output from TensorBoard doesn't seem to represent how it will sound in actual testing, so it's hard to tell. Any advice would be appreciated.

YTR76

05/11/2023, 8:17 PM

8mb.video