Uberduck #tacotron-2-support

haru0l

07/20/2022, 10:15 AM

a fix was already given, iirc

haru0l

07/20/2022, 10:15 AM

use the search function

Deleted User

07/20/2022, 9:44 PM

Guys?

Deleted User

07/20/2022, 9:44 PM

I'm starting to think that maybe I'm making a mistake trying to make a voice for Uberduck, since apparently the tutorial is outdated.

Deleted User

07/20/2022, 9:45 PM

I have an audio file...

Deleted User

07/20/2022, 9:45 PM

I just want to simply train a model, and throw it onto Uberduck's site...

{K EY1} (Kei)

07/20/2022, 9:50 PM

Start by splitting the audio file into segments of 10 seconds or less

Deleted User

07/20/2022, 9:52 PM

So, every sentence into its own WAV file?

{K EY1} (Kei)

07/20/2022, 9:52 PM

Basically, yeah. Sometimes you can fit two sentences in there; that works too!

{K EY1} (Kei)

07/20/2022, 9:54 PM

(It's actually helpful to have more than one sentence, because it helps the model learn how to make sentence breaks.)
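A minimal sketch of the splitting step, assuming the source is an uncompressed PCM WAV and that cutting at fixed 10-second boundaries is acceptable. The name `split_wav` and the numbered output files are illustrative, not part of any Uberduck tooling. Fixed cuts can land mid-word, so splitting by hand at pauses, as described above, will usually give cleaner clips.

```python
import os
import wave


def split_wav(src_path, out_dir, max_seconds=10.0):
    """Split a WAV file into consecutive chunks of at most max_seconds each.

    Hypothetical helper: chunks are written as 1.wav, 2.wav, ... in out_dir.
    """
    os.makedirs(out_dir, exist_ok=True)
    out_paths = []
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        frames_per_chunk = int(src.getframerate() * max_seconds)
        index = 1
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break  # end of the source audio
            out_path = os.path.join(out_dir, f"{index}.wav")
            with wave.open(out_path, "wb") as dst:
                # Copy channel count, sample width and rate from the source;
                # the frame count in the header is corrected on close.
                dst.setparams(params)
                dst.writeframes(frames)
            out_paths.append(out_path)
            index += 1
    return out_paths
```

Calling `split_wav("full.wav", "wavs")` would write the chunks into a `wavs` folder, which matches the path prefix used later in the transcription file.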

Deleted User

07/20/2022, 9:54 PM

There are no sentences

Deleted User

07/20/2022, 9:54 PM

Just several portions

{K EY1} (Kei)

07/20/2022, 9:56 PM

I can't listen to that since my phone won't download the audio

Deleted User

07/20/2022, 9:57 PM

So I have 4 .wav 16bit signed PCM files

{K EY1} (Kei)

07/20/2022, 9:58 PM

Cool

Deleted User

07/20/2022, 9:58 PM

Each labeled 1.wav 2.wav 3.wav etc
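A quick way to double-check clips like these is sketched below, assuming the two properties mentioned in this conversation are the ones that matter: 16-bit samples and a length of 10 seconds or less. `clip_problems` is a hypothetical helper name. Python's standard `wave` module generally only opens uncompressed PCM files and raises `wave.Error` for other encodings.

```python
import wave


def clip_problems(path, max_seconds=10.0):
    """Return a list of human-readable problems found in one WAV clip."""
    problems = []
    with wave.open(path, "rb") as clip:
        bits = clip.getsampwidth() * 8  # sample width is reported in bytes
        seconds = clip.getnframes() / clip.getframerate()
    if bits != 16:
        problems.append(f"{path}: {bits}-bit samples, expected 16-bit")
    if seconds > max_seconds:
        problems.append(f"{path}: {seconds:.2f} s, longer than {max_seconds} s")
    return problems
```

An empty list means the clip passed both checks; for example, `clip_problems("wavs/1.wav")` could be run on each of the four files.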

{K EY1} (Kei)

07/20/2022, 10:00 PM

Now, make a txt file called transcription or list (or whatever else you want to call it). On separate lines, put

wavs/{number}.wav|{text}

with {number} being the name of the file and {text} being what's said in the file
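The line format described above can also be generated by a small script. This is only a sketch: `format_entry` and `write_list` are made-up names, and the rule that a transcript must not itself contain `|` is an assumption based on `|` being the separator.

```python
def format_entry(name, text, wav_dir="wavs"):
    """Build one 'wavs/{number}.wav|{text}' line."""
    text = text.strip()
    if "|" in text or "\n" in text:
        # Assumption: the separator and line breaks cannot appear in the text.
        raise ValueError(f"transcript for {name} contains '|' or a line break")
    return f"{wav_dir}/{name}.wav|{text}"


def write_list(transcripts, list_path="list.txt"):
    """Write one entry per clip; transcripts maps file name -> spoken text."""
    lines = [format_entry(name, text) for name, text in transcripts.items()]
    with open(list_path, "w", encoding="utf-8") as handle:
        # No trailing newline: a cautious choice so the file has no blank last line.
        handle.write("\n".join(lines))
    return lines
```

For instance, `format_entry(1, "Incoming message.")` produces `wavs/1.wav|Incoming message.`.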

Deleted User

07/20/2022, 10:00 PM

I'll call it list.txt

Deleted User

07/20/2022, 10:00 PM

So, for example, I write...

Deleted User

07/20/2022, 10:01 PM

wavs/1.wav|Incoming message.

{K EY1} (Kei)

07/20/2022, 10:01 PM

Include the |

Deleted User

07/20/2022, 10:01 PM

Like that?

{K EY1} (Kei)

07/20/2022, 10:01 PM

There we go

{K EY1} (Kei)

07/20/2022, 10:01 PM

Yep

Deleted User

07/20/2022, 10:01 PM

And the same for all 4

{K EY1} (Kei)

07/20/2022, 10:02 PM

Yeah. And then you'll have your dataset

Deleted User

07/20/2022, 10:03 PM

Like this

Deleted User

07/20/2022, 10:03 PM

wavs/1.wav|Incoming message.
wavs/2.wav|Kids need power.
wavs/3.wav|Operation call to action.
wavs/4.wav|Brock Lee Coco Lay Superhero Identities.
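Before training, a list like this can be sanity-checked with a short script. The sketch below assumes the layout used in this conversation: a list file whose lines are `path|text`, with paths relative to a dataset folder. `check_list` and `dataset_dir` are hypothetical names.

```python
import os


def check_list(list_path, dataset_dir="."):
    """Return a list of problems found in a 'path|text' transcription file."""
    problems = []
    with open(list_path, encoding="utf-8") as handle:
        for number, line in enumerate(handle, start=1):
            line = line.rstrip("\r\n")
            if not line.strip():
                problems.append(f"line {number}: blank line")
                continue
            parts = line.split("|")
            if len(parts) != 2:
                problems.append(f"line {number}: expected exactly one '|'")
                continue
            wav_path, text = parts
            if not text.strip():
                problems.append(f"line {number}: empty transcript")
            # The path before '|' should point at an existing audio file.
            if not os.path.isfile(os.path.join(dataset_dir, wav_path)):
                problems.append(f"line {number}: missing file {wav_path}")
    return problems
```

An empty result from `check_list("list.txt")` means every line has one separator, some text, and a matching file under `wavs/`.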

Deleted User

07/20/2022, 10:04 PM

So, I got that 😛

Deleted User

07/20/2022, 10:04 PM

But how do I get this shoved into a model? 🙂