Uberduck #machine-learning

Cris140

09/26/2022, 4:46 AM

Perfect, thank you so much 🙏

Dalton

09/27/2022, 2:01 AM

does anyone know if there are any sites using this? https://github.com/neonbjb/tortoise-tts

Dalton

09/27/2022, 2:01 AM

the samples sound REALLY REAL

(Dawn) Will Draw Fictional Women

09/27/2022, 2:01 AM

you cant implement this into a site

(Dawn) Will Draw Fictional Women

09/27/2022, 2:01 AM

it requires a gpu to run at a decent speed, which is super costly

Dalton

09/27/2022, 2:02 AM

is this not a model type like tacotron or talknet?

(Dawn) Will Draw Fictional Women

09/27/2022, 2:03 AM

Dalton

09/27/2022, 2:03 AM

oh I thought it was

Dalton

09/27/2022, 2:03 AM

im stupid srry

hecko

09/27/2022, 8:08 AM

it's closer to 15.ai in that it has a bunch of voices crammed into one giant model, so many in fact that it can mix them together to sorta emulate any voice you give it. unfortunately it's also extremely slow, takes like a minute to generate one audio file on colab on default settings

hecko

09/27/2022, 8:08 AM

plus since it was trained on podcasts and such it's not very good at the wacky cartoon voices we have at uberduck

a cat came into my house

09/27/2022, 11:50 PM

hi, new pipeline trainer here. am i supposed to use the "wavs/" format for transcription txts (like in legacy training)? if this is not the right channel then let me know

real sky 2022 (how)

09/27/2022, 11:51 PM

the preferred channel is #994486394049282058 but it's okay. yes, the formatting is the same
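
For anyone checking their transcription txt before training, here is a minimal sketch of a line checker. It assumes the "wavs/" format means LJSpeech-style lines such as `wavs/1.wav|Hello there.` (a path starting with `wavs/`, one `|`, then the transcript). That layout is an assumption and is not confirmed in this thread, and `check_filelist_line` is a hypothetical helper name, not part of any Uberduck tool.

```python
from __future__ import annotations

from pathlib import PurePosixPath


def check_filelist_line(line: str, prefix: str = "wavs/") -> list[str]:
    """Return a list of problems found in one 'path|transcript' line.

    Assumes an LJSpeech-style layout (assumption, not confirmed by the thread):
        wavs/<name>.wav|<transcript text>
    """
    parts = line.rstrip("\n").split("|")
    if len(parts) != 2:
        # Either no separator at all, or extra '|' characters in the transcript.
        return [f"expected exactly one '|' separator, found {len(parts) - 1}"]

    path, text = parts
    problems = []
    if not path.startswith(prefix):
        problems.append(f"path does not start with {prefix!r}")
    if PurePosixPath(path).suffix.lower() != ".wav":
        problems.append("path does not end in .wav")
    if not text.strip():
        problems.append("transcript is empty")
    return problems


if __name__ == "__main__":
    sample = ["wavs/1.wav|Uh oh, you found the toothpaste!", "2.wav|missing prefix"]
    for number, line in enumerate(sample, start=1):
        for problem in check_filelist_line(line):
            print(f"line {number}: {problem}")
```

Running it over every line of the txt and fixing whatever it prints is usually enough to catch a missing prefix or a stray separator before a long training run.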

fatherallah

09/28/2022, 2:31 AM

Anyone here successfully make a great singing model on talknet? With the dataset composed of a singer’s acapellas? My first one came out awful and I want to know if I should keep trying or find an alternative.

postmates!!

09/28/2022, 3:37 AM

well that was easier than i expected

a cat came into my house

09/28/2022, 5:07 AM

training a model of the "Uh oh, you found the toothpaste!" guy. i think he sounds good

a cat came into my house

09/28/2022, 5:12 AM

even though i might have to retrain him, which takes hours, all because i forgot his "You found the toothpaste" line

a cat came into my house

09/28/2022, 5:34 AM

fuck it. retraining just to add the toothpaste lines

a cat came into my house

09/28/2022, 7:00 AM

alright it's time for the FFmpeg arpabet test.

a cat came into my house

09/28/2022, 7:00 AM

small passed the test... kind of!

postmates!!

09/28/2022, 2:20 PM

got some meh results from my edd model

postmates!!

09/28/2022, 2:21 PM

sounds like it needs more time in the oven

HolyArapaima

09/28/2022, 4:48 PM

https://youtu.be/40HJJKxHO_o

HolyArapaima

09/28/2022, 4:49 PM

This is the best singing output I got out of talknet.

Couch

09/28/2022, 4:49 PM

if i recall someone said consistency is a huge factor when it comes to talknet

HolyArapaima

09/28/2022, 4:49 PM

Yes

HolyArapaima

09/28/2022, 4:49 PM

It's unpredictable

HolyArapaima

09/28/2022, 4:50 PM

You can skew your learning rate and stuff to help you out but it's very trial and error

HolyArapaima

09/28/2022, 4:51 PM

After I got this output out of talknet I struggled to get the model to replicate it lol

HolyArapaima

09/28/2022, 4:51 PM

I found that it obviously worked best with words and singing phrases that were already recorded