Uberduck #machine-learning

Join Discord

KubaPL2004Gmail

02/20/2023, 3:46 PM

https://github.com/lugia19/speechToSpeechElevenLabs speachToSpeachElevenLabs by lugia19 on github

KubaPL2004Gmail

02/20/2023, 3:47 PM

https://github.com/lugia19/elevenlabslib elevenlabslib by lugia19 on github

The Wiggles and High School D×D

02/20/2023, 7:54 PM

I seem to get some decent results from the Tortoise notebook using Japanese voice work to create English speech

The Wiggles and High School D×D

02/20/2023, 7:58 PM

All getting better the more audio I collect

The Wiggles and High School D×D

02/20/2023, 7:59 PM

Here's Australian English The voice sounds right but not the emotion

The Wiggles and High School D×D

02/20/2023, 8:43 PM

Result of Japanese speech

The Wiggles and High School D×D

02/20/2023, 8:44 PM

I don't know how to address the robotic nature

TheRoyalRuby2000

02/20/2023, 10:41 PM

That actually sounds like Greg, holy shit!

I love the hive

02/21/2023, 1:33 PM

i try to finetune stable diffusion and this is a result:

The Wiggles and High School D×D

02/21/2023, 10:58 PM

I got Greg, Murray, and Anthony saying it on intro tracks in Wiggly Safari

Justin

02/22/2023, 1:01 AM

https://github.com/NVIDIA/BigVGAN

The Wiggles and High School D×D

02/22/2023, 2:29 AM

Tortoise seems fairly random when it comes to generating a specific emotion from input text for a speaker

The Wiggles and High School D×D

02/22/2023, 2:29 AM

I've been trying quite a few times to get a good sounding audio clip from a voice I added more support for and it sounds worse than before due to weird inflictions

PeaNutsAreGood

02/22/2023, 2:30 AM

are you using vanilla tortoise?

PeaNutsAreGood

02/22/2023, 2:31 AM

vanilla tortoise only uses the first 4 sec of the clip

The Wiggles and High School D×D

02/22/2023, 2:37 AM

Using the Colab notebook

tanooki426

02/22/2023, 2:38 AM

Tried using Tortoise TTS to create a model of Android 17 by only using voice clips of David Menkin as Travis from Bob the Builder and it turned him into a woman

The Wiggles and High School D×D

02/22/2023, 2:39 AM

Oh that does seem to happen occasionally towards the end of the clip

The Wiggles and High School D×D

02/22/2023, 2:40 AM

I don't know if adding more voice clips to the folder remedies that at all

tanooki426

02/22/2023, 2:41 AM

I tried adding an equal amount of Android 17 voice clips with the Travis voice clips to make an impression of Android 17 and it got rejected from Uberduck because "it sounds too similar."

The Wiggles and High School D×D

02/22/2023, 2:42 AM

That's fresh

The Wiggles and High School D×D

02/22/2023, 2:42 AM

Tortoise probably isn't the best thing to use for English speakers

The Wiggles and High School D×D

02/22/2023, 2:42 AM

But I'm using it for a Japanese speaker and wanted to throw an English model in there for the sake of it

The Wiggles and High School D×D

02/22/2023, 2:43 AM

Doesn't sound so Aussie now

The Wiggles and High School D×D

02/22/2023, 2:43 AM

She sounds great though

The Wiggles and High School D×D

02/22/2023, 2:46 AM

Rias speaks first

The Wiggles and High School D×D

02/22/2023, 2:46 AM

In this exchange

I love the hive

02/24/2023, 9:08 AM

weird result from tor Tortoise TTS

Gosmokeless28

02/24/2023, 9:47 AM

Is this better than iSTFTNet?

Justin

02/24/2023, 9:48 AM

hmm not sure