Uberduck #machine-learning

hecko

12/15/2022, 8:54 PM

there's spectrogram from waveform

hecko

12/15/2022, 8:54 PM

but image from spectrogram is a todo
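
One common way to go from a spectrogram to an image is to convert magnitudes to decibels, clip the dynamic range, and rescale to 8-bit grayscale. The sketch below illustrates that idea only; the function name `spectrogram_to_image`, the 80 dB range, and the axis orientation are assumptions made for illustration, not the code being discussed here.

```python
import numpy as np

def spectrogram_to_image(magnitude, dynamic_range_db=80.0):
    """Map a non-negative magnitude spectrogram of shape (frames, bins)
    to uint8 grayscale pixels of shape (bins, frames)."""
    # Convert to decibels, guarding against log(0).
    db = 20.0 * np.log10(np.maximum(magnitude, 1e-10))
    # Clip everything quieter than the chosen range below the peak.
    db = np.maximum(db, db.max() - dynamic_range_db)
    # Rescale to [0, 1]; the max() guard avoids dividing by zero on flat input.
    scaled = (db - db.min()) / max(db.max() - db.min(), 1e-12)
    pixels = np.round(255.0 * scaled).astype(np.uint8)
    # Put frequency on the vertical axis with low frequencies at the bottom.
    return pixels.T[::-1]
```

The resulting array can be handed to any image library that accepts 8-bit grayscale data. The mapping is lossy: phase is discarded and quiet bins are clipped, so turning the image back into audio needs a phase-reconstruction step.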

Pikachu ✓

12/15/2022, 8:54 PM

Todo?

hecko

12/15/2022, 8:54 PM

to do

Pikachu ✓

12/15/2022, 8:55 PM

How long is each image?

mega b

12/16/2022, 12:33 AM

super cool...

mega b

12/16/2022, 12:33 AM

i think i could totally make a vocoder model for music

mega b

12/16/2022, 12:34 AM

the demo uses Griffin-Lim
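
For context, the Griffin-Lim algorithm mentioned above estimates a phase for a magnitude-only spectrogram by alternating between the time domain and the STFT domain, so no trained vocoder is needed. Below is a minimal NumPy sketch of that iteration. The function names, frame size (`n_fft=256`), hop (`hop=64`), and iteration count are illustrative assumptions and are not taken from the demo's code.

```python
import numpy as np

def stft(x, n_fft=256, hop=64):
    """Hann-windowed STFT; returns complex array of shape (frames, n_fft // 2 + 1)."""
    window = np.hanning(n_fft + 1)[:-1]  # periodic Hann window
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop : i * hop + n_fft] * window for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def istft(spec, n_fft=256, hop=64):
    """Windowed overlap-add inverse of stft(), normalized by the summed squared window."""
    window = np.hanning(n_fft + 1)[:-1]
    n_frames = spec.shape[0]
    length = n_fft + hop * (n_frames - 1)
    out = np.zeros(length)
    norm = np.zeros(length)
    frames = np.fft.irfft(spec, n=n_fft, axis=1)
    for i in range(n_frames):
        start = i * hop
        out[start : start + n_fft] += frames[i] * window
        norm[start : start + n_fft] += window ** 2
    # The first and last few samples are covered by very little window energy,
    # so they are poorly constrained; the floor only prevents division by zero.
    return out / np.maximum(norm, 1e-8)

def griffin_lim(magnitude, n_iter=50, n_fft=256, hop=64, seed=0):
    """Estimate a waveform whose STFT magnitude approximates `magnitude`."""
    rng = np.random.default_rng(seed)
    # Start from a random phase guess.
    phase = np.exp(2j * np.pi * rng.random(magnitude.shape))
    for _ in range(n_iter):
        # Go to the time domain, come back, and keep only the new phase.
        signal = istft(magnitude * phase, n_fft, hop)
        phase = np.exp(1j * np.angle(stft(signal, n_fft, hop)))
    return istft(magnitude * phase, n_fft, hop)
```

Calling `griffin_lim(np.abs(stft(x)))` on a waveform `x` returns a signal of the same framed length whose magnitude spectrogram approximates the original. The recovered phase is only an estimate, which is one reason a trained vocoder can sound cleaner than this method.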

mega b

12/16/2022, 12:39 AM

@hecko do you know any datasets that are just music wavs?

mega b

12/16/2022, 12:39 AM

like 5 second clips

hecko

12/16/2022, 12:39 AM

pre-spliced no

hecko

12/16/2022, 12:39 AM

i do know that kevin macleod is willing to share his stuff for ml projects

mega b

12/16/2022, 12:39 AM

well i think im just going to pretrain a vocoder model so no need to splice anything

mega b

12/16/2022, 12:39 AM

thanks for the tip

hecko

12/16/2022, 12:40 AM

there's musdb18 but that's kinda small and more for stem separation

{K EY1} (Kei)

12/16/2022, 12:40 AM

I'm willing to share my music for ml purposes too if you'd like

{K EY1} (Kei)

12/16/2022, 12:40 AM

There's only about 1 hour worth though

{K EY1} (Kei)

12/16/2022, 12:40 AM

And some of it is garbage

{K EY1} (Kei)

12/16/2022, 12:40 AM

Cuz it's old

Justin

12/16/2022, 1:35 AM

https://www.riffusion.com/about

hecko

12/16/2022, 1:46 AM

oh and it requires contact too

hecko

12/16/2022, 1:47 AM

ooh here's https://github.com/mdeff/fma

Gosmokeless28

12/16/2022, 2:26 AM

This is the coolest StableDiffusion model I've ever seen.

zwf

12/16/2022, 6:29 AM

I think someone should see if that method works for speech. I'd like us to do it if we can make the bandwidth

zwf

12/16/2022, 6:29 AM

pretty experimental tho

hecko

12/16/2022, 10:12 AM

welllll - stable diffusion as text to speech? probably not, since it can't even assign colors to objects properly - as a description-based voice shifter? maybe!

PixPrucer

12/16/2022, 10:17 AM

Isn't that what diffsvc has going on?

hecko

12/16/2022, 11:15 AM

not description-based though

hecko

12/16/2022, 11:16 AM

but idk where one would get described voice clips