https://uberduck.ai/ logo
Join Discord
Powered by
# machine-learning
  • h

    hecko

    12/15/2022, 8:54 PM
    or
  • h

    hecko

    12/15/2022, 8:54 PM
    there's spectrogram from waveform
  • h

    hecko

    12/15/2022, 8:54 PM
    but image from spectrogram is a todo
  • p

    Pikachu ✓

    12/15/2022, 8:54 PM
    Todo?
  • h

    hecko

    12/15/2022, 8:54 PM
    to do
  • p

    Pikachu ✓

    12/15/2022, 8:55 PM
    How long is each image
  • m

    mega b

    12/16/2022, 12:33 AM
    super cool...
  • m

    mega b

    12/16/2022, 12:33 AM
    i think i could totally make a vocoder model for music
  • m

    mega b

    12/16/2022, 12:34 AM
    the demo uses griffinlm
  • m

    mega b

    12/16/2022, 12:39 AM
    @hecko do you know any datasets that is just music wavs?
  • m

    mega b

    12/16/2022, 12:39 AM
    like 5 second clips
  • h

    hecko

    12/16/2022, 12:39 AM
    pre-spliced no
  • h

    hecko

    12/16/2022, 12:39 AM
    i do know that kevin macleod is willing to share his stuff for ml projects
  • m

    mega b

    12/16/2022, 12:39 AM
    well i think im just going to pretrain a vocoder model so no need to splice anything
  • m

    mega b

    12/16/2022, 12:39 AM
    thanks for the tip
  • h

    hecko

    12/16/2022, 12:40 AM
    there's musdb18 but that's kinda small and more for stem separation
  • u

    {K EY1} (Kei)

    12/16/2022, 12:40 AM
    I'm willing to share my music for ml purposes too if you'd like
  • u

    {K EY1} (Kei)

    12/16/2022, 12:40 AM
    There's only about 1 hour worth though
  • u

    {K EY1} (Kei)

    12/16/2022, 12:40 AM
    And some of it is garbage
  • u

    {K EY1} (Kei)

    12/16/2022, 12:40 AM
    Cuz it's old
  • j

    Justin

    12/16/2022, 1:35 AM
    https://www.riffusion.com/about
  • h

    hecko

    12/16/2022, 1:46 AM
    -oh and it requires contact too
  • h

    hecko

    12/16/2022, 1:47 AM
    ooh here's https://github.com/mdeff/fma
  • g

    Gosmokeless28

    12/16/2022, 2:26 AM
    This is the coolest StableDiffusion model I've ever seen.
  • z

    zwf

    12/16/2022, 6:29 AM
    I think someone should see if that method works for speech. I'd like us to do it if we can make the bandwidth
  • z

    zwf

    12/16/2022, 6:29 AM
    pretty experimental tho
  • h

    hecko

    12/16/2022, 10:12 AM
    welllll - stable diffusion as text to speech? probably not, since it can't even assign colors to objects properly - as a description-based voice shifter? maybe!
  • p

    PixPrucer

    12/16/2022, 10:17 AM
    Isn't that what diffsvc has going on
  • h

    hecko

    12/16/2022, 11:15 AM
    not description-based though
  • h

    hecko

    12/16/2022, 11:16 AM
    but idk where one would get described voice clips
1...102310241025...1068Latest