https://uberduck.ai/ logo
Join Discord
Powered by
# machine-learning
  • u

    user

    04/14/2021, 1:08 AM
    no
  • u

    user

    04/14/2021, 1:08 AM
    you can use 20
  • t

    Toasty

    04/14/2021, 1:08 AM
    20 will work?
  • u

    user

    04/14/2021, 1:08 AM
    yes
  • t

    Toasty

    04/14/2021, 1:08 AM
    Alright
  • t

    Toasty

    04/14/2021, 1:08 AM
    My theory is that I made the files too big
  • t

    Toasty

    04/14/2021, 1:09 AM
    And the only way to counteract is to lower the batch size
  • t

    Toasty

    04/14/2021, 1:14 AM
    Little did I realize, I made my dataset untrainable.
  • t

    Toasty

    04/14/2021, 1:16 AM
    Three days of work gone.
  • t

    Toasty

    04/14/2021, 1:17 AM
    fml.
  • u

    user

    04/14/2021, 1:17 AM
    Fuck google
  • t

    Toasty

    04/14/2021, 1:18 AM
    The only way to make my dataset from 24 minutes to 2 hours was raise the limit on ASSFAP from 12 seconds to 30.
  • t

    Toasty

    04/14/2021, 1:18 AM
    (yes ik thats the actual name)
  • t

    Toasty

    04/14/2021, 1:18 AM
    I feel like I made my dataset too big for Tacotron2.
  • u

    user

    04/14/2021, 1:18 AM
    Rip
  • t

    Toasty

    04/14/2021, 1:19 AM
    So... it means I'm stuck with a crap 24 minute tts of Yahtzee
  • t

    Toasty

    04/14/2021, 1:20 AM
    created with an automated data set maker that's crap
  • u

    user

    04/14/2021, 1:20 AM
    Wich one?
  • t

    Toasty

    04/14/2021, 1:21 AM
    Automatic Super Speaker-Filtered Audio Processing V4
  • u

    user

    04/14/2021, 1:21 AM
    And does it build wav files in?
  • t

    Toasty

    04/14/2021, 1:21 AM
    What it does
  • t

    Toasty

    04/14/2021, 1:21 AM
    it automatically makes a dataset
  • u

    user

    04/14/2021, 1:21 AM
    For tacotron?
  • t

    Toasty

    04/14/2021, 1:21 AM
    Yes
  • t

    Toasty

    04/14/2021, 1:22 AM
    ARPABET, transcripts, wavs, you name it
  • t

    Toasty

    04/14/2021, 1:22 AM
    One problem
  • t

    Toasty

    04/14/2021, 1:22 AM
    So, you're suppose to dump like several 22050 hz mono audio files of your speaker with a sample wav of said speaker.
  • u

    user

    04/14/2021, 1:23 AM
    Discombobulate the allies of the genetical chromosomian ectomycorrhizals.
  • t

    Toasty

    04/14/2021, 1:23 AM
    ASSFAP would take the sample wav and find all possible voice thingies that match the 10 second sample wav.
  • t

    Toasty

    04/14/2021, 1:23 AM
    The problem? It has a godawful time at extracting the audio to wav.
1...91011...1068Latest