https://uberduck.ai/ logo
Join Discord
Powered by
# tacotron-2-support
  • c

    Cosmeticwave

    12/10/2022, 3:00 PM
    at the mel spectogram part
  • c

    Cosmeticwave

    12/10/2022, 3:11 PM
    Was using legacy tacotron notebook
  • c

    Cosmeticwave

    12/10/2022, 3:21 PM
    I should probably use the other one
  • c

    Cosmeticwave

    12/10/2022, 4:47 PM
    got it now
  • a

    Amizade | Pony's voice creator

    12/10/2022, 5:36 PM
    The wavsample is not changing the bitrate of my wav files.
  • r

    Reclezon

    12/10/2022, 5:51 PM
    There's a colab to fix that in #841437191073955920 under the
    Other notebooks
    section.
  • a

    Amizade | Pony's voice creator

    12/10/2022, 6:09 PM
    Okay, I'll see thx
  • a

    Amizade | Pony's voice creator

    12/10/2022, 6:13 PM
    but I'm not founding
  • a

    Amizade | Pony's voice creator

    12/10/2022, 6:14 PM
    @Reclezon
  • r

    Reclezon

    12/10/2022, 6:22 PM
    https://colab.research.google.com/drive/100uSy4yDgr118RfuSojVumI23nM4OBkY?usp=sharing#scrollTo=IXbqIuGG8ai
  • a

    Amizade | Pony's voice creator

    12/10/2022, 6:28 PM
    @Reclezon did not work
  • a

    Amizade | Pony's voice creator

    12/10/2022, 7:37 PM
    Copy code
    js
    RuntimeError                              Traceback (most recent call last)
    <ipython-input-21-37fdb9b04f8a> in <module>
          3 print("Generating mels")
          4 if generate_mels:
    ----> 5     create_mels()
          6 
          7 print("Checking for missing files")
    
    3 frames
    /content/tacotron2/stft.py in transform(self, input_data)
         82 
         83         # similar to librosa, reflect-pad the input
    ---> 84         input_data = input_data.view(num_batches, 1, num_samples)
         85         input_data = F.pad(
         86             input_data.unsqueeze(1),
    
    RuntimeError: shape '[1, 1, 21654]' is invalid for input of size 43308
  • h

    hecko

    12/11/2022, 12:44 PM
    i don't think it fixes anything by itself, it just checks
  • h

    hecko

    12/11/2022, 12:45 PM
    the pipeline notebook has a fixer but you're best off learning how to fix it yourself
  • h

    hecko

    12/11/2022, 12:46 PM
    namely - drag it into audacity - set sample rate in the bottom left to 22050 - tracks→ mix → mix stereo down to mono - export as wav (16-bit)
  • a

    Amizade | Pony's voice creator

    12/11/2022, 12:47 PM
    but is there a way to do this at once with all the audios?
  • h

    hecko

    12/11/2022, 12:47 PM
    yeah
  • h

    hecko

    12/11/2022, 12:48 PM
    drag them all in
  • h

    hecko

    12/11/2022, 12:48 PM
    rest is the same
  • h

    hecko

    12/11/2022, 12:48 PM
    but instead of normal export do export multiple
  • a

    Amizade | Pony's voice creator

    12/11/2022, 1:02 PM
    Copy code
    js
    Generating mels
    Generating Mels
    37%
    11/30 [00:00<00:00, 32.64it/s]
    /content/tacotron2/utils.py:14: WavFileWarning: Chunk (non-data) not understood, skipping it.
      sampling_rate, data = read(full_path)
    ---------------------------------------------------------------------------
    RuntimeError                              Traceback (most recent call last)
    <ipython-input-5-37fdb9b04f8a> in <module>
          3 print("Generating mels")
          4 if generate_mels:
    ----> 5     create_mels()
          6 
          7 print("Checking for missing files")
    
    3 frames
    /content/tacotron2/stft.py in transform(self, input_data)
         82 
         83         # similar to librosa, reflect-pad the input
    ---> 84         input_data = input_data.view(num_batches, 1, num_samples)
         85         input_data = F.pad(
         86             input_data.unsqueeze(1),
    
    RuntimeError: shape '[1, 1, 52781]' is invalid for input of size 10556
  • a

    Amizade | Pony's voice creator

    12/11/2022, 1:02 PM
    what this error means?
  • h

    hecko

    12/11/2022, 1:02 PM
    i,,,, have no idea
  • a

    Amizade | Pony's voice creator

    12/11/2022, 1:05 PM
    who is the developer of this notebook?
  • a

    Amizade | Pony's voice creator

    12/11/2022, 1:09 PM
    surely he must understand that
  • h

    hecko

    12/11/2022, 1:16 PM
    it's complicated
  • h

    hecko

    12/11/2022, 1:17 PM
    it was touched by at least 4 separate people, 1 of whom disclaims responsibility for the code itself
  • h

    hecko

    12/11/2022, 1:17 PM
    all i know is something's very wrong with your audio
  • a

    Amizade | Pony's voice creator

    12/11/2022, 1:20 PM
    if you want, I can send you a print of the files to see how it turned out
  • h

    hecko

    12/11/2022, 1:20 PM
    i can try checking them sure
1...878889...158Latest