# tacotron-2-support
  • c

    Cris140

    05/02/2023, 10:43 PM
    Yes but the link may be outdated
  • g

    Gosmokeless28

    05/02/2023, 10:44 PM
    https://colab.research.google.com/drive/1p0Vh_GFiYQ84ECCrjU2PDkmhKcN-GEfR
  • j

    jalin28

    05/02/2023, 10:52 PM
    Oh That Might Be Why Ok
  • c

    Cris140

    05/02/2023, 10:56 PM
    Are you using the one @Gosmokeless28 sent?
  • u

    (".Lukus_bt.")

    05/05/2023, 9:25 PM

    https://cdn.discordapp.com/attachments/994486394049282058/1104157005427519588/Screenshot_2023-05-05_172527.png

  • g

    Gosmokeless28

    05/05/2023, 9:36 PM
    Did you run the setup cell?
  • u

    (".Lukus_bt.")

    05/05/2023, 9:36 PM
    yes
  • g

    Gosmokeless28

    05/05/2023, 9:37 PM
    Try manually selecting the model using the "manual_select" function
  • s

    Sonic2022_mario

    05/06/2023, 12:24 AM
    Can Someone Remove The Music And SFX From This, I Don't Feel Like It https://drive.google.com/file/d/1S04uHEMk5j6WdP1pid9OxwdVs4Iz9kaG/view?usp=share_link
  • s

    Sonic2022_mario

    05/06/2023, 12:24 AM
    I Am Doing Phineas' Season 1 Voice
  • y

    YTR76

    05/06/2023, 4:17 AM
    THAT'S A WHOLE AHH PHINEAS AND FERB EPISODE HOW IS THAT STILL UP💀
  • y

    YTR76

    05/06/2023, 4:25 AM
    btw i'm doing it in the background
  • s

    Sonic2022_mario

    05/06/2023, 9:49 AM
    I Thought You Were Removing The SFX And Music For This
  • m

    mishutka154

    05/06/2023, 10:24 AM
    Am I correct in assuming that there should be no commas in transcript?
  • y

    YTR76

    05/06/2023, 4:05 PM
    I can't get it to be under 31 mb without it being .wav tho
  • g

    Gosmokeless28

    05/06/2023, 5:05 PM
    So why not convert it to a .wav file?
  • g

    Gosmokeless28

    05/06/2023, 5:05 PM
    No
  • y

    YTR76

    05/06/2023, 5:06 PM
    it becomes 31 mb all the time
  • m

    mishutka154

    05/06/2023, 5:10 PM
    Does it make any difference how numbers are written in a transcript, in words or in digits?
  • g

    Gosmokeless28

    05/06/2023, 5:24 PM
    Writing numbers in word form is the best way to transcribe them
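
A minimal sketch of the advice above: spelling out digit runs in a transcript line before training. The function names are hypothetical, and the `path|text` line layout is an assumption about the filelist format, not something confirmed in this thread. Only whole numbers from 0 to 999,999 are handled; anything else (decimals, "1,000", years read as "nineteen ninety") still needs manual editing.

```python
import re

ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def number_to_words(n: int) -> str:
    """Spell out an integer from 0 to 999,999 in English words."""
    if not 0 <= n <= 999_999:
        raise ValueError("this sketch only supports 0..999999")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + (" " + ONES[ones] if ones else "")
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        tail = " " + number_to_words(rest) if rest else ""
        return ONES[hundreds] + " hundred" + tail
    thousands, rest = divmod(n, 1000)
    tail = " " + number_to_words(rest) if rest else ""
    return number_to_words(thousands) + " thousand" + tail


def spell_out_numbers(text: str) -> str:
    """Replace each run of digits with words; leave out-of-range runs as-is."""
    def replace(match: re.Match) -> str:
        value = int(match.group())
        return number_to_words(value) if value <= 999_999 else match.group()
    return re.sub(r"\d+", replace, text)


def convert_filelist_line(line: str) -> str:
    """Convert only the text after the first '|' (assumed 'path|text' layout),
    so digits in the audio file name are not touched."""
    path, sep, text = line.partition("|")
    if not sep:  # no '|' found: treat the whole line as transcript text
        return spell_out_numbers(line)
    return path + sep + spell_out_numbers(text)


print(convert_filelist_line("wavs/12.wav|I have 2 cats and 31 dogs"))
```

Review the output by ear against the clip: the spoken form ("two thousand five" vs. "twenty oh five") is what the transcript has to match.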
  • m

    mishutka154

    05/06/2023, 5:25 PM
    Got it
  • y

    YTR76

    05/07/2023, 10:54 PM
    i did it, it's not wav tho https://cdn.discordapp.com/attachments/994486394049282058/1104904105236893796/8mb.video-JzR-UAWl3zK8.m4a
  • t

    Twilight Sparkle

    05/09/2023, 2:01 PM
    torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 2.00 MiB (GPU 0; 14.75 GiB total capacity; 12.67 GiB already allocated; 832.00 KiB free; 13.61 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
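
The error text above points at `max_split_size_mb` and `PYTORCH_CUDA_ALLOC_CONF`. A minimal sketch of setting that option follows; the value 128 and the helper name `build_alloc_conf` are illustrative assumptions, not a recommendation from this thread. Note that in this particular message the gap between reserved (13.61 GiB) and allocated (12.67 GiB) memory is only about 0.94 GiB, so fragmentation may not be the main cause, and a smaller batch size may matter more.

```python
import os


def build_alloc_conf(max_split_size_mb: int) -> str:
    """Build the value string for the PYTORCH_CUDA_ALLOC_CONF variable."""
    if max_split_size_mb <= 0:
        raise ValueError("max_split_size_mb must be a positive integer")
    return f"max_split_size_mb:{max_split_size_mb}"


# Set the variable before the first CUDA allocation; doing it before
# importing torch is the safe choice. 128 is an illustrative value only.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = build_alloc_conf(128)

# import torch  # import torch only after the variable is set

print(os.environ["PYTORCH_CUDA_ALLOC_CONF"])
```

In a Colab notebook this would go in a cell that runs before any training cell, after restarting the runtime so the previous allocation is released.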
  • s

    Sonic2022_mario

    05/09/2023, 7:19 PM
    What Site Did You Use?
  • b

    BedlamGames

    05/10/2023, 1:04 PM
    is there a max amplitude to use to avoid auto clipping and the audio amplitude out of range warning on the files?
  • p

    PUMPKINEATER

    05/11/2023, 1:15 AM
    NameError Traceback (most recent call last) in ()
          6 #@markdown #### Import the transcript .txt file into the /content/tacotron2/filelists folder on the left.
          7 Training_file = "filelists/transcription.txt" #@param {type: "string"}
    ----> 8 hparams.training_files = Training_file
          9 hparams.validation_files = Training_file
         10
    NameError: name 'hparams' is not defined
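
A `NameError` like this means the name `hparams` does not exist in the running session, which in a notebook typically happens when the cell that creates it has not been run since the runtime last started. Which cell defines `hparams` is not shown in this thread. As a hedged sketch, a small check (the helper `missing_names` is hypothetical) can report what is missing before a later cell fails:

```python
def missing_names(required, namespace):
    """Return the required names that are not defined in the namespace."""
    return [name for name in required if name not in namespace]


# In a notebook, pass globals() as the namespace. A plain dict stands in
# for it here so the example runs anywhere.
session = {}
print(missing_names(["hparams"], session))  # the name is still missing

session["hparams"] = object()  # stands in for running the defining cell
print(missing_names(["hparams"], session))  # nothing missing now
```

If the check reports `hparams`, re-running the earlier cells in order is the usual fix.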
  • g

    Gosmokeless28

    05/11/2023, 1:52 AM
    Is that even a problem? That's just in regard to the TensorBoard preview outputs.
  • b

    BedlamGames

    05/11/2023, 7:43 AM
    Good to know, cheers.
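
For anyone who still wants to keep clips safely below full scale, here is a minimal sketch of peak normalization on floating-point samples in the range -1.0 to 1.0. The function name and the 0.95 target are illustrative assumptions, not a threshold taken from the notebook.

```python
def peak_normalize(samples, target_peak=0.95):
    """Scale samples so the largest absolute value equals target_peak.

    Silent or empty input is returned unchanged to avoid dividing by zero.
    """
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)
    gain = target_peak / peak
    return [s * gain for s in samples]


clip = [0.5, -1.2, 0.3]  # -1.2 would clip when written as 16-bit audio
normalized = peak_normalize(clip)
print(max(abs(s) for s in normalized))
```

This applies one gain to the whole clip, so relative loudness inside the clip is preserved; it does not repair audio that was already clipped when it was recorded.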
  • b

    BedlamGames

    05/11/2023, 1:09 PM
    hmm, what's a good epoch number to aim for with the Multivoice one on Uberduck? The Gravel one (which imo ended up great) only needed around 200. Kroos, which I'm trying now, is okay at 300-ish with a bit of audio scratch. Amiya, though, I got up to over 500 and it was unlistenable with how scratchy it was, with a very similar setup: entirely clean audio split over 50 clips each. Output from TensorBoard doesn't seem to represent how it will sound in actual testing, so it's hard to tell. Any advice would be appreciated.
  • y

    YTR76

    05/11/2023, 8:17 PM
    8mb.video