https://uberduck.ai/ logo
Join Discord
Powered by
# tacotron-2-support
  • c

    Cris140

    12/13/2022, 6:55 PM
    840 epochs, that's too much
  • c

    Cris140

    12/13/2022, 6:55 PM
    you overfitted the model
  • a

    AhmadGT

    12/13/2022, 8:01 PM
    is there any way making it less janky with less epochs
  • a

    AhmadGT

    12/13/2022, 8:04 PM
    also thre is no refrance adio o use
  • a

    AhmadGT

    12/13/2022, 8:06 PM
    there is no more refrance audio to use*
  • a

    AhmadGT

    12/13/2022, 8:20 PM
    well i think it sounds janky because im too smart and name 2 wavs with the same name
  • a

    AhmadGT

    12/13/2022, 8:22 PM
    do i need to retrain?
  • f

    fincherfan00

    12/13/2022, 11:55 PM
    using the legacy tacotron 2 model and got this error, can someone help me figure this out?
  • f

    fincherfan00

    12/14/2022, 12:03 AM
    nvm issue fixed, i just switched to the FakeYou tacotron 2 notebook
  • g

    Gosmokeless28

    12/14/2022, 12:05 AM
    840?? No wonder it sounds janky. You overfit the model.
  • g

    Gosmokeless28

    12/14/2022, 12:06 AM
    Train the model for 200 epochs.
  • a

    AhmadGT

    12/14/2022, 2:00 AM
    mhmm okay thanks!
  • a

    AhmadGT

    12/14/2022, 2:00 AM
    RETAINING T I M E
  • f

    fincherfan00

    12/14/2022, 3:52 AM
    noo FakeYou model tester is giving me this message
  • f

    fincherfan00

    12/14/2022, 3:56 AM
    i already have it set to anyone can edit with the link
  • g

    Gosmokeless28

    12/14/2022, 5:24 AM
    I see the problem: You pasted the whole link instead of only the ID part of the URL.
  • a

    AhmadGT

    12/14/2022, 5:25 PM
    so i trained it for 200 but i think it needs to be 300
  • c

    Cris140

    12/14/2022, 6:46 PM
    No, it doesn't
  • c

    Cris140

    12/14/2022, 6:47 PM
    If you model didn't came out good with 200 epochs using only 6 minutes, your 6 minutes of audio isn't good
  • c

    Cris140

    12/14/2022, 6:47 PM
    Or you transcription can have typos
  • c

    Cris140

    12/14/2022, 6:47 PM
    Around 150 epochs you would already get a good model with 18 batch size
  • p

    PixPrucer

    12/15/2022, 9:12 AM
    Hello! It's been a while since I've typed here I missed silly talking robots But yeah I've encountered an interesting problem on the pipeline interference notebook while testing my Polish model
  • p

    PixPrucer

    12/15/2022, 9:13 AM
    I've recorded a fresh dataset earlier and the training seemed successful, but for some reason __the model rolls from accented letters back to Latin letters when rendering speech__, making the pronunciation wack af
  • p

    PixPrucer

    12/15/2022, 9:16 AM
    As an example
    Częściowo śmierć się ułaskawiła
    gets pronounced as
    Czesciowo smierc sie ulaskawila
  • p

    PixPrucer

    12/15/2022, 9:21 AM
    Do we know what might be causing that ? The model was fine and pronounced everything correctly when I listened to tensorboard samples
  • p

    PixPrucer

    12/15/2022, 1:33 PM
    Just now was able to sit on the PC to render the sample The input was the first sentence, but the output sounds like the second one (incorrect)
  • p

    PixPrucer

    12/15/2022, 1:35 PM
    This manner repeats with other inputs as well
  • p

    PixPrucer

    12/15/2022, 1:36 PM
    Again, input text:
    Śmieszny ten błąd, który nie wiadomo skąd wynika.
    Gets pronounced as
    Smieszny ten blad, ktory nie wiadomo skad wynika.
  • h

    hecko

    12/15/2022, 8:24 PM
    uhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh did you select the polish base
  • h

    hecko

    12/15/2022, 8:24 PM
    -or sorry no if it's a synthesis problem then
1...909192...158Latest