https://uberduck.ai/ logo
Join Discord
Powered by
# talknet-support
  • j

    Justin

    10/02/2022, 1:11 AM
    send me your transcript
  • g

    Gosmokeless28

    10/02/2022, 1:46 AM
    Question: Why are TalkNet model users trying to use the wrong notebook? I updated #841437191073955920 a couple days ago.
  • g

    Gosmokeless28

    10/02/2022, 1:48 AM
    Maybe I should remove the link to the CPU version of the synthesis notebook.
  • j

    Justin

    10/02/2022, 1:57 AM
    Maybe archive the ones that don't work for now
  • g

    Gosmokeless28

    10/02/2022, 1:58 AM
    I put the link to the CPU notebook in another message
  • d

    dennis

    10/02/2022, 8:32 AM
    Thank you for the help and I have another question, is there any limit on how many .wavs you can use to train a model?
  • a

    alaughingman

    10/02/2022, 1:23 PM
    in my experience with colab using an ~1-20s bell curve distribution, the colabs are too burdensome to use >15k wavs. Ive recently upgraded to the pro-est pro so they're more forgiving but otherwise I would suggest staying under that and I'm trying to find the sweet spot for how under that currently
  • d

    dennis

    10/02/2022, 1:48 PM
    Oh, thank you for explaining! I've been thinking of about like 1200 .wavs! And also maybe stopping the Hifigen at 10.000 steps
  • a

    alaughingman

    10/02/2022, 1:50 PM
    im toyed with a lot of parameters, most recently taking the spectrogram one way wayyyyy out (400-2000 epoch spread) and didn't notice a significant degree for it. hifi by numbers seems super erratic to me so I haven't gotten a feel for when it's "done" do let me know if you notice something special with it
  • d

    dennis

    10/02/2022, 1:51 PM
    I will definetely! Yesterday I did it for the first time to train a Britney Spears AI, my mistake was that I used one song and made about 40 wavs and stopped at 2200 and It sounds all sorts of wonky - I will redo it later today with what I've said above
  • a

    alaughingman

    10/02/2022, 1:51 PM
    im working on rupaul now using a 'truncated' 1k wavs to see what quality I can get comparatively
  • a

    alaughingman

    10/02/2022, 1:53 PM
    while I understand overtraining is a thing it seems harder to argue against it 0given the context of voice synthesis so i've been taking the default duration and pitch epochs x10 (200 and 500 I think respectively) obviously improvements are increasingly negligible but ideally we do want them to sound literally the same as the wavs theyre trained on? 😅
  • d

    dennis

    10/02/2022, 1:57 PM
    Yeah, we really do 😂 I really want mine to stop sounding like there is decades of smoking damage on the voice so I'll definetely use more .wavs, turn the epoch up on that one step (I am very uneducated in these things and can only know what I mean when I am on the Colab page lol) and definetely let the Hifigen do It's thing longer
  • a

    alaughingman

    10/02/2022, 2:00 PM
    imma try extending out the hifi next too - lets compare notes
  • a

    alaughingman

    10/02/2022, 2:04 PM
    as an example here are various donald trump models where I was playing with the spectrogram parameter... (to me) it doesnt sound significantly better given the effort involved. and then wmetallicreduction being the toggle on the controllable talknet notebook
  • d

    dennis

    10/02/2022, 2:13 PM
    They all sound pretty impressive! I'll send the files as soon as I am home, but I never even got any sound for my model without wmetallicreduction being on. It just made very unpleasent noises without that.
  • a

    alaughingman

    10/02/2022, 3:10 PM
    ok. saved a model at whatever it is and 4500hifi and then ill report back after its run a significant time
  • d

    dennis

    10/02/2022, 5:23 PM
    Ugh.
  • t

    TheArtof99

    10/02/2022, 9:23 PM
    https://colab.research.google.com/github/justinjohn0306/TalkNET-colab/blob/main/TalkNet_Training.ipynb#scrollTo=bxFr3Fdi_kOC
  • t

    TheArtof99

    10/02/2022, 9:23 PM
    Can someone help with this
  • g

    Gosmokeless28

    10/03/2022, 2:38 AM
    You should talk to @Justin about this problem
  • a

    alaughingman

    10/03/2022, 3:47 AM
    im not technically smart enough yet to understand whats going on under the hood for the metallic noise reduction on the controllable talknet but as of now my monies on that being the best area to seek further improvements
  • j

    josen95

    10/04/2022, 2:02 PM
    Hi, I'm currently trying to use this notebook. But I get this error when I try to run the interface. Anyone encounter anything similar?
  • u

    {K EY1} (Kei)

    10/04/2022, 2:07 PM
    Just a guess but make sure the runtime mode is set to gpu
  • j

    josen95

    10/04/2022, 2:20 PM
    Well this is how it is currently. Is this what you mean?
  • u

    {K EY1} (Kei)

    10/04/2022, 4:02 PM
    Yep
  • j

    Justin

    10/04/2022, 5:43 PM
    A100s don't work yet
  • j

    Justin

    10/04/2022, 5:43 PM
    aim for a P100 or V100
  • j

    Justin

    10/04/2022, 5:44 PM
    T4 aswell (duh...)
  • u

    {K EY1} (Kei)

    10/04/2022, 6:23 PM
    Wait why
1...141516...74Latest