https://uberduck.ai/ logo
Join Discord
Powered by
# 🎃general🎃
  • z

    zwf

    01/15/2021, 3:06 PM
    oh it sounds like crap haha
  • z

    zwf

    01/15/2021, 3:06 PM
    not english
  • u

    user

    01/15/2021, 3:06 PM
    do you havea screen recorder
  • u

    user

    01/15/2021, 3:06 PM
    ?
  • z

    zwf

    01/15/2021, 3:07 PM
    I can upload a file
  • u

    user

    01/15/2021, 3:07 PM
    ok
  • u

    user

    01/15/2021, 3:07 PM
    do it
  • u

    user

    01/15/2021, 3:07 PM
    wich Graphics card do you have
  • u

    user

    01/15/2021, 3:08 PM
    ?
  • z

    zwf

    01/15/2021, 3:08 PM
    I train usually on an NVIDIA V100
  • u

    user

    01/15/2021, 3:09 PM
    i had a nvidia gtx 560 ti
  • u

    user

    01/15/2021, 3:09 PM
    but its buggy graphics
  • u

    user

    01/15/2021, 3:09 PM
    less powerfull
  • u

    user

    01/15/2021, 3:09 PM
    but still works
  • u

    user

    01/15/2021, 3:10 PM
    lol
  • z

    zwf

    01/15/2021, 3:10 PM
    yeah haha
  • u

    user

    01/15/2021, 3:10 PM
    did you type numbers?
  • z

    zwf

    01/15/2021, 3:10 PM
    nope haha
  • z

    zwf

    01/15/2021, 3:10 PM
    he just only learned to say numbers
  • u

    user

    01/15/2021, 3:11 PM
    he act like 1 years old
  • z

    zwf

    01/15/2021, 3:11 PM
    yeah, I'm working on a speech embedding model so I'm hoping that that will allow synthesizing voices with much less data
  • z

    zwf

    01/15/2021, 3:12 PM
    in this case I think there probably isn't enough diversity in the dataset
  • u

    user

    01/15/2021, 3:12 PM
    wich speech are you working on?
  • z

    zwf

    01/15/2021, 3:12 PM
    I'm trying to reproduce this paper: https://arxiv.org/pdf/1910.10838.pdf
  • z

    zwf

    01/15/2021, 3:13 PM
    it is similar to Tacotron2, but it incorporates speaker embeddings which are independently trained on a large, unlabeled multispeaker dataset.
  • u

    user

    01/15/2021, 3:14 PM
    https://colab.research.google.com/github/pytorch/pytorch.github.io/blob/master/assets/hub/nvidia_deeplearningexamples_tacotron2.ipynb
  • u

    user

    01/15/2021, 3:15 PM
    this is only demo
  • u

    user

    01/15/2021, 3:16 PM
    https://colab.research.google.com/github/tugstugi/dl-colab-notebooks/blob/master/notebooks/RealTimeVoiceCloning.ipynb
  • u

    user

    01/15/2021, 3:16 PM
    i use this most but sounds bad
  • z

    zwf

    01/15/2021, 3:17 PM
    how'd the michael stevens model sound when you did that?
1...456...6886Latest