Uberduck #🎃general🎃

Join Discord

zwf

01/15/2021, 3:06 PM

oh it sounds like crap haha

zwf

01/15/2021, 3:06 PM

not english

user

01/15/2021, 3:06 PM

do you havea screen recorder

user

01/15/2021, 3:06 PM

zwf

01/15/2021, 3:07 PM

I can upload a file

user

01/15/2021, 3:07 PM

user

01/15/2021, 3:07 PM

do it

user

01/15/2021, 3:07 PM

wich Graphics card do you have

user

01/15/2021, 3:08 PM

zwf

01/15/2021, 3:08 PM

I train usually on an NVIDIA V100

user

01/15/2021, 3:09 PM

i had a nvidia gtx 560 ti

user

01/15/2021, 3:09 PM

but its buggy graphics

user

01/15/2021, 3:09 PM

less powerfull

user

01/15/2021, 3:09 PM

but still works

user

01/15/2021, 3:10 PM

lol

zwf

01/15/2021, 3:10 PM

yeah haha

user

01/15/2021, 3:10 PM

did you type numbers?

zwf

01/15/2021, 3:10 PM

nope haha

zwf

01/15/2021, 3:10 PM

he just only learned to say numbers

user

01/15/2021, 3:11 PM

he act like 1 years old

zwf

01/15/2021, 3:11 PM

yeah, I'm working on a speech embedding model so I'm hoping that that will allow synthesizing voices with much less data

zwf

01/15/2021, 3:12 PM

in this case I think there probably isn't enough diversity in the dataset

user

01/15/2021, 3:12 PM

wich speech are you working on?

zwf

01/15/2021, 3:12 PM

I'm trying to reproduce this paper: https://arxiv.org/pdf/1910.10838.pdf

zwf

01/15/2021, 3:13 PM

it is similar to Tacotron2, but it incorporates speaker embeddings which are independently trained on a large, unlabeled multispeaker dataset.

user

01/15/2021, 3:14 PM

https://colab.research.google.com/github/pytorch/pytorch.github.io/blob/master/assets/hub/nvidia_deeplearningexamples_tacotron2.ipynb

user

01/15/2021, 3:15 PM

this is only demo

user

01/15/2021, 3:16 PM

https://colab.research.google.com/github/tugstugi/dl-colab-notebooks/blob/master/notebooks/RealTimeVoiceCloning.ipynb

user

01/15/2021, 3:16 PM

i use this most but sounds bad

zwf

01/15/2021, 3:17 PM

how'd the michael stevens model sound when you did that?