Uberduck #machine-learning

Join Discord

postmates!!

09/29/2022, 7:49 PM

nice

postmates!!

09/29/2022, 9:06 PM

spamtons never gonna sound right is he

hecko

09/29/2022, 9:26 PM

you could try finetuning a hifi-gan model

postmates!!

09/29/2022, 9:37 PM

i have no idea how to do that

hecko

09/29/2022, 9:38 PM

#841437191073955920

postmates!!

09/29/2022, 9:38 PM

alright ill test it out

postmates!!

09/29/2022, 9:39 PM

which one is it?

postmates!!

09/29/2022, 9:39 PM

oh wait this one?

hecko

09/29/2022, 9:41 PM

yeah

postmates!!

09/30/2022, 12:47 AM

if im transcribing someone in another language as arpa but they have some english lines should i just make the whole list arpa

{K EY1} (Kei)

09/30/2022, 2:25 AM

You can write the english lines in english or arpa, doesn't really matter

postmates!!

09/30/2022, 2:30 AM

alright thanks

mega b

09/30/2022, 2:33 PM

https://phenaki.video/#interactive

mega b

09/30/2022, 2:33 PM

Text to video but with long length (2 minutes)

hecko

09/30/2022, 2:50 PM

hmhm https://github.com/jianfch/stable-ts

hecko

09/30/2022, 3:02 PM

also they added a thing to accommodate prompt engineering

zwf

09/30/2022, 7:41 PM

oh this is really nice

hecko

09/30/2022, 7:41 PM

gonna have to upgrade my install because i tried prompt engineering on polish with

prefix

and it failed catastrophically

hecko

09/30/2022, 7:43 PM

though with polish it seems to veer off anyway, like 5% of cases had random other languages at the end or sometimes from the start even

PixPrucer

10/01/2022, 7:01 AM

Do we know if it's possible to train a sound diffusion model yet o(-( I've seen Mr. Pikachu send some results a while ago and it intrigues me to this day

PixPrucer

10/01/2022, 7:01 AM

I just really am curious about AI dreaming ideas in my music style

TheArtof99

10/01/2022, 1:29 PM

Ok now, I'm trying to download the PyTorch as said in the uberduck tutorial but appearently 10.2 and 11.3 have been discontinued what's the best subsitute

TheArtof99

10/01/2022, 1:29 PM

Or they aren't working on my computer

TheArtof99

10/01/2022, 3:02 PM

anyone?

hecko

10/02/2022, 2:47 PM

currently getting speaker embeddings for all the tacotron voices, using this mess of code

Copy code

py
api_key = # not giving you mine
data_path = "/content/drive/MyDrive/uberduck-embeddings.json"

!pip install speechbrain
!pip install tinydb

import os
import numpy
import requests
import shutil
import torchaudio
import traceback

from speechbrain.pretrained import EncoderClassifier
from tinydb import TinyDB, Query
from tinydb.storages import JSONStorage
from tinydb.middlewares import CachingMiddleware
from tqdm.auto import tqdm

classifier = EncoderClassifier.from_hparams(source = "speechbrain/spkrec-xvect-voxceleb", savedir = "pretrained_models/spkrec-xvect-voxceleb")
q = Query()

headers = {
    "accept": "application/json",
    "authorization": f"Bearer {api_key}"
}

voice_list = requests.get("https://api.uberduck.ai/voices?mode=tts-basic", headers = headers).json()

with TinyDB(data_path, storage = CachingMiddleware(JSONStorage)) as db:
  for i in tqdm([i for i in voice_list if not db.search(q.voicemodel_uuid == i["voicemodel_uuid"])]):
    try:
      r = requests.get(f"https://api.uberduck.ai/voices/{i['voicemodel_uuid']}/samples").json()
      audio = [torchaudio.load(requests.get(j["url"], stream = True).raw)[0] for j in r]
      embeddings = [classifier.encode_batch(j).squeeze().tolist() for j in audio]
      db.insert({**i, "embeddings": embeddings, "avg_embedding": numpy.average(embeddings, axis = 0).tolist()})
    except Exception:
      print(f"fucky wucky at {i['name']}")
      traceback.print_exc()

hecko

10/02/2022, 2:51 PM

oh cool it seems that the for loop is being very slow to initialize now

hecko

10/02/2022, 2:55 PM

fixed it seems

hecko

10/02/2022, 2:57 PM

as my voice is on uberduck i'll record a new clip of myself speaking some random language and use that as a first test case

hecko

10/02/2022, 3:03 PM

what the hay

hecko

10/02/2022, 3:08 PM

usecases: - moderators (me!) looking for duplicate models - users looking for soundalikes of blacklisted voices - viewers curious about which voice meme zee used this time