https://uberduck.ai/ logo
Join Discord
Powered by
# machine-learning
  • p

    postmates!!

    09/29/2022, 7:49 PM
    nice
  • p

    postmates!!

    09/29/2022, 9:06 PM
    spamtons never gonna sound right is he
  • h

    hecko

    09/29/2022, 9:26 PM
    you could try finetuning a hifi-gan model
  • p

    postmates!!

    09/29/2022, 9:37 PM
    i have no idea how to do that
  • h

    hecko

    09/29/2022, 9:38 PM
    #841437191073955920
  • p

    postmates!!

    09/29/2022, 9:38 PM
    alright ill test it out
  • p

    postmates!!

    09/29/2022, 9:39 PM
    which one is it?
  • p

    postmates!!

    09/29/2022, 9:39 PM
    oh wait this one?
  • h

    hecko

    09/29/2022, 9:41 PM
    yeah
  • p

    postmates!!

    09/30/2022, 12:47 AM
    if im transcribing someone in another language as arpa but they have some english lines should i just make the whole list arpa
  • u

    {K EY1} (Kei)

    09/30/2022, 2:25 AM
    You can write the english lines in english or arpa, doesn't really matter
  • p

    postmates!!

    09/30/2022, 2:30 AM
    alright thanks
  • m

    mega b

    09/30/2022, 2:33 PM
    https://phenaki.video/#interactive
  • m

    mega b

    09/30/2022, 2:33 PM
    Text to video but with long length (2 minutes)
  • h

    hecko

    09/30/2022, 2:50 PM
    hmhm https://github.com/jianfch/stable-ts
  • h

    hecko

    09/30/2022, 3:02 PM
    also they added a thing to accommodate prompt engineering
  • z

    zwf

    09/30/2022, 7:41 PM
    oh this is really nice
  • h

    hecko

    09/30/2022, 7:41 PM
    gonna have to upgrade my install because i tried prompt engineering on polish with
    prefix
    and it failed catastrophically
  • h

    hecko

    09/30/2022, 7:43 PM
    though with polish it seems to veer off anyway, like 5% of cases had random other languages at the end or sometimes from the start even
  • p

    PixPrucer

    10/01/2022, 7:01 AM
    Do we know if it's possible to train a sound diffusion model yet o(-( I've seen Mr. Pikachu send some results a while ago and it intrigues me to this day
  • p

    PixPrucer

    10/01/2022, 7:01 AM
    I just really am curious about AI dreaming ideas in my music style
  • t

    TheArtof99

    10/01/2022, 1:29 PM
    Ok now, I'm trying to download the PyTorch as said in the uberduck tutorial but appearently 10.2 and 11.3 have been discontinued what's the best subsitute
  • t

    TheArtof99

    10/01/2022, 1:29 PM
    Or they aren't working on my computer
  • t

    TheArtof99

    10/01/2022, 3:02 PM
    anyone?
  • h

    hecko

    10/02/2022, 2:47 PM
    currently getting speaker embeddings for all the tacotron voices, using this mess of code
    Copy code
    py
    api_key = # not giving you mine
    data_path = "/content/drive/MyDrive/uberduck-embeddings.json"
    
    !pip install speechbrain
    !pip install tinydb
    
    import os
    import numpy
    import requests
    import shutil
    import torchaudio
    import traceback
    
    from speechbrain.pretrained import EncoderClassifier
    from tinydb import TinyDB, Query
    from tinydb.storages import JSONStorage
    from tinydb.middlewares import CachingMiddleware
    from tqdm.auto import tqdm
    
    classifier = EncoderClassifier.from_hparams(source = "speechbrain/spkrec-xvect-voxceleb", savedir = "pretrained_models/spkrec-xvect-voxceleb")
    q = Query()
    
    headers = {
        "accept": "application/json",
        "authorization": f"Bearer {api_key}"
    }
    
    voice_list = requests.get("https://api.uberduck.ai/voices?mode=tts-basic", headers = headers).json()
    
    with TinyDB(data_path, storage = CachingMiddleware(JSONStorage)) as db:
      for i in tqdm([i for i in voice_list if not db.search(q.voicemodel_uuid == i["voicemodel_uuid"])]):
        try:
          r = requests.get(f"https://api.uberduck.ai/voices/{i['voicemodel_uuid']}/samples").json()
          audio = [torchaudio.load(requests.get(j["url"], stream = True).raw)[0] for j in r]
          embeddings = [classifier.encode_batch(j).squeeze().tolist() for j in audio]
          db.insert({**i, "embeddings": embeddings, "avg_embedding": numpy.average(embeddings, axis = 0).tolist()})
        except Exception:
          print(f"fucky wucky at {i['name']}")
          traceback.print_exc()
  • h

    hecko

    10/02/2022, 2:51 PM
    oh cool it seems that the for loop is being very slow to initialize now
  • h

    hecko

    10/02/2022, 2:55 PM
    fixed it seems
  • h

    hecko

    10/02/2022, 2:57 PM
    as my voice is on uberduck i'll record a new clip of myself speaking some random language and use that as a first test case
  • h

    hecko

    10/02/2022, 3:03 PM
    what the hay
  • h

    hecko

    10/02/2022, 3:08 PM
    usecases: - moderators (me!) looking for duplicate models - users looking for soundalikes of blacklisted voices - viewers curious about which voice meme zee used this time
1...985986987...1068Latest