https://uberduck.ai/ logo
Join Discord
Powered by
# 🎃general🎃
  • k

    kaptain

    11/21/2020, 10:04 PM
    ah
  • z

    zwf

    11/21/2020, 10:04 PM
    do
    head metadata.csv
  • z

    zwf

    11/21/2020, 10:04 PM
    it's already almost right
  • z

    zwf

    11/21/2020, 10:04 PM
    you'll see
  • z

    zwf

    11/21/2020, 10:04 PM
    each line contains a file path and the transcription
  • k

    kaptain

    11/21/2020, 10:05 PM
    so to be clear, that txt file that prepare mels expects has the audio file name and transcribed text pipe separated
  • z

    zwf

    11/21/2020, 10:05 PM
    yeah exactly. and that's what transcribe_dataset.py will output
  • k

    kaptain

    11/21/2020, 10:05 PM
    sweet
  • k

    kaptain

    11/21/2020, 10:05 PM
    so i just have to separate it
  • k

    kaptain

    11/21/2020, 10:05 PM
    *split
  • k

    kaptain

    11/21/2020, 10:06 PM
    thanks
  • z

    zwf

    11/21/2020, 10:06 PM
    yee
  • a

    Ajay

    12/12/2020, 9:05 PM
    here from vocodes, very cool project
  • z

    zwf

    12/13/2020, 2:17 AM
    thanks for checking it out!
  • z

    zwf

    12/13/2020, 11:23 PM
    Using GPT3 to generate jeopardy style questions and answers based on topics that don't actually appear in jeopardy...
  • z

    zwf

    12/13/2020, 11:24 PM
    Copy code
    Q: Gradient descent can be expressed in terms of this quantity.
    A: What is the loss function?
    R: No, the correct answer is the hessian matrix.
    
    Q: An artificial neural network that predicts the next symbol in a sequence based on the previous symbols is called this.
    A: What is a recurrent neural network?
    R: That was the correct answer.
  • z

    zwf

    12/13/2020, 11:24 PM
    questions, answers and followup remark were all generated by GPT3
  • z

    zwf

    12/13/2020, 11:26 PM
    Definitely need to work on improving pronunciation but pretty promising start
  • z

    zwf

    12/13/2020, 11:53 PM
    to avoid any copyright issues we can call it GPrT (pronounced "jeopartee")
  • a

    Ajay

    12/14/2020, 12:46 AM
    did gpt3 ever become public?
  • z

    zwf

    12/14/2020, 1:45 AM
    it isn't public, but they keep expanding the private beta
  • z

    zwf

    12/14/2020, 1:45 AM
    I think there's probably around 10k people in it by now
  • z

    zwf

    12/14/2020, 1:45 AM
    don't think they'll release the model itself though
  • a

    Ajay

    12/14/2020, 2:10 AM
    ah, I see. I thought they stopped when I saw the microsoft announcement
  • z

    zwf

    12/16/2020, 8:34 PM
    I started training trebek on a dataset with arpabet transcriptions to try to improve pronunciations
  • z

    zwf

    12/16/2020, 8:36 PM
    fairly mediocre results so far but I think that's because I don't have enough data (~1 hour or so), and the base LJSpeech model wasn't trained on arpabet at all. At least it's learning some of the tokens
  • z

    zwf

    12/16/2020, 8:36 PM
    The inputs for each of these:
  • z

    zwf

    12/16/2020, 8:37 PM
    Copy code
    "{AY1}, {Z AE1 K}, {W AH1 N S} {W AA1 Z} {AH0} {M AE1 N}... {DH EH1 N} - {L EY1 T ER0}, {AY1} {B IH0 K EY1 M} {B IY1 S T}!!"
      "{M AY1} {F EY1 V ER0 IH0 T} {W ER1 D} {IH1 Z} {AE2 N T AY0 D IH2 S AH0 S T AE2 B L IH0 SH M AH0 N T EH1 R IY0 AH0 N IH2 Z AH0 M}."
      "{M AY1} {F EY1 V ER0 IH0 T} {HH AH0 W AY1 AH0 N} {AY1 L AH0 N D} {IH1 Z} {M AW1 IY0}."
  • z

    zwf

    12/17/2020, 7:58 PM
    I decided to try to improve on this approach by training on LJSpeech with both normal and arpabet transcriptions .... here's a sample from 5 epochs in 😁
  • z

    zwf

    12/17/2020, 7:59 PM
    never trained tacotron from scratch before so this'll be interesting! hopefully she'll learn to speak
12345...6886Latest