Voice replacement is getting faster to train, but seems to actually be getting worse with identifying pitch/keys.

There’s still an issue with reverb/echo and doubled vocals. The only way I was able to make this passable was to find pre-separared vocals, and even still it struggled with the pitch drifting, so I had to rerecord parts of it.

Still, I trained these in so-vits-svc for about 2 hours each on a 3080ti. I spent more time producing it than the AI needed to completely replace someones voice with someone else’s voice.

Combining these with deepfakes/wav2lip can give some damn good results. If anyone wants some guidance on the process for voice replacement, I can certainly share anything I’ve picked up along the way.

  • DisaA
    link
    English
    31 year ago

    This is really impressive. Kind of a bop.

  • @goat
    link
    English
    21 year ago

    how does Alex Jones actually sound good?

    • @soulnullOPM
      link
      English
      11 year ago

      There’s about 20 minutes of him yelling and screaming in the training data. Without it, he couldn’t hit these notes at all, I specifically looked for his angriest rants and screams to add to the data, now he hits it like a champ.

      Oh wait, did you mean that in a rhetorical sense? If so, I have no idea… lol