Microsoft unveils text-to-speech AI model VALL-E, which was trained on English speech data and can simulate a person's voice with three seconds of sample audio (Benj Edwards/Ars Technica)