Google DeepMind Develops Human-Sounding Computer Speech

Google’s British AI unit DeepMind has announced a major step forward in the synthesis of human-sounding machine speech. Researchers say their WaveNet model produces speech that listeners rated 50% more convincing than existing computer-generated speech. The neural network models the raw waveform of the audio signal, predicting one sample at a time. Given that there can be as many as 16,000 samples per second of audio, and that each prediction is influenced by every previous one, it is by DeepMind’s own admission a rather “computationally expensive” technique.
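To see why sample-by-sample generation is so costly, here is a toy sketch of the autoregressive loop the article describes. This is not DeepMind’s actual WaveNet; the “model” is a hypothetical stand-in (a decaying weighted average over a fixed window of past samples), used only to show that every new sample depends on the output so far, forcing a strictly serial loop at 16,000 steps per second of audio.

```python
SAMPLE_RATE = 16_000       # samples per second, as quoted in the article
RECEPTIVE_FIELD = 64       # how many past samples the toy model looks at

def predict_next_sample(history):
    """Stand-in for a neural net: predict the next sample from the past."""
    context = history[-RECEPTIVE_FIELD:]
    # Hypothetical rule: recent samples get exponentially more weight.
    weights = [0.9 ** i for i in range(len(context) - 1, -1, -1)]
    total = sum(w * s for w, s in zip(weights, context))
    return total / sum(weights)

def generate(seconds, seed=0.5):
    """Generate audio one sample at a time -- the costly serial loop."""
    samples = [seed]
    for _ in range(int(seconds * SAMPLE_RATE) - 1):
        # Each step must wait for all previous output: no parallelism.
        samples.append(predict_next_sample(samples))
    return samples

audio = generate(0.01)     # just 10 ms of audio takes 160 sequential steps
print(len(audio))          # 160
```

Even this trivial loop runs 16,000 predictions per second of output; with a deep network in place of the toy rule, each of those steps is a full forward pass, which is what makes the approach expensive.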


For WaveNet to utter actual sentences, the researchers must also feed the program linguistic and phonetic information. So if it is such an expensive method, why has DeepMind chosen it? The researchers believe it is the best way of genuinely advancing human-sounding machine speech.

