Today, voice technology company DAISYS B.V. announces its release of a worldwide breakthrough in the development of human-sounding voices by means of artificial intelligence. The innovation, which narrates written texts in a natural way, generates new, realistically sounding, not yet existing voices. Speech properties like speed and pitch can be adjusted in real-time, allowing the voice to be customized.
“This is a huge breakthrough. Up until now, natural-sounding voices were always deepfake, based on audio data of professional speakers. But deepfakes in text-to-speech for many reasons aren’t usable. One of those reasons is that not everyone wishes to lend out their voice without having control over what is being said with it. With this technology, as the first company in the world, we are able to create new voices that sound like real people,” Barnier Geerling, CEO of DAISYS explains.
During the past year and a half, the start-up from Leiden has worked on its technology with a small international team of Artificial Intelligence (AI) developers.
“We’ve made several important adjustments to the existing basic technology. In addition, we had to cleverly ‘train’ our models, using the right balance of speech data from different speakers. Because of this we’ve managed to generate new, naturally sounding voices that can be real-time adjusted by means of gender, pitch, power and speed.” Dr.ir Joost Broekens, Chief Technology Officer at DAISYS, explains.
The new voice technology is suitable for all online and offline surroundings where the human voice is used, like traditional media, smart devices, games, robots, speech assistants and public announcement systems.
Visit www.daisys.ai to listen to samples of this breakthrough in voice technology.
About DAISYS B.V.
DAISYS B.V. is a speech technology company building machine learning technology that enables us to generate credible speech that is indistinguishable from real human voices, from text. Their technology is not based on copying existing voices through deepfakes, but on the generation of completely new, not yet existing, human-sounding voices. They distinguish themselves from other providers of this type of service by giving the speech intent and emotion.
Source: DAISYS B.V.