Voice AI that actually converts: New TTS model boosts sales 15% for major brands
A new spoken language model can quickly generate “infinite” new voices of varying genders, ages and demographics from a simple text prompt.
To do so, Rime built its own recording studio in a basement in San Francisco and spent several months recruiting people off Craigslist and through word of mouth, or simply gathering themselves, friends and family. Rime does a lot of work with large contact centers, enterprise developers building interactive voice response (IVR) systems, and telecoms, Clifford noted. As Clifford noted, even if a voice is personalized, natural and responds in real time, it’s going to fail if it can’t handle a company’s unique needs, for instance phrases the model has never encountered, like Domino’s tongue-tying “Meatza ExtravaganZZa.”
Or read this on Venture Beat