OpenAI Demos Voice Engine, But Not Ready for Wide Launch


Audio deepfakes, or AI-generated audio that impersonates someone, aren’t new, and they’re the reason that OpenAI, the company that brought AI chatbots to the mainstream, is holding back on releasing its latest offering.

OpenAI announced on Friday that it was choosing to preview, but not widely release, its realistic text-to-speech voice generator Voice Engine because of “the potential for synthetic voice misuse.”

In a blog post, the company outlined Voice Engine’s ability to take a 15-second sample of someone’s voice and mimic it realistically and with emotion, as directed by a text input.

“If you have the right audio setup, it’s basically a human-caliber voice,” Jeff Harris, a product lead at OpenAI, told Bloomberg. “It’s a pretty impressive technical quality.”

OpenAI has been privately testing Voice Engine since developing it more than a year ago, and said in its blog post that the technology can be “used for good.”

In one application that the company previewed, Voice Engine helps people who are non-verbal by giving them distinct voices across many languages. Livox, an alternative communication app, has already started using Voice Engine for that purpose, according to OpenAI.

Voice Engine could also translate videos and podcasts into other languages and provide reading assistance to children and non-readers through audio content.

OpenAI pointed to its AI safety and voluntary commitments policies when stating its rationale for previewing, but not releasing, Voice Engine to the public. The preview was meant to showcase Voice Engine’s capabilities while also emphasizing “the need to bolster societal resilience against the challenges brought by ever more convincing generative models,” the company stated.

Synthetic voices have captured interest in the startup world, with AI voice cloning company ElevenLabs valued at $1.1 billion in January after launching in beta only about a year ago. The technology has also come under fire for the new power it gives cybercriminals, who can use it to impersonate people or access funds through voice-based banking.

Last month, OpenAI previewed Sora, an AI video generator that creates realistic videos from prompts.
