As people brace for the disruptive impact of artificial intelligence on jobs and everyday living, those in the world of audio books say their field is already being transformed.
AI has the ability to create human-sounding recordings — at assembly-line speed — while bypassing at least part of the services of the human professionals who for years have made a living with their voices.
Many of them are already seeing a sharp drop off in business.
Tanya Eby has been a full-time voice actor and professional narrator for 20 years. She has a recording studio in her home. But in the past six months she has seen her workload fall by half. Her bookings now run only through June, while in a normal year they would extend through August.
"Many of her colleagues report similar declines. While other factors could be at play. It seems to make sense that AI is affecting all of us," she told AFP.
There is no label identifying AI-assisted recordings as such, but professionals say thousands of audio books currently in circulation use “voices” generated from a databank.
Among the most cutting-edge, DeepZen offers rates that can slash the cost of producing an audio book to one-fourth, or less, that of a traditional project.
The small London-based company draws from a database it created by recording the voices of several actors who were asked to speak in a variety of emotional registers.
“Every voice we use, we sign a licence agreement, and we for the recordings,” said DeepZen CEO Kamis Taylan.
For every project, he added, “we pay royalties based on the work that we do.”
"Not everyone respects that standard. All these new companies are popping up who are not as ethical,” Eby said, and some use voices found in databases without paying for them.
“There’s that gray area being exploited by several platforms," Taylan agreed.
“They take your voice, my voice, five other people’s voices combined that just creates a separate voice... They say that it doesn’t belong to anybody.”
All the audio book companies contacted by AFP denied using such practices.
Speechki, a Texas start-up, uses both its own recordings and voices from existing databanks, said CEO Dima Abramov. The five largest US publishing houses didn’t respond to requests for comment.
But professionals contacted by AFP said several traditional publishers are already using so-called generative AI, which can create texts, images, videos and voices from existing content — without human intervention.
“Professional narration has always been, and will remain, core to the audible listening experience,” said a spokesperson for that Amazon subsidiary, a giant in the American audio book sector.
“However, as text-to-speech technology improves, we see a future in which human performances and text-to-speech generated content can coexist," he added.
US technology giants, deeply involved in the explosively developing field of AI, are all pursuing the promising business of digitally narrated audio books.
Apple already said it will move into AI-narrated audio books, to make the creation of audio books more accessible to all, notably independent authors and small publishers.
Google is offering a similar service, which it describes as “auto-narration".
“We have to democratise the publishing industry, because only the most famous and the big names are getting converted into audio,” said Taylan.
“Synthetic narration just opened the door for old books that have never been recorded, and those from the future that never will be recorded because of the economics,” added Abramov.
But Abramov insisted that the growing market would also benefit voice actors.
“They will make more money, they will make more recordings,” he said.
“The essence of storytelling is teaching humanity how to be human. And we feel strongly that should never be given to a machine to teach us about how to be human,” said Emily Ellet, an actor and audio book narrator who co-founded the Professional Audio book Narrators Association.
“Storytelling should remain human entirely,” she added.
Eby underlined a frequent criticism of digitally generated recordings. When compared with a human recording, she said, an AI product “lacks in emotional connectivity".
Eby said she fears, however, that people will grow accustomed to the machine-generated version and thinks that’s quietly what’s kind of happening.
Her wish is simply that companies would let listeners know that they are listening to an AI-generated piece, adding that she just wants people to be honest about it.
"That is only after a contract has been signed covering usage rights," he said.