I just tried a new text-to-speech AI tool that clones your voice in seconds

AI voice
(Image credit: Shutterstock)

OpenVoice is a new text-to-speech artificial intelligence technology that can clone any voice from a 30-second sample. And it keeps the tonal quality of that original voice as it turns your written text into spoken word audio. 

Text-to-speech made it to my list of the most important AI tools of the most important AI tools of the year last year. This is a new take on that approach, speeding up time to copy a voice.

While it was able to create a clone of my voice almost instantly, the output made me sound American rather than my native English. It does however do a very good job if you start with a neutral American accent.

In one of the example clips it referenced a sample of Elon Musk speaking. When you type in random text for his cloned voice to repeat the sounds are softer, less South African and more Southern California. You can hear this for yourself further down the article.

How does OpenVoice work?

The multilingual OpenVoice from MyShell has been trained on hours of voice samples. This allows it to identify patterns and speed up the time required to clone a new voice.

It can replicate the tone color of the reference speaker and unlike other tools like ElevenLabs, gives the user control over emotion, accent, rhythm, pauses and intonation.

OpenVoice has already been in use to provide voice cloning for the MyShell AI tool since May, used by tens of millions of users around the world to create personal AI chatbots.

How does OpenVoice sound?

I have only tried it through the demos on Lepton and HuggingFace, so it isn’t a true trial as that would require installing and running it on my own machine. However, from that short sample the emotion changing works very well, as does cloning US-based voices.

It struggles with strong accents, although that could be due to the limitations of the demo rather than the model as a whole. However, the samples provided on the project website also seem to focus heavily on US accents.

What makes OpenVoice stand out?

The gold standard in voice cloning from a short sample, with accurate sounding results so far is ElevenLabs. The company also allows speech-to-speech to improve realism. However, it is a commercial and somewhat expensive option for experimenting and hobbyists.

OpenVoice is available to install and run locally. It is also capable of greater degrees of realism, or at least more animation in the generated voice. This could be invaluable for someone making a cartoon or radio play as a school project and can’t afford actors.

The more realistic voice AI gets, particularly when a voice can be cloned in seconds, the more actors unions will be on alert. The recent SAG-AFTRA was in-part about the use of AI to deprive creatives of work.

I think we’ll see a push to copyright more aspects of an identity including vocal tone, motion and performance as AI increasingly replicates those factors.

More from Tom's Guide

Ryan Morrison
AI Editor

Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover. When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?

Read more
Hume AI on an iPhone screen
Hume AI just unveiled Octave — new AI voice generator is eerily human
Autiobooks screenshots
I tried the open source AI tool that creates audiobooks for free — here's my verdict
selfie avatar images
Synthesia just launched the most realistic Selfie Avatars I’ve ever seen — here’s how to try it
A person talking to their phone
I just chatted with DeepSeek — here’s how to try it yourself with ElevenLabs’ new voice integration
Google Audio Overview feature from NotebookLM
Google NotebookLM just got way better with its new interactive features — here's why I'm impressed
Otter.ai
I write for a living — and this AI transcription software is a true game changer
Latest in AI
A nervous woman looking at her phone
Is ChatGPT making us lonely? MIT/OpenAI study reveals possible link
AI in man's hand
AI
AI Madness faceoff logo
I just tested Grok vs. DeepSeek with 7 prompts — here's the winner
ChatGPT on iPhone
ChatGPT was down — updates on quick outage
Claude AI on phone sitting on keyboard
Claude 3.7 Sonnet now supports real-time web searching — but there's a catch
The Dnsys X1 Exoskeleton being worn
I tested an AI exoskeleton to help treat my immune arthritis — here’s what happened
Latest in News
Disney Plus logo
Disney Plus upgrade just fixed one of my biggest problems with the home page
Tom Hiddleston as Robert Laing in "High Rise" now streaming on Netflix
5 best Netflix movies in March you haven't watched yet
iPhone 16 with Apple Intelligence logo for iOS 18.1
iOS 18.4: All the newest Apple Intelligence features coming to your iPhone
Maria Debska in "Just One Look" now streaming on Netflix
3 best Netflix shows in March you haven't watched yet
Split image featuring the Galaxy S25 Edge (left) and Galaxy S25 Ultra (right)
Samsung Galaxy S25 Edge just tipped for two Galaxy S25 Ultra-level features
Wolfenstein: The Old Blood
Amazon is giving away a ton of free games for its Big Spring Sale — here’s how to claim yours