I just got early access to ElevenLabs' AI sound effect generator — here's what it can do
Make sounds of anything from text
ElevenLabs is the most prominent, and one of the best, generative AI audio platforms available, creating synthetic voices and cloned voices from tiny samples.
Now it is moving into sound effects, letting creators generate a range of noises from a simple text prompt for use in presentations, movies or podcasts.
I’ve tried out the tool and it can create complex, multilayered sound effects from relatively simple prompts, including animal, atmospheric and technical noises.
It isn’t perfect yet. It struggles with certain noises and couldn’t capture the sound of a mid-90s modem at all, giving it too much musical tone and missing the harsh clicks, but overall it is impressive considering the sounds are AI generated rather than recorded.
How do ElevenLabs sound effects work?
Like any generative AI tool, the ElevenLabs sound effects model was trained on a vast amount of data, including recorded sounds of real-world objects.
Each object, animal or scene used in the training dataset would have been well labelled and described, allowing the AI to learn to replicate the noise.
To use the tool you get a standard text box that will be familiar to anyone who has used more or less any generative AI tool, and a button to generate.
The ElevenLabs sound effects generator gives you five sounds from the same prompt and, unlike AI image and video generators, each version can vary significantly from the others.
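If you would rather script this than use the web text box, the sketch below shows how a few takes of the same prompt might be requested programmatically. The endpoint path, header name, parameters and response format are assumptions based on ElevenLabs' general REST conventions rather than anything confirmed in this hands-on, so check the official API documentation before relying on them.

```python
import requests

API_KEY = "YOUR_ELEVENLABS_API_KEY"  # hypothetical placeholder
# NOTE: the endpoint path and request shape below are assumptions,
# not confirmed details from this article or the official docs.
ENDPOINT = "https://api.elevenlabs.io/v1/sound-generation"

def generate_sound_effect(prompt: str, out_path: str) -> None:
    """Request one sound effect for a text prompt and save the audio to disk."""
    response = requests.post(
        ENDPOINT,
        headers={"xi-api-key": API_KEY},
        json={"text": prompt},
        timeout=60,
    )
    response.raise_for_status()
    with open(out_path, "wb") as f:
        f.write(response.content)  # assumed to be raw audio bytes (e.g. MP3)

# Mimic the web tool's behaviour of offering several takes per prompt
# by requesting a few independent generations of the same description.
for i in range(5):
    generate_sound_effect("waves crashing on a beach", f"waves_{i}.mp3")
```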
How well do ElevenLabs sound effects work?
It is particularly good at weather sounds and those of the natural world, including animals, but it seems to struggle more with electronic sounds.
I gave it seven prompts to see how well it handled a range of sounds. These included a crackling campfire, ocean waves, a thunderstorm, wind chimes and an old-fashioned train whistle.
Overall it did well on most prompts, although of the five versions it generates in response to a prompt, usually only one or two were both good and longer than a few seconds.
The tool is also built in such a way that you can layer sounds. For a video of waves crashing on a beach I found I needed two sounds, one of waves and one of seagulls, as neither alone captured the full scene shown in the video.
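If you do generate the two effects separately, mixing them back together is straightforward. Here is a rough sketch using the pydub library, assuming both clips were saved as MP3 files with the hypothetical filenames used above; pydub needs ffmpeg installed to read and write MP3.

```python
from pydub import AudioSegment  # pip install pydub (requires ffmpeg)

# Hypothetical filenames for two separately generated effects.
waves = AudioSegment.from_mp3("waves_0.mp3")
seagulls = AudioSegment.from_mp3("seagulls_0.mp3") - 6  # turn the gulls down 6 dB

# Overlay the quieter seagull track on top of the waves and export the mix.
beach_scene = waves.overlay(seagulls)
beach_scene.export("beach_scene.mp3", format="mp3")
```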
Sound is a rapidly growing area of AI, from music tools to synthetic speech and now on to sound effects and custom noises. It gives a new dimension to generated video, a new form of storytelling for slides and makes podcast creation more immersive.
More from Tom's Guide
- Apple GPT: latest news, rumored release date and predicted features
- Bing with ChatGPT is now Copilot — what it means for you
- Google unveils Gemini AI for Bard chatbot — and it could beat ChatGPT