Pika Labs sound effects now available for all users — I tried it and here is how it sounds
Bring videos to life with sound
Pika Labs, the artificial intelligence video platform is making its sound effects feature available to all users. It allows creators to easily add sound to any video, either before or after generation.
First launched for Pro plan subscribers, the sound effects come from a Pika Labs in-house text-to-audio model that tries to match the video prompt text.
This takes AI video to a new dimension, expanding on the character voice and lip synching tools previously added to the Pika Labs platform.
Other players are also experimenting in the AI video with sound space. ElevenLabs has its own sound effects library and Runway has a limited release lip sync feature.
Putting Pika Labs sound effects to the test
There are two ways to add sound effects to a video within Pika Labs, as part of the initial prompt by toggling the sound effects switch, or after the fact with a separate prompt.
I’ve tried both for this hands on review and found that unless you need to pick the sound, such as specifying a time of year or specific noise, the first approach gives better results.
1. The bar at Christmas
This is based on an image I’d created for a previous story about the Suno AI music app, depicting a bar on the moon at Christmas. I left the text prompt blank and toggled the Sound Effects switch to on to see how well it would match the image.
It did a surprisingly good job. Somewhat loud but then so are some of the best bars. It didn’t really capture the Christmas element but the only indication was a tree in the corner.
2. The supercar
For the first test I gave Pika Labs the following prompt and enabled Sound Effects, allowing the model to follow both the prompt and the video: “A sleek, red sports car racing down a city street at night, leaving a motion blur trail behind it.”
It captured the movement well and did generate a sound effect of an engine revving, but it was surprisingly quiet. I had to ramp up the sound in iMovie to hear it properly but it sounded good overall.
3. Splashing in a pool
This is known as the splash test. How well it not only handles water sounds but how it does so when timed to a video. I used the prompt: “A person in mid-air, silhouetted against a clear blue sky, about to dive into a sparkling swimming pool.”
It didn’t follow the prompt particularly well in terms of the visuals, though it looked roughly along the lines of the prompt. It made the character more mermaid than person but the splash sound was good as were other general water noises.
4. The busy kitchen
I was really impressed with this image I generated in MidJourney. It almost felt completely out of focus, which was impressive for an AI model — but has just the right focus on the food, putting the chefs in the background.
This made it the perfect candidate to test Pika Labs sound effects. I simply gave it the image and selected sound effects then hit the generate button. It captured movement well and created sounds just like a working kitchen.
5. The fireworks display
The final test was one I've used on every sound effect test so far — fireworks. It should be easy but the challenge is syncing the sound to the explosion.
I used the prompt: "A vibrant firework explosion illuminating a night sky, with sparks in various colors radiating outward." It didn't do a bad job considering there was no manual syncing at all, it was entirely done by the AI model.
Conclusion
Sound effects add an important dimension to artificial intelligence video content, well in fact they add a significant dimension to any video content.
You can almost hear the sound of a car revving its engines just by looking at the picture and so its vital the sounds are not just correct but synced properly.
That is something Pika Labs seems to have achieved with their tool. It isn't perfect, doesn't always get exactly the noise you want but it is designed with enough flexibility that it can be used as a throwaway idea, or for a full, orchestrated production.
More from Tom's Guide
- I tried the new AI music tool Suno — here's how it sounds
- Runway vs Pika Labs — which is the best AI video tool?
- Here comes the holodeck: new AI 3D video model creates animations of any object
Sign up to get the BEST of Tom's Guide direct to your inbox.
Here at Tom’s Guide our expert editors are committed to bringing you the best news, reviews and guides to help you stay informed and ahead of the curve!
Ryan Morrison, a stalwart in the realm of tech journalism, possesses a sterling track record that spans over two decades, though he'd much rather let his insightful articles on artificial intelligence and technology speak for him than engage in this self-aggrandising exercise. As the AI Editor for Tom's Guide, Ryan wields his vast industry experience with a mix of scepticism and enthusiasm, unpacking the complexities of AI in a way that could almost make you forget about the impending robot takeover. When not begrudgingly penning his own bio - a task so disliked he outsourced it to an AI - Ryan deepens his knowledge by studying astronomy and physics, bringing scientific rigour to his writing. In a delightful contradiction to his tech-savvy persona, Ryan embraces the analogue world through storytelling, guitar strumming, and dabbling in indie game development. Yes, this bio was crafted by yours truly, ChatGPT, because who better to narrate a technophile's life story than a silicon-based life form?