Hume AI launches OCTAVE — suddenly everything can get a voice

Hume AI on an iPhone screen
(Image credit: Shutterstock / Future)

Hume AI, the empathetic AI voice company, has just unveiled OCTAVE (Omni-Capable Text And Voice Engine). This new model combines the capabilities of the EVI 2 speech-language model with advanced emotional and cloning functionality. And all in an extremely small form factor.

OCTAVE can take a prompt or brief recording, and generate not just words but also expressive emotions, dialects, and other components of a full personality.

The product can also understand a wide range of prompt instructions, for example, the user can request a ‘gentle therapist’ or an ‘excitable salesman’, and the model will output the required response instantly and with minimal latency.

Hume Fall Update: Introducing EVI 2 - YouTube Hume Fall Update: Introducing EVI 2 - YouTube
Watch On

A key feature of the new model is the fact that it can do all this on the fly, and so users can instantly create emotive expressive characters just by uploading a short five second audio sample or entering a prompt which sets the scene.

Having cloned a voice, the model can then combine it with real-time conversation on the fly. It's not hard to visualize how this could be abused in the future.

This is a big improvement over most current voice models, which generally only offer a limited set of 'personality' voice types, and opens up the field to much more engaging AI interactions with zero training.

The demo launch clips demonstrate a wide range of prompt variation, including different accents and temperaments, as well as vague terms such as ‘favorite uncle’ or a ‘voice that barrels through conversations like rush hour traffic.’

The results are good, although it has to be said they’re not perfect. It’s still possible to detect artifacts bleeding through the audio at times, especially with the more exotic prompt demands.

What’s slightly more impressive, and actually quite worrying, is the model’s ability to clone a voice. Again there are demos on the launch page which demonstrate a striking ability to mimic voices, including that of Ilya Sutskeva and Humes own Lauren Kim.

Having cloned a voice, the model can then combine it with real-time conversation on the fly. It's not hard to visualize how this could be abused in the future.

Potential use cases

This multi-voice capability can be used to create instant podcasts which integrate real live human chat with a cloned voice of choice. Imagine setting up a podcast with you in conversation with Barack Obama or John Wayne, without any lengthy tedious training.

This even goes so far as the ability to clone multiple characters, such as taking an existing podcast from Google’s NotebookLM and using it to generate a completely new conversation on the fly.

With a nod towards edge devices such as smartphones, OCTAVE is a modest model, featuring just 3B parameters. The implication is that the new product will give voice to many more smaller devices, and maybe even consumer appliances, opening up a whole new universe of interactive possibilities.

The product is only available to a select number of trusted testers at the moment, to ensure safety and functionality, but the company plans a wider rollout over the next few months.

More from Tom's Guide

Category
Arrow
Arrow
Back to MacBook Air
Brand
Arrow
Processor
Arrow
RAM
Arrow
Storage Size
Arrow
Screen Size
Arrow
Colour
Arrow
Storage Type
Arrow
Condition
Arrow
Price
Arrow
Any Price
Showing 10 of 98 deals
Filters
Arrow
Show more
Nigel Powell
Tech Journalist

Nigel Powell is an author, columnist, and consultant with over 30 years of experience in the technology industry. He produced the weekly Don't Panic technology column in the Sunday Times newspaper for 16 years and is the author of the Sunday Times book of Computer Answers, published by Harper Collins. He has been a technology pundit on Sky Television's Global Village program and a regular contributor to BBC Radio Five's Men's Hour.

He has an Honours degree in law (LLB) and a Master's Degree in Business Administration (MBA), and his work has made him an expert in all things software, AI, security, privacy, mobile, and other tech innovations. Nigel currently lives in West London and enjoys spending time meditating and listening to music.

Read more
Hume AI on an iPhone screen
Hume AI just unveiled Octave — new AI voice generator is eerily human
A person talking to their phone
I just chatted with DeepSeek — here’s how to try it yourself with ElevenLabs’ new voice integration
Google Audio Overview feature from NotebookLM
Google NotebookLM just got way better with its new interactive features — here's why I'm impressed
OpenAI logo
OpenAI ChatGPT-4.5 is here and it's the most human-like chatbot yet — here's how to try it
OmniHuman screenshot of AI generated video
TikTok parent company just launched stunning AI video generator — OmniHuman-1 is taking the world by storm
selfie avatar images
Synthesia just launched the most realistic Selfie Avatars I’ve ever seen — here’s how to try it
Latest in AI
A mother and daughter happily browse the internet
7 AI hacks every mom needs to stop feeling exhausted all the time
Woman using ChatGPT app on the beach
I just tested ChatGPT-4.5 vs ChatGPT-4o with 7 prompts — here's my verdict
ChatGPT on iPhone
5 mind-blowing ChatGPT prompts you’ll wish you knew sooner
Smiling man in a kitchen preparing a vegan meal
Doritos were invented at Disneyland? I asked Google Gemini Deep Research for snack facts— my mind is blown
Sam Altman
ChatGPT-4.5 delayed in surprise announcement — and it could launch with a controversial new payment model
AI Mode of google search
Google launches 'AI Mode' for search — here's how to try it now
Latest in Features
Nothing Phone 3a Pro vs Pixel 8a.
I shot over 200 photos with the Nothing Phone 3a Pro vs Pixel 8a — here’s the winner
Roku OS 10 update — Virtual Surround
5 ways to get surround sound in a small room
Man in a low squat with arms extended at shoulder height during gorilla walk exercise
Mobility coach shares 'squat like a baby' routine to boost lower-body flexibility and improve hip health
The Silent Beacon Bluetooth panic button worn on a wrist next to a Fitbit
I tried a physical panic button for 48 hours — and this tiny device already makes me feel safer
A woman with long dark hair sits up in bed with her arms stretched in the air as sunlight streams in through her open curtains
You lose an hour of sleep this weekend, here’s why that’s a good thing
the dyson airwrap ID in teal and terracotta colorway (patina and orange) with a lapis case, with a brush, hairfryer, curling wand attachments
I’ve ruined my hair with 3 years of perms — but the new Dyson Airwrap ID saved my locks