Conversational AI, emotional intelligence, and multimodal interactions are converging with OpenAI's latest release: ChatGPT's advanced voice mode. This groundbreaking feature, unveiled just last week, is set to revolutionize how we interact with AI-powered systems. And the magic powering these incredibly human-like interactions? An advanced language model with unprecedented emotional depth and versatility.
ChatGPT's advanced voice mode is built into the OpenAI model itself, allowing it to perform specific conversational tasks with a level of emotional intelligence and character voicing previously unheard of in AI systems. This mode, which can be experienced today through the ChatGPT interface, enables interactions that convey a wide range of emotions, adopt different character voices, and even mimic physical reactions like coughing or sneezing.
Currently, the technology is limited to OpenAI’s ChatGPT Plus subscribers, requiring manual prompting and interaction. However, the potential for integration into various applications and services is immense, promising to change the landscape of AI-powered conversations.
WillowTree’s Data and AI Research Team (DART) recently produced a whitepaper, How to Evaluate Conversational AI for Politeness, which also covers other hard-to-measure dimensions of communication such as empathy, attentiveness, and compassion. Advanced voice mode feels like the next natural step, moving from a text-based chatbot interface to audible vocalization.
ChatGPT’s advanced voice mode enables users to engage in more natural, emotionally nuanced conversations with artificial intelligence in real time, opening up new possibilities for training, customer service, and entertainment. I also see a lot of opportunity in the nitty-gritty of ESL and foreign language tutoring (think Duolingo-style use cases: practicing speech patterns, slowing down, switching between voices, and so on).
Here are some targeted use cases I can envision for various industries:
The Evolution from Text-Based AI to Emotionally Intelligent Voice AI
Text-based AI chatbots and voice assistants like Siri and Alexa have been around for years, but they've always lacked the emotional depth and versatility needed for truly human-like interactions. They required specific commands or phrases and often failed to understand context or convey appropriate emotions.
Similarly, earlier versions of AI-powered voice technology have been around for a while, powering things like automated customer service lines and basic voice commands. But with ChatGPT's advanced voice mode, we can see where the tech is headed.
As the potential core of future AI-powered, voice-first applications, this new voice technology will enhance users' interactions with AI systems, transforming key actions into intuitive, emotionally intelligent conversations and removing barriers that previously made AI interactions feel robotic or impersonal.
Here's why.
Your users aren't always engaging with AI just to get information; they often want meaningful, context-aware interactions that feel natural and responsive. OpenAI's version of conversational AI is like having a highly skilled voice actor at your fingertips: it understands context, can adopt different personas, and conveys a wide range of emotions to enhance the user experience.
When you use the advanced voice mode, ChatGPT learns about the emotional context of the conversation: what tone is appropriate, how to modulate its voice for different characters or scenarios, and how to respond to the user's emotional state.
The AI then uses this understanding to build a more nuanced and engaging conversation, improving its ability to provide appropriate responses and create more immersive experiences for the user in the future. It also informs the system on how to better adapt to different conversational scenarios, essentially prioritizing emotional intelligence in AI interactions.
At worst, this human-like experience could create an “uncanny valley” effect, a topic we often hear about from our founder Tobias Dengel, President of TELUS Digital Solutions, when he speaks about conversational AI at conferences and in his bestselling book Sound of the Future: The Coming Age of Voice Technology. Or, this kind of human-like emotion could be leveraged by bad actors for scams, high-pressure sales tactics, and the like.
At best, by leveraging this voice technology, you provide a rich and emotionally resonant experience, which will keep users engaged and coming back for more. Your brand can offer personalized and context-aware services that respond appropriately to users' emotional states and natural language prompts, increasing engagement and driving loyalty.
Before forecasting the future of emotionally intelligent AI, have you thought about the practical applications you can create today?
My advice is to imagine your application as an emotionally intelligent conversational partner with various personas and capabilities at your disposal. Most customer-facing businesses, for instance, could benefit from more effective training scenarios for handling difficult customer interactions.
Let's assume I want to create a training module for handling an irate customer who wants to cancel their subscription. Right now, I can manually create a scenario where:
...and once this is all set up as a training module, it can be used repeatedly to train customer service representatives with minimal setup time. Pretty rad.
So, what's the next step for your brand? Start identifying and implementing your emotionally intelligent AI strategy today. Strategic considerations for digital leaders might include:
The potential for emotionally intelligent conversational AI voice technology is vast and largely untapped. As our WillowTree product developers continue to explore generative AI, we invite you to join us in preparing for an era where AI-powered interactions are intuitive, emotionally resonant, and indispensable.
This space is moving quickly, so check back as we endeavor to stay on top of the most recent releases and features. Reach out today and let's make sure that when emotionally intelligent AI becomes widely available, your business is leading the charge in providing unrivaled digital experiences.