Give Your Apps a Voice TM

The Executive’s Guide to Driving ROI with Voice Experiences

voice hero

We’re all hearing a lot of buzz about voice experiences, also known as conversational user interfaces (CUIs). But will investing in voice deliver real benefits for your business? Here’s what you need to know to decide.

Throughout history, we’ve seen one rule proven again and again: when we find an easier way of getting something done, the old way of doing it fades into the background. That’s why voice capabilities will soon become integrated into every facet of human-facing technology.

Does that mean voice will replace screens? Certainly not. What we’re seeing instead is the emergence of VoiceCases—those moments when the easiest way of getting something done is via voice.

65 percent of people who own an Amazon Echo or Google Home can’t imagine going back to the days before they had a smart speaker. (source)

We’re looking at a tectonic shift in experience design, and we’ll see a huge and growing gap between the brands that “get it” and the brands that still insist on screen-based interactions where they no longer make sense.

How do we know this is more than a fad?

For one, we can look to what’s already happening in the market:

  • 1 in 2 smartphone users use voice technology on their smartphone (source)
  • Smart speaker ownership grew 54% in the 3 months (Dec '17 - Feb '18) (source)
  • 50% of all searches will be voice searches by 2020 (source)

But behind those trends are a few simple facts about the way we humans work:

  • words per minute
    Human typing speed
  • words per minute
    Human speaking speed
  • words per minute
    Human reading speed

The most efficient way for us to interact with technology is often to speak our request and read the app’s response.

It’s faster to speak when you have a request (“Alexa, show me movie times today”), and faster and easier to read the results (a table of movie times).

At WillowTree, we call these mixed voice-and-visual interactions multimodal experiences.

Moving forward, the best conversational UIs will allow for both voice and screen-based input and output. That is, they’ll be multimodal.

What are the implications for business?

Until recently, voice wasn’t reliable enough of a way to communicate with computers. But that’s changed. Microsoft’s voice transcription last year achieved a remarkably low error rate of 5.1%—making it as accurate as professional human transcribers.

Increased reliability means voice interactions are quickly becoming the easiest way to get simple tasks done. And users’ expectations—and their brand loyalty—will shift accordingly.

Rethink your UX

You need to rethink all of your customer’s interactions with your products and services now that there is an additional tool—voice—at your disposal.

When the steam engine was invented, you rethink your entire approach to transportation—rather than hooking a steam engine to a carriage.

Similarly, you can’t just create apps for your company on Google Home and Alexa and think you’ve checked the “voice” box. You have to think about how and where voice will be the most efficient, most delightful way for your users to complete a task—even if what they’re doing via voice is just one part of a flow that happens across multiple channels.

Opportunities inside your company

If employees at your company spend any time inputting information—completing forms, searching for parts, providing customer service, searching for customer records—voice technology is going to make your team more productive.

1 hour of paperwork only takes 20 minutes via voice, saving—

  • minutes
    Each day
  • minutes
    Per year
  • hours
    Each year, per employee

Multiply that by the number of employees at your company, and it’s easy to see how voice can enhance productivity—while removing some of the stress of performing repetitive tasks.

Work-related tasks that you can enhance with voice:

  • Querying large databases
  • Composing emails
  • Locating a part while performing a repair in the field

Opportunities with your customers

In order to discover the most valuable VoiceCases for your customers, you’ll need to do a deep dive into your customers’ specific wants and needs. But no matter what industry you’re in, the trend is clear.

Voice is going to have a profound impact.
  • Retail

    49% would prefer voice assistants over human interactions in shops/call centers (source)

  • Finance

    20% say they’d rather use a voice assistant instead of visiting a shop or bank branch (source)

  • Healthcare

    One quarter of voice assistant users are 55 and older (source)

  • Hospitality

    In April 2017, Bing Ads claimed that the number of Britons using voice search on their mobiles to book hotels rose by 343% from the previous year (source)

  • Media

    30% of smart speaker owners say the device is replacing time previously spent with TV (source)

  • Field Services

    32% of IT pros surveyed expect intelligent assistants to automate help desk tickets, improving customer service (source)

Across industries, customers will be able to make requests three times faster. That means transactions lose friction, conversions improve, and customers are happier.

It’s a simple fact: customers will return to an easy shopping path over a difficult one.

The analytics goldmine

Voice assistants not only streamline communications with your customers and employees, but let you learn from those interactions virtually in real time.

If someone leaves your website without buying anything, you can often only speculate as to what went wrong (or try to learn after the fact, via a survey). With voice, they’ve literally told you what they’re looking for, even if your voice app wasn’t able to give it to them.

Alexa, for example, can be a very low-cost way to learn how your business can deploy voice successfully, because it provides relatively robust analytics. Read our CEO’s thoughts about using Alexa skills to gain insights from your customers.

Getting started

1. Know the current state

As with any design or technology initiative, you need to understand the constraints. To help you get started, here are high-level and feature-level overviews of the four leading voice platforms.

The voice landscape

(Last updated April 2018)

How users access it

Google AssistantAmazon AlexaWindows CortanaApple Siri
Primary in-home hardwareGoogle Home devicesAlexa devices, including the Echo and ShowInvoke speaker, built by Harmon KardonHomePod (limited implementation)
People are also using it onMobile (Android and iOS), cars (via Android Auto)Mobile (Android and iOS), third-party hardware (including thermostats, watches and, yes, toilets)Mobile (Android and iOS), PCs, XBox OneiOS devices, cars (via CarPlay)

As a developer you can

Google AssistantAmazon AlexaWindows CortanaApple Siri
Send push notificationsYesYesNoN/A (there are not as of yet any Siri-specific)
Sell digital contentNoYes (one-time or subscription based)Yes (subscription)No
Sell goods and servicesYes, via Google's Shopping Actions (beta)Yes, but you have to handle it all within your skillYes, but you have to handle it all within your skillYes (ride booking)

And all of the above will soon be obsolete—so check back here often. The rate of change is astonishing. Here’s a look at just the past few months.

Recent changes in the voice landscape

October 2017

Microsoft announces the launch of Invoke, a Cortana-powered speaker by Harmon Kardon

January 2018

Amazon launches Mobile Accessory Kit, an SDK that enables OEMs to more easily integrate Alexa into their hardware

January 2018

Google announces several Assistant-enabled devices with screens, including the Lenovo smart display

January 2018

Google announces it’s bringing Assistant to Android Auto, bringing the virtual assistant to the car

February 2018

Google announces Routines for Assistant: users can now create a set of actions triggered together by a single command.

March 2018

Google launches Shopping Actions

April 2018

Amazon allows TV manufacturers to let Alexa control TVs

April 2018

Alexa gets Announcements feature, which lets one person in a house broadcast a recorded message to all other Alexa devices in the home

April 2018

Amazon launches Alexa Blueprints, which allows anyone to create custom Alexa skills without coding.

Coming soon (rumored)

P2P payments on Alexa

2. Find use cases

The unignorable VoiceCases

In April 2018, we conducted a survey of 824 people (all across the U.S.) to get a pulse on which voice use cases were resonating with users. We asked people to rate fifteen use cases on two criteria: usefulness and efficiency. Here’s what we found.

Based on this research and on what we’re seeing in the market, voice is poised to become the preferred interaction model for at least three use cases that currently happen primarily via screens.

Use CaseSummaryExamples
Specific SearchSearching for a known item or piece of informationPulling up media, directions, weather, FAQs, finding a part in inventory
Composition & LoggingNot suprising, but newly feasible; The impact will be hugeWriting emails, completing forms, composing lists
Coaching & InstructionAnything that requires guidance, especially when there is a benefit to being hands freeLearning a new skill, getting guidance in the completion of a task

Offer a Goldilocks feature set

3. Build the foundation

Regardless of the use cases you’re focusing on, or even the platforms you intend to start with, there a few things you should do to make you positioned to move quickly as this space continues to develop.

Take a platform-agnostic approach

The form factor, capabilities, and market share of voice-enabled devices will continue to shift rapidly over the coming years. Design a backend for your voice ecosystem that can support voice applications wherever they are: whether that’s on Google Home, a car, or a device that has yet to be invented. You should also have APIs that enable easy access for voice applications.

Start documenting knowledge and capabilities

That platform-agnostic backend needs to be able to fulfill on the use cases you’ve identified. In order to do that, it needs two things: knowledge, and capabilities.

  • What can my bot do? (e.g. Help a customer find the product that is right for them)
  • What does my bot need to know? (e.g. product prices, product attributes, product availability)
Gather training data

You need content to “train” your AI what your users mean when they use specific terms. This can come from any number of places; some of the most common places to look are in-person user interviews, customer support transcripts, internal or customer-facing knowledge bases and blog content.

In-person user interviews can yield lots of valuable data you can use to train your AI.

Reuse existing content

By the same token, you should audit all that customer-facing content for useful solutions to users’ problems. If you’ve surfaced use cases in your discovery process that you don’t yet have a solution for, you’ll need to develop new content to address them.

4. Design and build your simple application

So you’ve got some use cases in mind and you’ve got a backend that can support the conversations you need to have with your users. It’s time to launch something into the world!

Choosing the right platform or platforms to launch on is a complicated process—far too detailed for this discussion. But there are a few key things to consider:

  • What platforms is your audience currently using? (Alexa, Google Home, Facebook Messenger)
  • Which platforms have the right combinations of capabilities you need to make the experience great? (e.g. Do people need to be able to view products?)

And a few best practices:

  • Have an established personality for our voice apps (a “voice for voice”)
  • Established KPIs
  • Run regular user testing to generate further training data and hone the conversation structure
  • Design flows not in isolation, but across devices—this allows you to create a fully orbed “multimodal” experience

Appendix: Voice app capabilities by platform

(Last updated April 2018)

Google AssistantAmazon AlexaWindows CortanaApple Siri
Text messagingYesNo access to Echo-to-Echo calling and messagingNoYes
CallingYesNo access to Echo-to-Echo calling and messagingNoYes (VOIP)
Lists and NotesListsYes, access Alexa shopping and to-do listsListsLists and Notes
RemindersDaily updates, timersTimersNoNo
General content (jokes, games, stories, facts, etc)YesYes (+ offers Flash Briefing, which plays short bursts of content daily)YesNo
Music and audioYesYesYesYes
Video playbackYes, with compatible hardwareYes, with compatible Echo deviceNoNo
Car integrationYesYesNoYes
PurchasingUser account requiredUser account requiredUser account requiredNo
Ride bookingYesYesNoYes
Photo searchYesN/ANoYes
Health and fitnessYesYesYesYes
Smart home integrationYesYesYesNo
Mobile requiredNoNoNoiOS

App Development

Working on an RFP? Check this out.


Contact Us

Say hello.