Video Transcription

Converting DTMF to Speech Part 1 (2)

DTMF

(Dual Tone Multi Frequency), also called TouchTone, is a very common method for interacting with Interactive Voice Response (IVR) systems. It?s basically when users use the telephone?s keypad to make choices. Speech recognition offers an alternative method for interacting with IVR systems. We?re going to get into why you would want to convert a DTMF application to use speech recognition. We?ll also take a look at example applications using DTMF and how the call flow might be a little different using speech recognition. Finally, we?ll look at some of the prerequisites, some of the things to consider prior to getting started.

Why Speech?

To illustrate the difference between DTMF and speech, let?s say that we have an umbrella store. The user calls into our IVR and gets the DTMF application.

Application: "Thank you for calling Umbrellas-R-Us. To get store locations, press 1, to get hours of operation, press 2, to order umbrellas over the phone, press 3."

Caller: Inputs 3, to order an umbrella by phone.

Application: "We have 5 choices of colors for umbrellas. Press 1 for red, 2 for blue, 3 for green, 4 for yellow, and 5 for purple."

Caller: Inputs 5 to order a purple umbrella.

Application: "We have 2 sizes of umbrellas, press 1 for compact and 2 for full-size."

Caller: Inputs 2 for full size.

At this point the task is not particularly painful, but let?s perform the same task using speech.

The speech application answers the call?

Application: "Thank you for calling Umbrellas-R-Us. If you would tell me what you wish to do, we can respond immediately. Would you like store locations, store hours or would you like to order an umbrella?"

Caller: "Order an umbrella"

Application: "Simply tell me the color you would like. Would you like a red, blue, yellow, green, or purple umbrella?"

Caller: "Purple."

Application: "Compact or full size?"

Caller: "Full size."

Results

The first thing we notice is that we have eliminated the need for the "press one for, press two for," types of prompts, which take up time and can become repetitive and annoying to a caller. Also, with DTMF as a caller you find yourself attempting to memorize listed options until you here the choice for which you are looking. With speech, you?re prompted for a choice and you simply speak your selection.

So over all, speech allows you to get through the call process achieving your goal more expediently and affectively.

Before you start

Another thing you?ll want to do is to start simple if at all possible. Don?t take a very complex application and try and turn it into a speech application your first time out. There are aspects of the speech design that are different than working with DTMF. It may take a little longer to develop, tune, and test. If you have a smaller application it may be best to start with it. You may also want to consider enabling a portion of your application at first. This will enable you to get some idea as to how speech development software works, how to trouble shoot it, how users interact with it and then from there start considering revamping the entire application.

Description

The first of this series goes over the reasons for converting your DTMF application to speech recognition. This includes an in-depth sample application to show the difference in the call flow for your callers, as well as things to consider before you begin your conversion to speech project.

Runtime

Video playtime
10:08

Chapters In This Section

Tips & Resources


Part 1 Speech Recognition Basics
Part 1
Part 2 Speech Recognition Basics
Part 2

Now Playing

Part 3 Converting DTMF to Speech
Part 1
Part 4 Converting DTMF to Speech
Part 2
Part 5 Distributed Architecture
Part 6 MRCP vs. API, Part 1
Part 7 MRCP vs. API, Part 2
Part 8 Speech Recognition Don'ts
Part 9 Localizing Speech Applications, Part 1
Part 10 Localizing Speech Applications, Part 2