Recognition methods for speech and DTMF input
Whether you’re using Mix.nlu to recognize the caller’s request or response, a Nuance Recognizer speech grammar (.grxml or .gram), DTMF, or a combination, VoiceXML Connector converts this into a <grammar>
tag for your application. Mix.dialog provides the flexibility to handle different combinations of models and grammars, at any question and answer nodes in your dialog design. For example, you might use:
- DTMF with Mix ASR/NLU resources (default methods for intent and entity recognition)
- Nuance Recognizer speech grammars (NR grammars) for directed dialog, without support for intent switching, to recognize and parse:
- entity in focus and commands
- confirmation
- NR grammars and DTMF
- DTMF only
This table shows what to configure (if anything) to enable intent recognition, entity collection and confirmation, and command handling, for these use cases.
Scenario | Example use cases | Enabling for entity collection and confirmation | Command handling |
---|---|---|---|
ASR/NLU models only | NLU collection (intent and entities) | Default, no actions required. | Commands must be part of the NLU model. See Configure global commands. |
ASR/NLU models, and DTMF | Menu-type collection, or simple input | See Enabling DTMF input. | Speech commands must be part of the NLU model.DTMF mappings can be specified in Mix.dialog, or through an external DTMF grammar. |
NR grammar only | Alphanumeric collection, or complex speech-only recognition | See Enabling Nuance Recognizer grammars. | Speech commands must be defined in an external grammar, and also configured in Mix.dialog. |
NR grammar, and DTMF | Collecting strings of digits, or reusing grammars from an existing IVR project | See Enabling Nuance Recognizer grammars and Enabling DTMF input. | Speech commands must be defined in an external grammar.DTMF mappings can be specified in Mix.dialog, or through an external DTMF grammar. |
DTMF only | Language selection | See Enabling DTMF input. | DTMF mappings can be specified directly in Mix.dialog, or through an external DTMF grammar. |
The Grammars tab of the NLU resource panel allows you to export a grammar specification document, to help design and manage external referenced grammars.
Note:
By default, VoiceXML Connector can relay up to three recognition results from NLU or GrXML grammars. If a speech grammar (NLU or GrXML) is likely to return more than three results, or if you want to limit the number of results, you can override themaxnbest
property of VoiceXML Connector: create a variable with the name property_maxnbest, set it to the desired value (maximum 15), and pass it as a Send data parameter at the question and answer node that references this grammar.
Feedback
Was this page helpful?
Glad to hear it! Please tell us how we can improve.
Sorry to hear that. Please tell us how we can improve.