Nuance Recognizer call flow

Nuance Recognizer accepts spoken or written input and returns literal and semantic results. A recognition involves the following steps among the browser, Speech Server, and Nuance Recognizer:

  1. Preload grammars and acoustic models. Preload common or large grammars and the associated acoustic models at system initialization, so that they are available quickly for any session.
  2. Establish a session. Use the SIP INVITE method to open a communications port and determine the available codecs.
  3. Prepare for recognition. Load any additional context-specific grammars and configure the Recognizer. This step and the next are often accomplished with a single RECOGNIZE request.
  4. Perform recognition. Send audio through Speech Server to the Recognizer, which starts the recognition.
  5. End recognition. Stop the recognition, get the results, and return them to the application.
  6. Perform additional recognitions. Perform as many recognitions as needed within the active session.
  7. End the session.
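
The ordering of steps 2 through 7 can be sketched as a small state machine. This is only an illustrative model of the sequence described above, not part of any Nuance API: the `Phase` names, the `NEXT` table, and the `advance` and `run_session` helpers are all hypothetical. Step 1 (preloading) happens at system initialization, outside any session, so it is not modeled here.

```python
from enum import Enum


class Phase(Enum):
    """Hypothetical phase names for one session (steps 2 to 7)."""
    SESSION_OPEN = "session established"        # step 2
    PREPARED = "grammars loaded, configured"    # step 3
    RECOGNIZING = "audio streaming"             # step 4
    COMPLETE = "results returned"               # step 5
    SESSION_CLOSED = "session ended"            # step 7


# Allowed transitions. COMPLETE -> PREPARED models step 6:
# further recognitions within the same active session.
NEXT = {
    Phase.SESSION_OPEN: {Phase.PREPARED, Phase.SESSION_CLOSED},
    Phase.PREPARED: {Phase.RECOGNIZING},
    Phase.RECOGNIZING: {Phase.COMPLETE},
    Phase.COMPLETE: {Phase.PREPARED, Phase.SESSION_CLOSED},
    Phase.SESSION_CLOSED: set(),
}


def advance(current: Phase, target: Phase) -> Phase:
    """Move to the target phase, rejecting out-of-order steps."""
    if target not in NEXT[current]:
        raise ValueError(f"cannot go from {current.name} to {target.name}")
    return target


def run_session(recognitions: int) -> list:
    """Return the phases visited by a session with the given number of recognitions."""
    phase = Phase.SESSION_OPEN
    trace = [phase]
    for _ in range(recognitions):
        for target in (Phase.PREPARED, Phase.RECOGNIZING, Phase.COMPLETE):
            phase = advance(phase, target)
            trace.append(phase)
    trace.append(advance(phase, Phase.SESSION_CLOSED))
    return trace


if __name__ == "__main__":
    for visited in run_session(2):
        print(visited.name)
```

The model treats preparation and recognition as separate phases even though, as step 3 notes, a single RECOGNIZE request often covers both.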

The following figure shows the series of calls that occur during the recognition process. The figure does not include any prompts, which would normally precede a recognition. For a simplified call flow that includes prompts, see Speech processing call flow.

Notes:
  • This diagram illustrates MRCPv2 signaling.
  • The numbering of particular calls corresponds to the descriptions in the sections that follow the diagram.

diagram for Nuance Recognizer call flow
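
Because the diagram illustrates MRCPv2 signaling and step 3 refers to a RECOGNIZE request, it may help to see roughly how such a request is framed on the wire. The sketch below follows the general MRCPv2 message layout (a start line carrying the total message length, then headers, a blank line, and an optional body). The `build_mrcp_request` helper, the channel identifier, the Content-ID value, and the yes/no grammar are all assumptions made for illustration; in a real session the channel identifier would come from the SIP/SDP exchange in step 2.

```python
def build_mrcp_request(method: str, request_id: int,
                       headers: dict, body: bytes = b"") -> bytes:
    """Frame an MRCPv2 request: start line, headers, blank line, optional body."""
    lines = [f"{name}:{value}" for name, value in headers.items()]
    if body:
        lines.append(f"Content-Length:{len(body)}")
    tail = (f" {method} {request_id}\r\n"
            + "".join(line + "\r\n" for line in lines)
            + "\r\n").encode("ascii") + body
    head = b"MRCP/2.0 "

    # The message-length field counts the whole message, including its own
    # digits, so adjust until the declared and actual lengths agree.
    fixed = len(head) + len(tail)
    total = fixed + len(str(fixed))
    while fixed + len(str(total)) != total:
        total = fixed + len(str(total))
    return head + str(total).encode("ascii") + tail


# Hypothetical inline grammar accepting "yes" or "no".
GRAMMAR = (
    '<?xml version="1.0"?>\n'
    '<grammar xmlns="http://www.w3.org/2001/06/grammar" '
    'xml:lang="en-US" version="1.0" root="answer">\n'
    '  <rule id="answer"><one-of><item>yes</item><item>no</item></one-of></rule>\n'
    '</grammar>\n'
).encode("ascii")

if __name__ == "__main__":
    message = build_mrcp_request(
        "RECOGNIZE",
        1,
        {
            "Channel-Identifier": "32AECB23433801@speechrecog",  # hypothetical
            "Content-Type": "application/srgs+xml",
            "Content-ID": "<grammar1@example.invalid>",          # hypothetical
        },
        GRAMMAR,
    )
    print(message.decode("ascii"))
```

Sending the grammar inline with RECOGNIZE is one way to combine steps 3 and 4 in a single request; large or shared grammars would more likely be preloaded as described in step 1.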