Sample Python runtime client

DLGaaS offers a sample Python client application that you can download and use to access a deployed Dialog model with the Runtime gRPC API.

This sample app can accept text input or speech audio input and return text output or synthesized speech audio (TTS) output.

Prerequisites

To run this client, you need to have:

Installed Python 3.6 or later.
Generated Python stubs from gRPC setup.
Your Mix client ID and secret from Prerequisites from Mix.
A built and deployed Mix project with Dialog and NLU resources, as described in Prerequisites from Mix. The Mix Coffee app quick start project is an easy way to get started.
For TTS output, your project requires the TTS output modality.
For speech input, a speech audio file. A sample wave file is provided in the sample client package, or you can use your own audio file. Your project must support the voice input modality to use voice input.
The Mix URN for your deployed Dialog model
Your Mix client ID and secret. This is needed to authorize you to access your previously built and deployed Mix Dialog and NLU model.
The Python sample client files for Linux or Windows:
- Linux client: dialog-python-client-linux.zip
- Windows client: dialog-python-client-win.zip

Download the Python client zip file for Linux or Windows and extract its files into the same directory as the nuance directory that contains your proto files and Python stubs.

On Linux, give the run scripts execute permission with chmod +x. For example:

unzip dialog-python-client-linux.zip 
chmod +x *.sh

Client files

These are the resulting client files, in the same directory as the nuance directory holding your Python stubs:

    ├── dlg_client.py
    ├── run-mix-client.sh or run-mix-client.bat
    ├── run-mix-token-client.sh or run-mix-token-client.bat
    ├── OrderCoffee_i_want_a_double_espresso.wav
    ├── google
    └── nuance
        ├── dlg
        │   └── v1
        │       ├── common
        │       │   ├── dlg_common_messages.proto
        │       │   └── dlg_common_messages_pb2.py
        │       ├── dlg_interface.proto
        │       ├── dlg_interface_pb2.py
        │       ├── dlg_interface_pb2_grpc.py
        │       ├── dlg_messages.proto
        │       └── dlg_messages_pb2.py
        │
        ├── asr
        │   └── v1
        │       ├── recognizer_pb2_grpc.py
        │       ├── recognizer_pb2.py
        │       ├── recognizer.proto
        │       ├── resource_pb2.py
        │       ├── resource.proto
        │       ├── result_pb2.py
        │       └── result.proto
        │
        ├── tts
        │   └── v1
        │       ├── nuance_tts_v1.proto
        │       ├── nuance_tts_v1_pb2.py
        │       └── nuance_tts_v1_pb2_grpc.py
        ├── nlu
        │   └── v1
        │       ├── interpretation_common_pb2.py
        │       ├── interpretation-common.proto
        │       ├── multi_intent_interpretation_pb2.py
        │       ├── multi-intent-interpretation.proto
        │       ├── result.proto
        │       ├── result_pb2.py
        │       ├── runtime.proto
        │       ├── runtime_pb2.py
        │       ├── runtime_pb2_grpc.py
        │       ├── single_intent_interpretation_pb2.py
        │       └── single-intent-interpretation.proto
        └──rpc
            ├── error_details.proto
            ├── error_details_pb2.py
            ├── status.proto
            ├── status_pb2.py
            ├── status_code.proto
            └── status_code_pb2.py

Python app file

Each sample app package contains a common Python client application file, dlg_client.py. This file imports the generated Python stubs and contains the main application code. The client apps include command line scripts to run the app on the respective platforms along with other files to use with the app.

View dlg_client.py

import argparse
import time
import logging
import grpc  
import sys 
import os
import wave
from time import sleep
import urllib
import urllib.request
import base64
import json
from google.protobuf import text_format, json_format

from google.protobuf.json_format import MessageToJson, MessageToDict
 

from nuance.dlg.v1.common.dlg_common_messages_pb2 import *
from nuance.dlg.v1.dlg_messages_pb2 import *
from nuance.dlg.v1.dlg_interface_pb2 import *
from nuance.dlg.v1.dlg_interface_pb2_grpc import *

from nuance.tts.v1 import nuance_tts_v1_pb2
from nuance.asr.v1 import resource_pb2, result_pb2, recognizer_pb2, recognizer_pb2_grpc


oauth_token_expiry_threshhold_seconds = 30
oauth_token_expiry_seconds = 0
oauth_token = None
args = None

# Defines details for accepted command line arguments and help text
# Creates a global args object that contains the command line arguments
def parse_args():
    global args
    parser = argparse.ArgumentParser(
        prog="dlg_client.py",
        usage="%(prog)s [-options]",
        add_help=False,
        formatter_class=lambda prog: argparse.HelpFormatter(
            prog, max_help_position=45, width=100)
    )

    options = parser.add_argument_group("options")
    options.add_argument("-h", "--help", action="help",
                         help="Show this help message and exit")
    options.add_argument("--appId", metavar="appId", nargs="?", help="Mix appId. For self-hosted use only. Used by Dialog service to resolve resource URNs in self-hosted setup.")
    options.add_argument("--token", nargs="?", help=argparse.SUPPRESS)
    options.add_argument("--oauthURL", metavar="oauthUrl", nargs="?",
                         help="OAuth 2.0 URL")
    options.add_argument("--clientID", metavar="clientID", nargs="?",
                         help="OAuth 2.0 Client ID")
    options.add_argument("--clientSecret", metavar="clientSecret", nargs="?",
                         help="OAuth 2.0 Client Secret")
    options.add_argument("--oauthScope", metavar="oauthScope", nargs="?",
                         help="OAuth 2.0 Scope, default=dlg", default='dlg')
    options.add_argument("--secure", action="store_true",
                         help="Connect to the server using a secure gRPC channel")
    options.add_argument("-s", "--serverUrl", metavar="serverUrl", nargs="?",
                         help="Dialog server URL, default=localhost:8080", default='localhost:8080')
    options.add_argument('--modelUrn', metavar="modelUrn", nargs="?",
                         help="Dialog model URN, e.g. urn:nuance:mix/eng-USA/A2_C16/mix.dialog")
    options.add_argument("--textInput", metavar="textInput", nargs="?",
                         help="Text to perform interpretation on")
    options.add_argument("--audioFile", metavar="audioFile", nargs="?",
                         help="audio file name for speech input to trigger speech recognition and then interpretation")
    options.add_argument("--tts", help="Boolean whether to request TTS", action="store_true")
    options.add_argument("--audioDir", metavar="audio directory", nargs="?",
                         help="Audio output directory for TTS, default=audio. To be used together with --tts.", default='audio')

    args = parser.parse_args()

# Using clientID and clientSecret from the command line arguments, obtain an OAuth 2.0 token from the oauthURL HTTP endpoint
# Uses urllib.request to make the HTTP request
# Extracts and returns the access_token from the response, along with a Boolean called updated that indicates whether a new token was generated
def get_oauth2_token():
    global oauth_token
    global oauth_token_expiry_seconds
    global oauth_token_expiry_threshhold_seconds

    updated = False

    if args.oauthURL is None:
        return None
    
    current_time = time.monotonic()

    try:
        if oauth_token and oauth_token_expiry_seconds - oauth_token_expiry_threshhold_seconds > current_time:
            log.debug('OAuth token is still valid')
            return oauth_token, updated

        log.info("Obtaining auth token (Client ID: {}, URL: {})".format(args.clientID, args.oauthURL))

        encoded_credentials = base64.standard_b64encode("{}:{}".format(args.clientID, args.clientSecret).encode()).decode('utf-8')
        headers = { 'Authorization' : "Basic {}".format(encoded_credentials)  }

        data = {
            'grant_type': 'client_credentials',
            'scope': args.oauthScope,
        }

        request = urllib.request.Request(url=args.oauthURL, headers=headers, data=urllib.parse.urlencode(data).encode(), method='POST')
        updated = True
        with urllib.request.urlopen(request) as response:
            response = response.read().decode('utf-8')
            json_response = json.loads(response)

            oauth_token = json_response["access_token"]
            oauth_token_expiry_seconds = time.monotonic() + json_response["expires_in"]
        
            log.debug("Token TTL: %d" % json_response["expires_in"])
            return json_response["access_token"], updated
    except urllib.error.HTTPError as err:
        raise Exception("Failed to obtain authentication token. Status: {}, Error: {}".format(err.code, err.read().decode()))


# Creates a gRPC channel to the service
def create_channel(args):    
    call_credentials = None
    channel = None
    # Token passed in as a command line argument
    if args.token:
        log.debug('Adding CallCredentials using token parameter')
        call_credentials = grpc.access_token_call_credentials(args.token)
    else:
        # Request a token from the OAuth endpoint
        current_oauth_token, _ = get_oauth2_token()
        if current_oauth_token:
            log.debug('Adding CallCredentials from OAuth endpoint')
            call_credentials = grpc.access_token_call_credentials(current_oauth_token)
    # Secure channel. This is always used when contacting hosted Mix Dialog service. 
    # You need to pass in a Boolean command line argument --secure in this case
    if args.secure:
        log.debug("Creating secure gRPC channel")
        channel_credentials = grpc.ssl_channel_credentials()
        if call_credentials is not None:
            channel_credentials = grpc.composite_channel_credentials(channel_credentials, call_credentials)
        channel = grpc.secure_channel(args.serverUrl, credentials=channel_credentials)
    # Insecure channel. Not applicable to apps contacting Nuance-hosted Mix service. Can be used for self-hosted Mix Dialog service. 
    # You need to provide the Mix appId of your application as a command line argument --appId in this case.
    # In this case the sample app includes a custom header for Dialog passing in the appId as "x-nuance-client-id" 
    # Dialog uses this to resolve resource URNs in self-hosted setup.
    else:
        log.debug("Creating insecure gRPC channel")
        channel = grpc.insecure_channel(args.serverUrl, options = [("x-nuance-client-id", args.appId)])
    return channel

def read_session_id_from_response(response_obj):
    try:
        session_id = response_obj.get('payload').get('sessionId', None)
    except Exception as e:
        raise Exception("Invalid JSON Object or response object")
    if session_id:
        return session_id
    else:
        raise Exception("Session ID is not present or some error occurred")

# Generates the .wav file header for a given set of parameters. Auxiliary function for saving TTS output as a wav file.
def generate_wav_header(sample_rate, bits_per_sample, channels, datasize, formattype):
    # (4byte) Marks file as RIFF
    o = bytes("RIFF", 'ascii')
    # (4byte) File size in bytes excluding this and RIFF marker
    o += (datasize + 36).to_bytes(4, 'little')
    # (4byte) File type
    o += bytes("WAVE", 'ascii')
    # (4byte) Format Chunk Marker
    o += bytes("fmt ", 'ascii')
    # (4byte) Length of above format data
    o += (16).to_bytes(4, 'little')
    # (2byte) Format type (1 - PCM)
    o += (formattype).to_bytes(2, 'little')
    # (2byte) Will always be 1 for TTS
    o += (channels).to_bytes(2, 'little')
    # (4byte)
    o += (sample_rate).to_bytes(4, 'little')
    o += (sample_rate * channels * bits_per_sample // 8).to_bytes(4, 'little')  # (4byte)
    o += (channels * bits_per_sample // 8).to_bytes(2,'little')               # (2byte)
    # (2byte)
    o += (bits_per_sample).to_bytes(2, 'little')
    # (4byte) Data Chunk Marker
    o += bytes("data", 'ascii')
    # (4byte) Data size in bytes
    o += (datasize).to_bytes(4, 'little')
    return o

# Given bytearray() audio, and sampling details, saves as a .wav file, target_audio_file_name.
# audio - byte audio
# output_file_name - name of the intended output file name with extension 
# sample_rate - sample rate in Hz
# bits_per_sample - bits in each sample
# channels - number of channels
# formattype - format type, 1 for PCM
def save_audio_file_wav(audio, target_audio_file_name, sample_rate, bits_per_sample, channels, formattype):
    audio_file = ""
    output_file_path = os.path.join(args.audioDir, target_audio_file_name)
    os.makedirs(os.path.dirname(output_file_path), exist_ok=True)
    with open(output_file_path, "wb") as audio_file:
        datasize = len(audio)
        wav_header = generate_wav_header(sample_rate, bits_per_sample, channels, datasize, formattype)
        audio_file.seek(0, 0)
        audio_file.write(wav_header)
        audio_file.seek(0, 2)
        audio_file.write(audio)
    log.debug("Wrote generated speech audio response to %s" %  output_file_path)

def start_request(stub, model_ref_dict, session_id, selector_dict={}):
    selector = Selector(channel=selector_dict.get('channel'), 
                        library=selector_dict.get('library'),
                        language=selector_dict.get('language'))
    start_payload = StartRequestPayload(model_ref=model_ref_dict)
    start_req = StartRequest(session_id=session_id, 
                        selector=selector, 
                        payload=start_payload)
    log.debug(f'Start Request: {start_req}')
    start_response, call = stub.Start.with_call(start_req)
    response = MessageToDict(start_response)
    log.debug(f'Start Request Response: {response}')
    return response, call

def execute_request(stub, session_id, selector_dict={}, payload_dict={}):
    selector = Selector(channel=selector_dict.get('channel'), 
                        library=selector_dict.get('library'),
                        language=selector_dict.get('language'))
    input = UserInput(user_text=payload_dict.get('user_input').get('userText'))
    execute_payload = ExecuteRequestPayload(
                        user_input=input)
    execute_request = ExecuteRequest(session_id=session_id, 
                        selector=selector, 
                        payload=execute_payload)
    log.debug(f'Execute Request: {execute_payload}')
    execute_response, call = stub.Execute.with_call(execute_request)
    response = MessageToDict(execute_response)
    log.debug(f'Execute Response: {response}')
    return response, call


def execute_stream_request(args, stub, session_id, selector_dict={}, initial = False, interpret_text = False, request_asr = False, request_tts = False):
    # Receive stream outputs from Dialog, using stream of inputs
    stream_outputs = stub.ExecuteStream(build_stream_input(args, session_id, selector_dict, initial, interpret_text, request_asr, request_tts))
    responses = []
    audio = bytearray(b'')

    for stream_output in stream_outputs:
        if stream_output:
            if stream_output.HasField("response"):
                response = stream_output.response
                responses.append(response)
                # Extract execute response from the stream output
                response_dict = MessageToDict(stream_output.response)
                if response: 
                    responses.append(response)
                    log.debug(f'Received Execute response: {response_dict}')
            if stream_output.HasField('audio'):
                if stream_output.audio.HasField('audio'):
                    log.debug("Received TTS audio: %d bytes" % len(stream_output.audio.audio))
                    audio += stream_output.audio.audio
            if stream_output.HasField("asr_status"):
                asr_status = stream_output.asr_status
                log.debug("Received ASR status response: {} - {}".format(asr_status.code, asr_status.message))
            # if stream_output.HasField("asr_result"):
            #     asr_result = stream_output.asr_result
            #     log.debug("Received ASR result: {}".format(asr_result))
            
    return responses, audio

# Creates a stream of StreamInputs
def build_stream_input(args, session_id, selector_dict, initial = False, interpret_text = False, request_asr = False, request_tts = False):
    selector = Selector(channel = selector_dict.get('channel'),
                        library = selector_dict.get('library'),
                        language = selector_dict.get('language'))

    # Was TTS requested?
    if request_tts:
        # TTS requested
        # Settings for speech generation audio encoded as PCM 16KHz
        audio_format = nuance_tts_v1_pb2.AudioFormat(pcm = nuance_tts_v1_pb2.PCM(sample_rate_hz = 16000))
        audio_params = nuance_tts_v1_pb2.AudioParameters(audio_format = audio_format)
        voice = nuance_tts_v1_pb2.Voice(name = "Evan", model = "enhanced")
        # voice = nuance_tts_v1_pb2.Voice(name = "en-US-AmberNeural")
        # tts_control_v1 = TtsParamsV1(audio_params = audio_params)
        tts_control_v1 = TtsParamsV1(audio_params = audio_params, voice = voice)
    else:
        # No TTS needed
        tts_control_v1 = None 

    # Was text provided for interpretation?    
    if interpret_text:
        # Use text
        user_input = UserInput(user_text = args.textInput)
        execute_payload = ExecuteRequestPayload(user_input = user_input)
    else:
        if initial:
            # request flagged as initial request to kick off conversation and get initial prompts
            # Have to send an ExecuteRequestPayload with a user input, but with user_text empty
            user_input = UserInput(user_text = None)
            execute_payload = ExecuteRequestPayload(user_input = user_input)
        else:
            # Audio input case. Use empty payload
            execute_payload = ExecuteRequestPayload(user_input = None)
             
    # Build execute request object
    execute_request = ExecuteRequest(session_id = session_id,
                                     selector = selector,
                                     payload = execute_payload)
    
    # Audio file was provided. If so, open file, break it into packets, and stream
    if request_asr:
        with wave.open(args.audioFile, mode='r') as wf:
            # samples rate in Hz, samples per second
            sample_rate = wf.getframerate()
            # Desired time duration for each full audio packet, in seconds per packet. Using 0.02s per packet
            packet_duration = 0.020
            # number of samples for a packet, samples per second times seconds per packet
            packet_samples = int(sample_rate * packet_duration)
            audio_format = recognizer_pb2.AudioFormat(pcm = recognizer_pb2.PCM(sample_rate_hz = sample_rate))
            asr_control_v1 = AsrParamsV1(audio_format = audio_format, end_stream_no_valid_hypotheses = True)
            # first_packet flag distinguishes between first streaming packet and those that come after
            # For DLGaaS ExecuteStream(), first StreamInput contains audio config + first packet of audio bytes
            # Subsequent StreamInputs contain only audio bytes
            first_packet = True
            # the lambda reads a packet with packet_samples samples from the open audio file
            # iter creates an iterator that returns packets of the specified size. Using b'' as a sentinel value to stop
            log.debug(f'Streaming audio input...')
            for audio_packet in iter(lambda: wf.readframes(packet_samples), b''):
                if first_packet:
                    first_packet = False
                    # First packet includes the request header in addition to first chunk of audio bytes data
                    stream_input = StreamInput(
                        request = execute_request,
                        asr_control_v1 = asr_control_v1,
                        audio = audio_packet,
                        tts_control_v1 = tts_control_v1
                        )
                    log.debug(f'First streamed packet:')
                    log.debug(f'Sending parameters for ASR: {stream_input.asr_control_v1}')
                    log.debug(f'Sending parameters TTS: {stream_input.tts_control_v1}')
                    log.debug("Sending first speech input audio packet. Sending %d bytes" % len(audio_packet))
                else:
                    stream_input = StreamInput(audio = audio_packet)
                    # log.debug("Received audio: %d bytes" % len(stream_output.audio.audio))
                    log.debug("Sending subsequent speech audio packet. Sending %d bytes." % len(audio_packet))
                yield stream_input
                sleep(packet_duration)
            # Send a final empty StreamInput to signal to Dialog that the audio stream is complete
            stream_input = StreamInput(audio = b'')
            log.debug(f'Sending empty stream input to signal end of stream.')
            yield stream_input
    
    # Alternatively, no audio file provided. This branch handles the case of streaming with TTS only
    # Whether to kick off dialog or for first real turn of dialog
    else:
        stream_input = StreamInput(
            request = execute_request,
            tts_control_v1 = tts_control_v1
            )
        log.debug(f'Stream input with parameters for TTS: {stream_input.tts_control_v1}')
        yield stream_input
    
def stop_request(stub, session_id=None):
    stop_req = StopRequest(session_id=session_id)
    log.debug(f'Stop Request: {stop_req}')
    stop_response, call = stub.Stop.with_call(stop_req)
    response = MessageToDict(stop_response)
    log.debug(f'Stop Response: {response}')
    return response, call

def main():
    parse_args()
    log_level = logging.DEBUG
    global log
    log = logging.getLogger('')
    logging.basicConfig(
        format='%(asctime)s %(levelname)-5s: %(message)s', level=log_level)
    
    if args.oauthURL:
        if args.clientID is None:
            log.error("OAuth 2.0 URL was supplied but client ID is missing")
            return
        elif args.clientSecret is None:
            log.error("OAuth 2.0 URL was supplied but client secret is missing")
            return
    
    # Create channel to Dialog service
    with create_channel(args) as channel:
        stub = DialogServiceStub(channel)
        model_ref_dict = {
            "uri": args.modelUrn,
            "type": 0
        }
        selector_dict = {
            "channel": "default",
            "language": "en-US",
            "library": "default"
        }

        # Start the Dialog session
        response, call = start_request(stub, 
                            model_ref_dict=model_ref_dict, 
                            session_id = None,
                            selector_dict=selector_dict
                        )
        session_id = read_session_id_from_response(response)f-sample-app
        log.debug(f'Session: {session_id}')
        assert call.code() == grpc.StatusCode.OK
        log.debug(f'Initial request, no input from the user to get initial prompt')

        # Streaming required for ASR, TTS, or both
        if args.audioFile or args.tts:
            
            request_tts = args.tts
            if args.audioFile:
                interpret_text = False
                request_asr = True
            else:
                request_asr = False
                interpret_text = True
            # need to send initial request to kick off
            _, audio = execute_stream_request(args, stub, session_id, selector_dict = selector_dict, initial = True, request_tts = request_tts)
            if audio:
                 save_audio_file_wav(audio, "initial_tts_audio.wav", 16000, 16, 1, 1)
            else:
                log.debug(f'Something did not work with TTS initial prompts')

            # then send main request 
            _ , audio = execute_stream_request(args, stub, session_id, selector_dict = selector_dict, interpret_text = interpret_text, request_asr = request_asr, request_tts = request_tts)
            if audio:
                save_audio_file_wav(audio, "main_tts_audio.wav", 16000, 16, 1, 1)
            else:
                log.debug(f'Something did not work with TTS main response')

        # No streaming required
        else:
            payload_dict = {
                "user_input": {
                    "userText": None
                }
            }
            response, call = execute_request(stub, 
                                session_id=session_id, 
                                selector_dict=selector_dict,
                                payload_dict=payload_dict
                            )
            assert call.code() == grpc.StatusCode.OK
            log.debug(f'Second request, passing in user input')
            payload_dict = {
                "user_input": {
                    "userText": args.textInput
                }
            }
            response, call = execute_request(stub, 
                                session_id=session_id, 
                                selector_dict=selector_dict,
                                payload_dict=payload_dict
                            )
            assert call.code() == grpc.StatusCode.OK

if __name__ == '__main__':
    main()

Audio file for speech input

Each sample app package also includes a common audio file, OrderCoffee_i_want_a_double_espresso.wav, containing a text-to-speech rendering of the phrase “I want a double espresso.”

This audio file is intended for trying out speech processing with the client app, specifically in relation to a Dialog model built from the Mix Coffee app quick start project.

Given a Dialog model and associated NLU model built from this quick start project, this phrase will be interpreted by NLUaaS as the intent “ORDER_COFFEE” and with entity values of COFFEE_TYPE espresso and COFFEE_SIZE large. The Dialog model can then proceed down the path defined for those intent and entity values, and return responses accordingly.

Applicability of the audio file

Since this audio clip is designed for this specific coffee-shop themed project, the provided audio clip will only be useful for testing with models built from this quick start project. It may also be useful for other models relevant to a similar domain that includes intents of ordering coffee or other drinks. But if you’re using a different Dialog model that is related to a very different domain, you’ll need to provide your own audio clip that is appropriate for your model.

Specifications for creating your own audio file

The Python sample app currently only supports .wav audio files. The .wav audio file must be encoded with the following format to be usable with ASRaaS:

Linear pulse-code modulated (PCM)
16-bit signed little-endian samples
8 or 16 kHz sample rate

Run Python client for help

For a quick check that the client is working, and to see the arguments it accepts, run it on Linux or Windows using the help (-h or --help) option.

See the results below and notice:

-s, --serverUrl: URL for the Dialog server. By default this is localhost:8080 but the sample scripts specify hosted Mix Dialog at dlg.api.nuance.com.
Authorization: Include --oauthURL, --clientID, and --clientSecret. Alternatively, generate a token and use the (hidden) --token argument. The --oauthScope is set by default to dlg and so does not need to be specified for the provided Python client, which does not require any other scopes.
--secure: Boolean signalling whether to use a secure gRPC channel.
--modelUrn: Mix URN for the Dialog model to use.
--textInput: Text input string to the dialog.
--audioFile: Audio file containing speech input recording.
--tts: Boolean signalling whether text to speech output is required.
--audioDir: Directory for audio output files. This is set to audio by default.

python dlg_client.py -h
usage: dlg_client.py [-options]

options:
  -h, --help                               Show this help message and exit
  --appId [appId]                          Mix appId. For self-hosted use only. Used by Dialog
                                           service to resolve resource URNs in self-hosted setup.
  --oauthURL [oauthUrl]                    OAuth 2.0 URL
  --clientID [clientID]                    OAuth 2.0 Client ID
  --clientSecret [clientSecret]            OAuth 2.0 Client Secret
  --oauthScope [oauthScope]                OAuth 2.0 Scope, default=dlg
  --secure                                 Connect to the server using a secure gRPC channel
  -s [serverUrl], --serverUrl [serverUrl]  Dialog server URL, default=localhost:8080
  --modelUrn [modelUrn]                    Dialog model URN, e.g. urn:nuance:mix/eng-
                                           USA/A2_C16/mix.dialog
  --textInput [textInput]                  Text to perform interpretation on
  --audioFile [audioFile]                  audio file name for speech input to trigger speech
                                           recognition and then interpretation
  --tts                                    Boolean whether to request TTS
  --audioDir [audio directory]             Audio output directory for TTS, default=audio. To be used
                                           together with --tts.

Edit run script

First, edit the sample shell script (run-mix-client.sh) or batch file (run-mix-client.bat) to add your Mix client ID and secret. The script replaces the colons in the client ID with %3A so the value can be parsed correctly in subsequent operations.

The client ID and secret are used to authorize you to access the Dialog service. Note that the run scripts are set to authorize with the US geography by default. If you are using a different geography, you will need to update the --oauthURL in the script to use the appropriate OAuth URL for your geography.

See Authorize for more about authorization and how to use the run-mix-token-client.sh and *.bat scripts.

Linux: run-mix-client.sh
Windows: run-mix-client.bat

#!/bin/bash

CLIENT_ID=<Mix client ID, starting with appID:>
SECRET=<Mix client secret>
# Change colons (:) to %3A in client ID 
CLIENT_ID=${CLIENT_ID//:/%3A}

# Scenario 1: Text input and text output
python dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token \
    --clientID $CLIENT_ID --clientSecret $SECRET \
    --serverUrl dlg.api.nuance.com \
    --secure \
    --modelUrn $1 \
    --textInput $2 

# Scenario 2: Text input and TTS output
# python3 dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token \
#     --clientID $CLIENT_ID --clientSecret $SECRET \
#     --serverUrl dlg.api.nuance.com \
#     --secure \
#     --tts \
#     --modelUrn $1 \
#     --textInput $2 

# Scenario 3: Audio input and TTS output
# python3 dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token \
#     --clientID $CLIENT_ID --clientSecret $SECRET \
#     --serverUrl dlg.api.nuance.com \
#     --secure \
#     --tts \
#     --audioFile OrderCoffee_i_want_a_double_espresso.wav \
#     --modelUrn $1

@echo off
setlocal enabledelayedexpansion

set CLIENT_ID=<Mix client ID, starting with appID:>
set SECRET=<Mix client secret>
rem Change colons (:) to %3A in client ID
set CLIENT_ID=!CLIENT_ID::=%%3A!

rem Scenario 1: Text input and output
python dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token ^
    --clientID %CLIENT_ID% --clientSecret %SECRET% ^
    --serverUrl dlg.api.nuance.com ^
    --secure ^
    --modelUrn %1 ^
    --textInput %2 

rem Scenario 2: Text input and TTS output
rem python dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token ^
rem     --clientID %CLIENT_ID% --clientSecret %SECRET% ^
rem     --serverUrl dlg.api.nuance.com ^
rem     --secure ^
rem     --tts ^
rem     --modelUrn %1 ^
rem     --textInput %2 

rem Scenario 3: Audio file input and TTS output
rem python dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token ^
rem     --clientID %CLIENT_ID% --clientSecret %SECRET% ^
rem     --serverUrl dlg.api.nuance.com ^
rem     --secure ^
rem     --tts ^
rem     --audioFile OrderCoffee_i_want_a_double_espresso.wav ^
rem     --modelUrn %1

Run the sample client

With your client ID and secret added to the run script, you can run the sample client. There are three options for running the client, depending on the scenario you want to try.

Scenario 1: Text input and text output

By default, this client accepts a text string as input and returns a text response: the next prompt to send to the user. To try this scenario, run the sample shell script or batch file, passing the URN of your Dialog model and your text input to be interpreted.

Linux: run client text input example
Windows: run client text input example

./run-mix-client.sh "urn:nuance-mix:tag:model/TestMixClient/mix.dialog" "I want a double espresso"

run-mix-client.bat "urn:nuance-mix:tag:model/TestMixClient/mix.dialog" "I want a double espresso"

The client takes your text string and calls DLGaaS to interpret it and return the next prompt in the application as a text string. The prompt should be: “Perfect! A double espresso coming right up!” The response is the same on Linux and Windows.

2024-01-07 17:04:05,414 DEBUG: Creating secure gRPC channel
2024-01-07 17:04:05,420 DEBUG: Start Request: selector {
  channel: "default"
  language: "en-US"
  library: "default"
}
payload {
  model_ref {
    uri: "urn:nuance-mix:tag:model/TestMixClient/mix.dialog"
  }
}

2024-01-07 17:04:05,945 DEBUG: Start Request Response: {'payload': {'sessionId': '92705444-cd59-4a04-b79c-e67203f04f0d'}}
2024-01-07 17:04:05,948 DEBUG: Session: 92705444-cd59-4a04-b79c-e67203f04f0d
2024-01-07 17:04:05,949 DEBUG: Initial request, no input from the user to get initial prompt
2024-01-07 17:04:05,952 DEBUG: Execute Request: user_input {
}

2024-01-07 17:04:06,193 DEBUG: Execute Response: {'payload': {'messages':
[{'visual': [{'text': 'Hello and welcome to the coffee app.'}], 'view': {}}],
'qaAction': {'message': {'visual': [{'text': 'What can I get you today?'}]},
'data': {}, 'view': {}}}}
2024-01-07 17:04:06,198 DEBUG: Second request, passing in user input
2024-01-07 17:04:06,199 DEBUG: Execute Request: user_input {
  user_text: "I want a double espresso"
}

2024-01-07 17:04:06,791 DEBUG: Execute Response: {'payload': {'messages':
[{'visual': [{'text': 'Perfect, a double espresso coming right up!'}], 'view':
{}}], 'endAction': {'data': {}, 'id': 'End dialog'}}}

If you receive errors, or don’t get the response you expect (“Perfect…”) see Troubleshooting.

Scenario 2: Text input and TTS output

In this scenario, you input a text string but DLGaaS returns a wave file with synthesized text-to-speech audio, ready to play to the user instead of a text prompt.

Edit the sample shell script or batch file to uncomment scenario 2: the lines for text input and TTS output. Comment out scenario 1.

Note:

Remember that TTS output requires the TTS output modality in your Mix project.

Linux: edit client for text input + TTS output
Windows: edit client for text input + TTS output

# Scenario 1: Text input and output 
# python3 dlg_client.py –oauthURL https://auth.crt.nuance.com/oauth2/token \
#    --clientID $CLIENT_ID –clientSecret $SECRET \
#    --serverUrl dlg.api.nuance.com \ 
#    --secure \
#    --modelUrn $1 \
#    --textInput $2 

# Scenario 2: Text input and TTS output 
python3 dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token \
     --clientID $CLIENT_ID --clientSecret $SECRET \ 
     --serverUrl dlg.api.nuance.com \
     --secure \
     --tts \
     --modelUrn "$1" \
     --textInput "$2"

rem Scenario 1: Text input and output
rem python dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token ^
rem    --clientID %CLIENT_ID% --clientSecret %SECRET% ^
rem    --serverUrl dlg.api.nuance.com ^
rem    --secure ^
rem    --modelUrn %1 ^
rem    --textInput %2 

rem Scenario 2: Text input and TTS output
python dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token ^
    --clientID %CLIENT_ID% --clientSecret %SECRET% ^
    --serverUrl dlg.api.nuance.com ^
    --secure ^
    --tts ^
    --modelUrn %1 ^
    --textInput %2

As in scenario 1, run the client from the shell script or batch file, passing the URN of your Dialog model and your text input to be interpreted.

Linux: run client text input + TTS output
Windows: run client text input + TTS output

./run-mix-client.sh urn:nuance-mix:tag:model/TestMixClient/mix.dialog "I want a double espresso"

run-mix-client.bat urn:nuance-mix:tag:model/TestMixClient/mix.dialog "I want a double espresso"

The client takes the text string and calls DLGaaS to interpret it and return the next prompt in the application. In this scenario, the client signals Dialog via the streaming API to call TTSaaS and saves the audio that comes back as .wav files. You’ll see two audio files under a new folder, audio:

initial_tts_audio.wav: The audio for the initial prompt.
main_tts_audio.wav: The audio for the response to the user input.

If all goes well, you should see output similar to the following:

2024-01-07 16:16:20,415 DEBUG: Adding CallCredentials using token parameter
2024-01-07 16:16:20,416 DEBUG: Creating secure gRPC channel
2024-01-07 16:16:20,422 DEBUG: Start Request: selector {
  channel: "default"
  language: "en-US"
  library: "default"
}
payload {
  model_ref {
    uri: "urn:nuance-mix:tag:model/TestMixClient/mix.dialog"
  }
}

2024-01-07 16:16:20,738 DEBUG: Start Request Response: {'payload': {'sessionId': '6303610e-8d97-4f95-bf97-2d423c4baad0'}}
2024-01-07 16:16:20,738 DEBUG: Session: 6303610e-8d97-4f95-bf97-2d423c4baad0
2024-01-07 16:16:20,738 DEBUG: Initial request, no input from the user to get initial prompt
2024-01-07 16:16:20,739 DEBUG: Stream input with parameters for TTS: audio_params {
  audio_format {
    pcm {
      sample_rate_hz: 16000
    }
  }
}
voice {
  name: "Evan"
  model: "enhanced"
}

2024-01-07 16:16:20,931 DEBUG: Received Execute response: {'payload': {'messages': [{'nlg': [{'text': 'Hello and welcome to the coffee app.'}], 'visual': [{'text': 'Hello and welcome to the coffee app.'}], 'view': {}}], 'qaAction': {'message': {'nlg': [{'text': 'What can I get you today?'}], 'visual': [{'text': 'What can I get you today?'}]}, 'view': {}, 'recognitionSettings': {'collectionSettings': {'timeout': '7000', 'completeTimeout': '0', 'incompleteTimeout': '1500', 'maxSpeechTimeout': '12000'}, 'speechSettings': {'sensitivity': '0.5', 'bargeInType': 'speech', 'speedVsAccuracy': '0.5'}}}}}
2024-01-07 16:16:20,998 DEBUG: Received TTS audio: 70806 bytes
2024-01-07 16:16:20,998 DEBUG: Received TTS audio: 12596 bytes
2024-01-07 16:16:21,000 DEBUG: Received TTS audio: 42758 bytes
2024-01-07 16:16:21,001 DEBUG: Wrote generated speech audio response to audio/initial_tts_audio.wav
2024-01-07 16:16:21,002 DEBUG: Stream input with parameters for TTS: audio_params {
  audio_format {
    pcm {
      sample_rate_hz: 16000
    }
  }
}
voice {
  name: "Evan"
  model: "enhanced"
}

2024-01-07 16:16:21,176 DEBUG: Received Execute response: {'payload': {'messages': [{'nlg': [{'text': 'Perfect, a double espresso coming right up!'}], 'visual': [{'text': 'Perfect, a double espresso coming right up!'}], 'view': {}}], 'endAction': {'data': {}, 'id': 'End dialog'}}}
2024-01-07 16:16:21,193 DEBUG: Received TTS audio: 62572 bytes
2024-01-07 16:16:21,194 DEBUG: Received TTS audio: 36856 bytes
2024-01-07 16:16:21,196 DEBUG: Wrote generated speech audio response to audio/main_tts_audio.wav

If you receive errors, or don’t get the response you expect see Troubleshooting.

Scenario 3: Audio input and TTS output

In this final scenario, you input an audio file and DLGaaS returns wave files with synthesized text-to-speech audio, simulating a complete voice conversation between the application and the end user.

Edit the shell script or batch file to uncomment scenario 3: the lines for audio input and TTS output. Comment out the lines for scenarios 1 and 2.

Linux: edit client for audio input, audio output
Windows: edit client for audio input, audio output

# Scenario 1: Text input and output
# python3 dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token \
#    --clientID $CLIENT_ID --clientSecret $SECRET \
#    --serverUrl dlg.api.nuance.com \
#    --secure \
#    --modelUrn "$1" \
#    --textInput "$2" 

# Scenario 2: Text input and TTS output
# python3 dlg_client.py --oauthURL "https://auth.crt.nuance.com/oauth2/token" \
#     --clientID $CLIENT_ID --clientSecret $SECRET \ 
#    --serverUrl dlg.api.nuance.com \
#    --secure \
#    --tts \
#    --modelUrn "$1" \
#    --textInput "$2" 

# Scenario 3: Audio file input and TTS output
python3 dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token \
   --clientID $CLIENT_ID --clientSecret $SECRET \ 
    --serverUrl dlg.api.nuance.com \
    --secure \
    --tts \
    --audioFile OrderCoffee_i_want_a_double_espresso.wav \
    --modelUrn "$1"

rem Scenario 1: Text input and output
rem python dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token ^
rem    --clientID %CLIENT_ID% --clientSecret %SECRET% ^
rem    --serverUrl dlg.api.nuance.com ^
rem    --secure ^
rem    --modelUrn %1 ^
rem    --textInput %2

rem Scenario 2: Text input and TTS output
rem python dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token ^
rem    --clientID %CLIENT_ID% --clientSecret %SECRET% ^
rem    --serverUrl dlg.api.nuance.com ^
rem    --secure ^
rem    --tts ^
rem    --modelUrn %1 ^
rem    --textInput %2

rem Scenario 3: Audio file input and TTS output
python dlg_client.py --oauthURL https://auth.crt.nuance.com/oauth2/token ^
    --clientID %CLIENT_ID% --clientSecret %SECRET% ^
    --serverUrl dlg.api.nuance.com ^
    --secure ^
    --tts ^
    --audioFile OrderCoffee_i_want_a_double_espresso.wav ^
    --modelUrn %1

Run the client from the shell script or batch file and pass only the URN of your Dialog model. The audio file used as input is set in the run script.

Linux: run client audio input, audio output
Windows: run client audio input, audio output

./run-mix-client.sh "urn:nuance-mix:tag:model/TestMixClient/mix.dialog"

run-mix-client.bat "urn:nuance-mix:tag:model/TestMixClient/mix.dialog"

The client takes the audio file and calls DLGaaS to recognize the speech, interpret its meaning, and return the next prompt in the application.

The client simulates streaming of the audio by breaking the audio file into chunks and sending them to ASRaaS via DLGaaS’s streaming API. The client also signals DLGaaaS to call TTSaaS via the streaming API and saves the audio that comes back as .wav files. As in scenario 2, you’ll see two audio files under a new folder, audio:

initial_tts_audio.wav: The audio for the initial prompt.
main_tts_audio.wav: The audio for the response to the user input.

If all goes well, you should see output similar to the following:

2024-01-07 16:25:08,367 DEBUG: Adding CallCredentials using token parameter
2024-01-07 16:25:08,368 DEBUG: Creating secure gRPC channel
2024-01-07 16:25:08,373 DEBUG: Start Request: selector {
  channel: "default"
  language: "en-US"
  library: "default"
}
payload {
  model_ref {
    uri: "urn:nuance-mix:tag:model/TestMixClient/mix.dialog"
  }
}

2024-01-07 16:25:08,506 DEBUG: Start Request Response: {'payload': {'sessionId': 'a4ff950f-db77-455a-bc6e-bc5aad33b328'}}
2024-01-07 16:25:08,507 DEBUG: Session: a4ff950f-db77-455a-bc6e-bc5aad33b328
2024-01-07 16:25:08,507 DEBUG: Initial request, no input from the user to get initial prompt
2024-01-07 16:25:08,508 DEBUG: Stream input with parameters for TTS: audio_params {
  audio_format {
    pcm {
      sample_rate_hz: 16000
    }
  }
}
voice {
  name: "Evan"
  model: "enhanced"
}

2024-01-07 16:25:08,754 DEBUG: Received Execute response: {'payload': {'messages': [{'nlg': [{'text': 'Hello and welcome to the coffee app.'}], 'visual': [{'text': 'Hello and welcome to the coffee app.'}], 'view': {}}], 'qaAction': {'message': {'nlg': [{'text': 'What can I get you today?'}], 'visual': [{'text': 'What can I get you today?'}]}, 'view': {}, 'recognitionSettings': {'collectionSettings': {'timeout': '7000', 'completeTimeout': '0', 'incompleteTimeout': '1500', 'maxSpeechTimeout': '12000'}, 'speechSettings': {'sensitivity': '0.5', 'bargeInType': 'speech', 'speedVsAccuracy': '0.5'}}}}}
2024-01-07 16:25:08,820 DEBUG: Received TTS audio: 70806 bytes
2024-01-07 16:25:08,821 DEBUG: Received TTS audio: 12596 bytes
2024-01-07 16:25:08,822 DEBUG: Received TTS audio: 42758 bytes
2024-01-07 16:25:08,823 DEBUG: Wrote generated speech audio response to audio/initial_tts_audio.wav
2024-01-07 16:25:08,825 DEBUG: Streaming audio input...
2024-01-07 16:25:08,825 DEBUG: First streamed packet:
2024-01-07 16:25:08,825 DEBUG: Sending parameters for ASR: audio_format {
  pcm {
    sample_rate_hz: 16000
  }
}
end_stream_no_valid_hypotheses: true

2024-01-07 16:25:08,825 DEBUG: Sending parameters TTS: audio_params {
  audio_format {
    pcm {
      sample_rate_hz: 16000
    }
  }
}
voice {
  name: "Evan"
  model: "enhanced"
}

2023-12-19 16:25:08,825 DEBUG: Sending first speech input audio packet. Sending 640 bytes
2023-12-19 16:25:08,846 DEBUG: Sending subsequent speech audio packet. Sending 640 bytes.
. . . (more audio packets)
2023-12-19 16:25:11,103 DEBUG: Sending subsequent speech audio packet. Sending 640 bytes.
2023-12-19 16:25:11,118 DEBUG: Received ASR status response: 200 - Success
. . . (more audio packets)
2023-12-19 16:25:11,249 DEBUG: Sending subsequent speech audio packet. Sending 160 bytes.
2023-12-19 16:25:11,269 DEBUG: Sending empty stream input to signal end of stream.
2023-12-19 16:25:11,416 DEBUG: Received Execute response: {'payload': {'messages': [{'nlg': [{'text': 'Perfect, a double espresso coming right up!'}], 'visual': [{'text': 'Perfect, a double espresso coming right up!'}], 'view': {}}], 'endAction': {'data': {}, 'id': 'End dialog'}}}
2023-12-19 16:25:11,434 DEBUG: Received TTS audio: 62572 bytes
2023-12-19 16:25:11,435 DEBUG: Received TTS audio: 36856 bytes
2023-12-19 16:25:11,437 DEBUG: Wrote generated speech audio response to audio/main_tts_audio.wav

If you receive errors, or don’t get the response you expect, see Troubleshooting.

Troubleshooting

In these examples, the client should return two prompts, either in text or TTS format:

The initial prompt: “Hello and welcome… What can I do…”
The response to the user’s input: “Perfect! A double espresso coming right up!”

Depending on your input and how you have set up your project, you may encounter issues or receive different responses. Here are some tips for troubleshooting.

Dialog fails to generate TTS output

Confirm in Mix dashboard whether your project is configured to support the TTS output modality. If not, you will not be able to generate TTS output. Edit your project settings in the Mix dashboard to enable the TTS output modality in at least one channel.

Then rebuild project resources and redeploy the resources.

Dialog only partially captures the meaning of your input and asks a followup question

Instead of “Perfect,” your response may be “What type of coffee would you like?” or “What size coffee would you like?” These responses alert you to issues with your input text or your NLU model. The NLU model may fail to understand the size or type of coffee you want in terms of the entities and values defined in the model.

In this case you could try a new input more similar in wording and entity values to your NLU model training samples. Alternatively, add new training samples and rebuild and redeploy your NLU model.

Dialog fails to capture the meaning of the input / NO_MATCH

Check that your input is relevant to the domain on which your model is based and is reasonably similar to the training samples in your NLU model. If your input is very different from the training samples, the NLU model may fail to recognize valid intent or entities.

StatusCode.NOT_FOUND

If you receive a StatusCode.NOT_FOUND error, with details of “model … could not be found,” the model URN you specified does not exist under the client ID and secret you specified for authorization. Check that you have specified the correct URN for your Dialog model and that your authorization credentials give you access to that model.

Next steps

This is a very simple toy client to demonstrate some of the basic mechanics of how to access and use the DLGaaS API. It provides useful functions to access the methods of the DLGaaS API that could serve as building blocks in a more complete application. The client authorizes, starts the dialog, and goes through a single step of a dialog using a single text input string or audio file, provided as a command line argument.

However, additional work is required to create an app that can run through a full, multi-step, interactive dialog, collect input from a user, and handle data transfers.

What follows are some brief tips on next steps to build on this to create a more fully functional app.

Multi-step dialog loop

Most real dialogs include multiple steps of back and forth interaction. You will need to write a loop to cycle through playing prompts, collecting user input and data, as needed, and sending the input and data back within requests until the dialog is finished.

Collecting user input

You will need to write code to collect input, whether text or audio.

The app includes the function execute_request() to handle text input and the functions execute_stream_request() and build_stream_input() to process and stream audio input to Dialog from an existing audio file. However, you will need to write code to collect the text or audio input from the user and save to a file.

Supporting other audio formats

While this sample app only supports PCM encoding, ASRaaS and DLGaaS can support other ASRaaS supported audio formats. If needed for your application, you could add support for other audio formats.

Handling data transfers

Some dialogs rely on data transfers, whether client-side or server side. If your dialog contains data access nodes, then you would need to write code to recognize and handle both data access actions and continue actions as part of your main dialog loop.

Terminating the dialog

You will want to write code to handle an end action indicating that the dialog has terminated at its natural endpoint, as well as to allow the user to leave the conversation early and send a stop_request().

Authorize

DLGaaS is a hosted service on the Nuance Mix platform. To access this service, your client applications must be authorized with an access token generated by the OAuth 2 protocol.

In order to request a token, you need your Mix client ID and secret as described in Prerequisites from Mix. Once you have these credentials, you can request an access token in several ways.

The sample client supports two methods.

Let client generate token

The client includes token-generation code, checking first to see whether the token has expired. To use this method, pass your credentials and the location of the OAuth server in the --clientID, --clientSecret, and --oauthURL arguments.

Note: This is the preferred method.

Edit your run script, run-mix-client.sh or run-mix-client.bat, to add your Mix client ID and secret.

Linux: Edit run-mix-client.sh
Windows: Edit run-mix-client.bat

#!/bin/bash

CLIENT_ID=<Mix client ID, starting with appID:>
SECRET=<Mix client secret>
# Change colons (:) to %3A in client ID 
CLIENT_ID=${CLIENT_ID//:/%3A}

@echo off
setlocal enabledelayedexpansion

set CLIENT_ID=<Mix client ID, starting with appID:>
set SECRET=<Mix client secret>
rem Change colons (:) to %3A in client ID
set CLIENT_ID=!CLIENT_ID::=%%3A!

Generate token manually

For testing purposes, you may instead generate the token manually and pass it to the client as an environment variable in the --token argument.

This token expires after a short time (around 15 minutes) so must be regenerated frequently, but the number of requests is limited for security reasons.

To use this method, use the run-mix-token-client.sh or *.bat file, adding your Mix client ID and secret.

Linux: Edit run-mix-token-client.sh
Windows: Edit run-mix-token-client.bat

#!/bin/bash

CLIENT_ID=<Mix client ID, starting with appID:>
SECRET=<Mix client secret>
# Change colons (:) to %3A in client ID 
CLIENT_ID=${CLIENT_ID//:/%3A}

export MY_TOKEN="`curl -s -u "$CLIENT_ID:$SECRET" \
"https://auth.crt.nuance.com/oauth2/token" \
-d 'grant_type=client_credentials' -d 'scope=dlg' \
| python -c 'import sys, json; print(json.load(sys.stdin)["access_token"])'`"

@echo off
setlocal enabledelayedexpansion

set CLIENT_ID=<Mix client ID, starting with appID:>
set SECRET=<Mix client secret>
rem Change colons (:) to %3A in client ID 
set CLIENT_ID=!CLIENT_ID::=%%3A!

set command=curl -s -u %CLIENT_ID%:%SECRET% ^
-d "grant_type=client_credentials" -d "scope=dlg" ^
"https://auth.crt.nuance.com/oauth2/token"

for /f "delims={}" %%a in ('%command%') do (
    for /f "tokens=1 delims=:, " %%b in ("%%a") do set key=%%b
    for /f "tokens=2 delims=:, " %%b in ("%%a") do set value=%%b
    goto done:
)

:done
rem Check if the token was found
if not !key!=="access_token" (
    echo Access token not found^^!
    pause
    exit
)

rem Remove quotes
set MY_TOKEN=!value:"=!

Feedback

Was this page helpful?

Glad to hear it! Please tell us how we can improve.

Sorry to hear that. Please tell us how we can improve.