Configuration for tuning
After deployment to a production system, applications accept calls from real users (not test users), and the system generates call logs for each session that occurs. The logs represent a history of in-production performance, and are critical for gaining insight to application performance.
Application developers use the call logs (and call center reports from users and agents) to identify problems. The solution to a problem might involve a change to configuration, application logic, prompts, or communications between components (such as database interactions). Fixing problems with the configuration is the least expensive technique. No recompilation or deployment is needed. It is only necessary to keep a record of changes on the production system so they can be propagated to any mirrored systems, and replicated in the original development environment for future updates.
For an overview of the tuning lifecycle, see Tuning applications. The tuning mechanisms are the same as during application development (Configuration during application development).
To optimize the speed of Recognizer initialization, and subsequent availability of licenses, check the values of the parameters:
Parameter |
Description |
Default |
---|---|---|
Number of licenses to check out during initialization of the speech detector (endpointer). |
8 (licenses) |
|
The number of licenses to check out during initialization. |
4 (licenses) |
|
File that loads grammars during initialization. |
$SWISRSDK/config/SWIgrmPreload.xml |
|
Determines whether recognition servers share memory. |
0 (disabled) |
These parameters tune the collection of user speech:
Parameter |
Description |
Default |
---|---|---|
Allows callers to interrupt prompts. |
1 (enabled) |
|
How long to wait before concluding that a caller is finished speaking. |
0 (timer disabled) |
|
Duration of silence to determine that callers have finished speaking. |
1500 (milliseconds) |
|
Maximum duration of an utterance collected from users. |
-1 (no timeout) |
|
Disables barge-in briefly at the beginning of a prompt. |
0 (no delay) |
|
How long to wait for speech after a prompt ends. |
7000 (milliseconds) |
These parameters tune the collected audio:
Parameter |
Description |
Default |
---|---|---|
Sensitivity of the speech detector when looking for speech. |
0.5 |
|
Safety margin to ensure that the begin-of-speech is captured. |
200 (milliseconds) |
|
Safety margin to ensure the end-of-speech is captured. |
350 (milliseconds) |
|
Controls how loudly callers must speak to interrupt prompts (barge-in) and detect speech. |
50 (percent) |
|
Maximum CPU time used to recognize an answer. |
20000 (milliseconds) |
These parameters control magic word recognition, which enables the application to recognize a specific speech or DTMF sequence and ignore anything else.
Parameter |
Description |
Default |
---|---|---|
Maximum duration of a magic word candidate for recognition. |
800 (milliseconds) |
|
Minimum duration of a magic word candidate for recognition. |
200 (milliseconds) |
|
Sets special recognition modes (such as magic word) in the endpointer. |
begin_only |
|
Sets special recognition modes in Recognizer. |
normal |
These parameters accept multiple values to maximize accuracy given the available CPU capacity. They can improve accuracy during periods of reduced load, and slightly reduce accuracy during periods of heavy load to avoid delays in response time.
Parameter |
Description |
Default |
---|---|---|
Defines ranges of system activity (idle, normal, and busy) based on CPU capacity. |
0, 14, 40, 101 (percentages) |
|
Overrides the automatic detection of CPU load, and forces specific values for parameters that balance speed and accuracy. |
on |
|
Maximum number of active FSM arcs. |
10000, 5000, 3000 |
|
Provides a secondary guide to the Viterbi beam search. |
-30, -60, -60 |
|
Limits search paths that end in a silence model during pruning. |
56, 56, 56 |
|
Primary guide for the Viterbi beam search. |
0, -15, -35 |
|
Temporarily stops self-learning activities for one or more languages. |
(depends on language) |
These parameters balance accuracy and CPU load:
Parameter |
Description |
Default |
---|---|---|
Maximum number of nodes visited during the a-star search. |
100000 (nodes visited) |
|
Maximum CPU time for the search phase of recognition. |
5000 (milliseconds) |
|
Maximum number of candidates for filling the n-best list. |
999999 (sentences) |
|
Density of the word lattice. |
100.0 |
|
Density of the word lattice. |
7.0 |
These parameter optimize memory usage when you run more than one recognition server on a host:
Parameter |
Description |
Default |
---|---|---|
Base address in memory for Recognizers to use for shared memory. |
(empty, the default address is used) |
|
Determines whether recognition servers share memory. |
0 (disabled) |
|
Maximum time for a process to wait for a resource in shared memory. |
30 (seconds) |
|
Size of the memory region shared by recognition server processes. |
0 (automatically determined) |
|
Generates warnings when shared memory is getting filled. |
90 (percent) |
This parameter is useful when Nuance provides a custom language model for an application:
Parameter |
Description |
Default |
---|---|---|
Adds weight to match the dynamic ranges of language and acoustic models. |
1.0 |
These parameters are useful when Nuance provides custom acoustic models for an application:
Parameter |
Description |
Default |
---|---|---|
Defines allophone maps for secondpass processing in Recognizer. |
(default mapfiles used) |
|
Defines finite state machines for secondpass processing in Recognizer. |
(default fsm files used) |
|
Acoustic models for secondpass processing in Recognizer. |
(default models used) |
These parameters control Recognizer’s caching behavior:
Parameter |
Description |
Default |
---|---|---|
Enables (or disables) the disk and inet caches. |
1 (enabled) |
|
Default optimization level for fully-optimized grammars. |
9 |
|
Number of previous activations before fully optimizing a grammar. |
3 |
|
Largest size of the disk and inet caches after cache cleanup. |
0 (adaptation enabled) |
|
Minimum size of disk and inet caches entries. |
0 (KB) |
|
Desired maximum size of the disk and inet caches. |
500 (MB) |
|
Keeps preloaded grammars in the memory cache. |
0 (flushing allowed) |
|
Maximum size of the memory cache after removing grammars to create available space. |
85 (MB) |
|
Minimum size for memory cache entries. |
0 (KB) |
|
Desired size of the memory cache. |
100 (MB) |
For details on caching, see Understanding grammar caching.
Performance-related licensing problems can be detected these parameters:
Parameter |
Description |
Default |
---|---|---|
Number of licenses the endpointer can claim before the system generates a warning. |
-1 (disabled) |
|
The number of licenses Recognizer can claim before the system generates a warning. |
-1 (disabled) |
In addition to checking Recognizer’s diagnostic log for licensing issues, check the logs from the connected License Manager.