SSM training file
Below is an example training file. A description of the header and the main body of the XML-formatted file follows.
<!DOCTYPE SSMTraining SYSTEM "SSMTraining.dtd">
<SSMTraining version="1.0.0" xml:lang="en-us">
  <features>
    <word>broken</word>
    <word>computer</word>
    <word>is</word>
    <!-- more words -->
    <word>what</word>
    <word>are</word>
    <word>promotions</word>
  </features>
  <semantic_models>
    <SSM>
      <meaning prior="1.0">
        <slot name="route">sales</slot>
      </meaning>
      <meaning prior="0.8">
        <slot name="route">tech_support</slot>
      </meaning>
    </SSM>
  </semantic_models>
  <training>
    <sentence count="1">
      <semantics>
        <slot name="route">tech_support</slot>
      </semantics>
      my computer is broken
    </sentence>
    <sentence count="1">
      <semantics>
        <slot name="route">sales</slot>
      </semantics>
      what are the promotions
    </sentence>
  </training>
</SSMTraining>
The initial header lines for an SSM training file are similar to those required in an SLM training file (see SLM training file header):
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE SSMTraining SYSTEM "SSMTraining.dtd">
<SSMTraining version="1.0.0" xml:lang="en-us">
The most important differences are that the document type is "SSMTraining", that the related document type definition is SSMTraining.dtd (also located in the %SWISRSDK%\config directory), and that the main declaration uses an <SSMTraining> element rather than an <SLMTraining> element.
<SSMTraining> element
Unlike the <SLMTraining> element, the <SSMTraining> element does not support the <meta> element for specifying configuration parameters, nor the <lexicon> element for specifying a user dictionary. To specify parameters, you must instead use the <param> and <value> elements in the file main body: see SSM training configuration parameters for details.
However, <SSMTraining> does support the <training> and <test> elements. See Training and test sections.
The <SSMTraining> element also supports two SSM-specific elements used in the training file main body: the <semantic_models> and <features> elements.
SSM training file main body
The main body of the training file uses several elements that are specific to SSM training. These are organized into two main sections of the file: the <features> section, and the <semantic_models> section.
<features> section (vocabulary and classes)
The <features> section in an SSM training file serves a similar role to the <vocab> section in an SLM training file: it defines the vocabulary words and classes using the <word> and <ruleref> elements. This vocabulary section defines all words allowed in the other sections of the training file. (Words omitted from this section are ignored if they appear in sentences.)
You can also use ECMAScript in the <features> section to modify or augment <ruleref> meanings. See Feature extraction and ECMAScript.
<word> element
Within the vocabulary section of the training file, each <word> declares a single vocabulary word or a phrase joined by underscores.
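For example, a brief sketch of vocabulary entries (the specific entries are illustrative, not taken from the example above):
<features>
  <!-- single vocabulary words -->
  <word>call</word>
  <word>home</word>
  <!-- a multi-word phrase joined by underscores, declared as one vocabulary item -->
  <word>thank_you</word>
</features>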
<ruleref> grammar classes
The optional <ruleref> element defines a class of words in sentences that are best handled by an external grammar. The ruleref imports a rule from the external grammar into the training file:
- The <ruleref> element appears among the features and, optionally, the sentences of training files.
- When used in the <features> section, the <ruleref> acts as a declaration for the grammar, which is used later inside of sentences. Without this declaration, the <ruleref> cannot appear in training or test sentences.
The words are not imported individually from the grammar, and they cannot be used individually in sentences. Instead, they are imported as a class, and they can only be specified in sentences as a <ruleref> class. See the feature_extraction attribute (manual or automatic).
- When used inside a <sentence>, the <ruleref> acts as a placeholder for the words covered by the named grammar. Recognizer treats those words as a class, and for the purposes of statistical modeling, the class is treated as a single word.
Typically, the words are left in the input file and feature extraction (matching rulerefs) happens automatically. In this case, depending on the setting of the feature_generation attribute, the words are either augmented with the class feature, replaced by it, or removed altogether.
Classes help generalize from the training sentences. For example, if an application accepts restaurant reservations, it would be useful to have a restaurant <ruleref> instead of writing individual restaurant names in the training sentences. Doing this has numerous advantages: it automates the creation of additional sentences with all names in the restaurant class; it ensures that no restaurant name is accidentally omitted from the sentences; and when the list of available restaurant names changes in the restaurant grammar, no change is needed for the training sentences.
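For example, a minimal sketch of this approach; the grammar URI restaurants.grxml, its rule name, and the slot name and value are hypothetical:
<features>
  <word>table</word>
  <word>for</word>
  <word>two</word>
  <word>at</word>
  <!-- declare the class once; the grammar supplies the restaurant names -->
  <ruleref uri="restaurants.grxml#restaurant"/>
</features>
...
<training>
  <sentence count="1">
    <semantics>
      <slot name="action">reserve</slot>
    </semantics>
    <!-- with automatic feature extraction, the restaurant name below is matched by the declared ruleref -->
    table for two at chez nous
  </sentence>
</training>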
The <ruleref> element allows the following attributes:
- origuri: Optional. This attribute declares the URI of the original grammar:
<features>
<word>sample</word>
...
<ruleref origuri="small.grxml#genre"/>
...
<word>vocabulary</word>
</features>
If the uri attribute declares a compiled grammar, the origuri attribute is required.
- uri: Required. This attribute declares the URI of the grammar:
<features>
<word>sample</word>
...
<ruleref uri="small.gram#genre"/>
...
<word>vocabulary</word>
</features>
Normally, the rulename (the text after the #) is optional because a uri refers to the root rule of the grammar by default. However, in this context the rulename is required.
- tag: Optional. Only allowed inside the <features> element. This attribute specifies an ECMAScript expression that will be executed when Recognizer parses the words represented by a <ruleref> subgrammar. A typical use for this attribute is to assign additional slot/value pairs.
- words: Optional. This attribute provides a way to remember the original phrase or sentence that has been converted into a ruleref. This is useful to the creator of the training file. It is not currently used by the compiler.
For example, if the application recognizes dates, you would typically convert actual dates that are spoken and transcribed during data collection into a generalized date ruleref. This attribute provides a way to record the original actual dates. An example:
<sentence>
  <semantics> <slot name="route">MONEY</slot> </semantics>
  <ruleref uri="./number.xml" words="two" />
  <ruleref uri="./currencies.xml" words="pesos" />
</sentence>
The example implies that the phrase "two pesos" was seen during data collection, and has been generalized to be a number followed by a currency. This attribute is used strictly for annotation; it has no effect on the SSM.
feature_generation attribute
The feature_generation attribute controls the handling of <ruleref> during training. When the recognized text contains a string that matches a <ruleref>, this parameter determines the contents of the feature set:
- fragment: Retains the string and the matching <ruleref>. This is the default. For example, “departing january twenty seventh” might become “departing january twenty seventh <ruleref uri="date.grxml"/>”.
- remove: Removes the string and the matching <ruleref>. For example, “departing january twenty seventh” might become “departing”.
- stem: Replaces the string with the <ruleref>. For example, “departing january twenty seventh” might become “departing <ruleref uri="date.grxml"/>”.
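As a brief sketch, you set the attribute on the <ruleref> declaration in the <features> section; the grammar URI and rule name below (date.grxml#date) are hypothetical:
<features>
  <word>departing</word>
  ...
  <!-- hypothetical date grammar; with "stem", matched date phrases are replaced by the class feature -->
  <ruleref uri="date.grxml#date" feature_generation="stem"/>
</features>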
See Feature extraction and ECMAScript.
feature_keys and feature_values attributes
If you are adding a call routing context to an existing application, you likely already have several grammars representing semantics relevant to the SSM. The feature_values and (especially) feature_keys attributes are a significant time saver: they enable the re-use of existing grammars in the SSM's feature extractor and leverage the semantic interpretations those grammars already provide. They can also greatly increase the coverage of the SSM's feature extractor by building on the existing coverage of those grammars.
The feature_keys attribute works in conjunction with a <ruleref> that points to a grammar that sets key/value pairs (slots) in ECMAScript (found in a <tag> statement in that grammar). The feature_keys attribute is a space-delimited list of keys that are returned by the grammar. Upon encountering a fragment in a sentence that fires the rule named in the <ruleref>, the resulting key/value pairs are analyzed and matched against those listed in the feature_keys attribute. If a match is found, the name of the matching key is appended to the <ruleref>'s name to form the full feature name.
The feature_values attribute allows for dynamic (training-time) generation of feature names based on the input sentence. Normally, features are listed as is in the <features> section and used as is when encountered during training. With the feature_values attribute, the exact name of the feature is determined at training time rather than being explicitly listed in the <features> section of the XML training file.
The feature_values attribute also works in conjunction with a <ruleref> pointing to a grammar that sets key/value pairs (slots) in ECMAScript (found in a <tag> statement in that grammar). The feature_values attribute is a space-delimited list of keys returned by the grammar. Upon encountering a fragment in a sentence that fires the rule named in the <ruleref>, the resulting key/value pairs are analyzed and matched against those listed in the feature_values attribute. If a match is found, the value of the matching key is appended to the <ruleref>'s name to form the full feature name.
The following example shows a typical use of feature_keys and feature_values, based on a <ruleref> from a date grammar (my_gram.grxml).
Suppose the grammar includes the following rule:
<rule id="month" scope="public">
  <one-of>
    <item> january
      <tag> MONTH = "01"; WINTER="1"; SPECIAL_DAY="New Year" </tag>
    </item>
    <item> february
      <tag> MONTH = "02"; WINTER="1"; SPECIAL_DAY="Valentine" </tag>
    </item>
    ...
    <item> july <tag> MONTH = "07"; SUMMER="1" </tag> </item>
    ...
    <item> december
      <tag> MONTH = "12"; FALL="1"; SPECIAL_DAY="Christmas" </tag>
    </item>
  </one-of>
</rule>
You then define two <ruleref>s that refer to this rule—one for the feature_keys, and one for the feature_values:
- Example 1 (feature_keys):
<ruleref uri="my_gram.grxml#month" feature_generation="stem" feature_keys="WINTER SPRING SUMMER FALL"/>
- Example 2 (feature_values):
<ruleref uri="my_gram.grxml#month" feature_generation="fragment" feature_values="MONTH SPECIAL_DAY"/>
Based on this grammar and these <ruleref> definitions, the following features will be used in training the SSM (assuming that all words are defined in the <features> section):
- Input: for february ninth
  Example 1 (feature keys): for my_gram.grxml#month#WINTER ninth
  Example 2 (feature values): for february my_gram.grxml#month#02 my_gram.grxml#month#Valentine ninth
- Input: july second
  Example 1 (feature keys): my_gram.grxml#month#SUMMER second
  Example 2 (feature values): july my_gram.grxml#month#07 second
- Input: on december third
  Example 1 (feature keys): on my_gram.grxml#month#FALL third
  Example 2 (feature values): on my_gram.grxml#month#12 my_gram.grxml#month#Christmas third
If you specify both feature_keys and feature_values in the same <ruleref>, the resulting feature contains both. For example, suppose you combine both <ruleref>s like this:
<ruleref uri="my_gram.grxml#month" feature_generation="stem" feature_keys="WINTER SPRING SUMMER FALL MONTH SPECIAL_DAY" feature_values="MONTH SPECIAL_DAY"/>
The following summarizes the features that can appear in this case:
- Input: for february ninth
  Result: for my_gram.grxml#month#WINTER my_gram.grxml#month#MONTH#02 my_gram.grxml#month#SPECIAL_DAY#Valentine ninth
- Input: july second
  Result: my_gram.grxml#month#SUMMER my_gram.grxml#month#MONTH#07 second
- Input: on december third
  Result: on my_gram.grxml#month#FALL my_gram.grxml#month#MONTH#12 my_gram.grxml#month#SPECIAL_DAY#Christmas third
Note: Like all other features, a feature must be exercised by <sentence> elements in the <training> section to be used to train the SSM, even if it is listed in the <features> section. This is especially true for feature_values. In the above example, if "may" is never seen during training, the feature "my_gram.grxml#month#05" will never participate in any decision at testing time. In the case of feature_keys, if "april" was seen in training, the feature "my_gram.grxml#month#SPRING" will be active in the SSM, which means that at testing time "may" will indeed participate in the SSM's decision.
<grammar_script> root ECMAScript
Use <grammar_script> to define a final script (a root ECMAScript) that runs after parsing is complete. This element is a child of <features> and a sibling of <ruleref>.
In this example, if the input is “help help help”, then foundhelp is 3.
<features>
  <ruleref uri="rbg_tagging.xml#SSM_to_remove"
           feature_generation="remove"/>
  <ruleref uri="rbg_tagging.xml#help_command"
           feature_generation="remove">
    <tag>cmd='help';</tag>
  </ruleref>
  <grammar_script>
    SWIjsPrint( ":::" + typeof(cmd) + "\n" );
    if ( typeof(cmd)!="undefined" && cmd == "help" )
    {
      foundhelp = (typeof(foundhelp)=="undefined" ? 1 : foundhelp + 1);
    }
  </grammar_script>
  ...
</features>
<semantic_models> section
The <semantic_models> section is a required section. It declares the SSM label classifiers, sets parameters, and lists all possible meanings returned by the SSM. The main entries in the section consist of <SSM> elements, which specify the label names, and define the associated meanings using the <meaning> element. The meaning element itself may have different attributes, as discussed below.
As previously discussed, SSM training configuration parameters can only be set in this section. See SSM training configuration parameters for details.
The training file fragment below specifies a classifier labelled "action". By default, the <SSM> element fills a slot of the specified label name. In this example, the action slot has possible values of dial and enroll:
<semantic_models>
<SSM label="action">
<meaning prior="-1.3">
dial
</meaning>
<meaning prior="-.8">
enroll
</meaning>
</SSM>
</semantic_models>
Named slots have precedence over labels. If an <SSM> element has both a label and named slots in its <meaning> elements, the label merely identifies the SSM, and the meanings determine the values of the named slots.
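For example, a sketch where the label identifies the SSM and the named slots carry the returned values (the label, slot names, and values here are illustrative):
<SSM label="routing">
  <!-- "routing" merely identifies the SSM; the meanings fill the named slots -->
  <meaning>
    <slot name="action">transfer</slot>
    <slot name="destination">billing</slot>
  </meaning>
  <meaning>
    <slot name="action">transfer</slot>
    <slot name="destination">support</slot>
  </meaning>
</SSM>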
You can specify an initial probability for each meaning using the prior attribute of the <meaning> element. The training program uses such initial probabilities as preliminary values, and adjusts them during processing. (For a related discussion, see use_prior_weight.)
By default, the special key SWI_meaning has the value of the concatenation of all labels set in the SSM (see SWI_meaning). You also can set SWI_meaning explicitly as a single slot, just as you would any label:
<SSM label="SWI_meaning">
<meaning> dial </meaning>
</SSM>
In the next example, a single decision by the classifier sets two slots, action and destination:
<semantic_models>
<SSM>
<meaning>
<slot name="action">dial</slot>
<slot name="destination">home</slot>
</meaning>
<meaning>
<slot name="action">dial</slot>
<slot name="destination">office</slot>
</meaning>
</SSM>
</semantic_models>
<meaning> element
The <meaning> element, which must be a child of an <SSM> element, is a container for slots and values. There is a limit of 5,000 meanings in an SSM.
default_meaning attribute
Use the default_meaning attribute to ensure that the SSM returns at least one meaning for a slot, even if the confidence score of the default is low. If there is no default, and the SSM finds no evidence for a meaning in the recognized words, Recognizer fills no slots with that SSM. If there is no other interpretation grammar, there is no recognition result; the utterance is marked out of grammar.
The example below defines a slot named rating, with possible values of normal (the default value) and high:
<SSM label="rating">
  <meaning default_meaning="true"> normal </meaning>
  <meaning> high </meaning>
</SSM>
Note: Only one default_meaning is allowed per SSM.
This attribute is most useful when activating a single SSM. But you can also use it for multiple activations. If your wrapper grammar activates parallel SSMs, each can return a default meaning. If you activate SSMs in prioritized groups, the first group that returns a default meaning takes precedence; subsequent groups are not considered.
reject_meaning attribute
To explicitly prevent a <meaning>, use the reject_meaning attribute. The purpose of this attribute is similar to the use of decoy words in an SRGS grammar (see SWI_decoy). If a meaning with this attribute is the top choice of the SSM, the SSM returns no meaning. If there is no other interpretation grammar, there is no recognition result (the utterance is assumed to be out of grammar).
You can assign the reject_meaning attribute to any number of meanings:
<SSM label="priority">
  <meaning reject_meaning="true"> normal </meaning>
  <meaning> high </meaning>
</SSM>
Training and test sections
The <training> and <test> sections declare sentences for training the SSM and estimating its accuracy. The elements have identical syntax and child elements.
The sentences in the <training> and <test> sections should each reflect the actual distribution of sentences seen in the application. The same literal sentence can appear in both sections; however, the training and test sections cannot be identical. The best way to select training sentences is to pick a number of sentences randomly from your corpus of data, and then use the remaining sentences for the test section.
Typically, a training file has one <training> and one <test> element, but you can divide sentences among several training or test sections to allow for different settings of the feature_extraction attribute.
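For example, a minimal sketch of a main body containing both sections (the sentences and slot values are illustrative, and all words must also be declared in the <features> section):
<training>
  <sentence count="1">
    <semantics> <slot name="route">billing</slot> </semantics>
    i have a question about my bill
  </sentence>
  ...
</training>
<test>
  <sentence count="1">
    <semantics> <slot name="route">billing</slot> </semantics>
    there is a problem with my bill
  </sentence>
  ...
</test>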
feature_extraction attribute
The feature_extraction attribute is intended to avoid extra processing when the sentences in the training file have already had their words converted into <ruleref> elements (that is, when feature extraction has already been performed manually).
<training feature_extraction="manual">
...training set...
</training>
Normally, you do not expand rulerefs in advance of training, and instead rely on automatic expansion. The default behavior is feature_extraction="automatic". Setting this attribute to "manual" is not recommended, and can have a severe effect on recognition accuracy if used improperly.
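As a sketch (the grammar URI, rule name, and slot value below are hypothetical), a manually extracted training section replaces the spoken words with a <ruleref> and can record the original words in the words attribute:
<training feature_extraction="manual">
  <sentence count="1">
    <semantics> <slot name="intent">book_flight</slot> </semantics>
    <!-- the ruleref must also be declared in the <features> section -->
    departing <ruleref uri="date.grxml#date" words="january twenty seventh"/>
  </sentence>
  ...
</training>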
Also see Testing and tuning SSMs.
<sentence> element
The <sentence> element defines a valid utterance in an SSM <training> or <test> section. The order of the sentences has no effect on the results.
To train a good SSM, populate the training file with 10,000 to 20,000 sentences, or approximately 200 sentences per label. In practice, your collected data may not cover all possibilities equally. For example, you might have 400 sentences for one label and only 50 for another. The distribution of labels (in both the training and test sections) must reflect the actual distribution in the application. Thus, it is important that the training and test sections draw their samples randomly from your corpus of data, if this is at all possible.
Below is a fragment of a training section that shows one sentence:
<training>
<sentence count="2.3">
<semantics>
dial
</semantics>
<!-- Use spaces to separate words -->
call home
</sentence>
...
</training>
Above, the sentence “call home” is represented 2.3 times for each training iteration. If this sentence is recognized, its <semantics> meaning (dial) fills the appropriate <SSM> slot, as determined in this case by the SSM label.
<semantics> element
Use the <semantics> element to define the returned meanings of a sentence.
Below, when “call home” is recognized, “dial” is returned to the application:
<sentence>
<semantics>
dial
</semantics>
call home
</sentence>
Sentences can have more than one meaning. Below is an example fragment where a sentence fills two slots:
<sentence>
<semantics>
<slot name="action">dial</slot>
<slot name="destination">home</slot>
</semantics>
call home
</sentence>
Sentence counts
Use the count attribute to multiply the occurrence of a sentence. This attribute is a short-cut that repeats the same sentence multiple times. The value of count is a floating-point number.
In a training sentence, this attribute increases the sentence’s weight in the resulting models. The following example implies that the sentence is 100 times more common than phrases with a count of 1 (the default):
<sentence count="100">
<semantics>
<slot name="example">sample</slot>
</semantics>
this is a sample
</sentence>
In a test sentence, this attribute increases the sentence’s weight in the resulting error rates. For example, if a test sentence has a count of 20 and is not correctly classified by the current iteration of the SSM, the error is counted 20 times, which affects the error rate more severely than if the sentence appeared only once.
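For example, a sketch of a weighted test sentence (the slot value is illustrative):
<test>
  <!-- if this sentence is misclassified, it contributes 20 errors to the reported error rate -->
  <sentence count="20">
    <semantics>
      <slot name="route">billing</slot>
    </semantics>
    i want to pay my bill
  </sentence>
</test>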