Supported fields and defaults

Neural TTSaaS supports a subset of the messages and fields in the Synthesizer API file. Some fields are ignored and others are not allowed, meaning they trigger an error if included.

Supported in GetVoicesRequest

These fields are supported in GetVoicesRequest.

Field in GetVoicesRequest Supported and default
voice (Voice) Default: all voices
   name Default: all names
   language Default: all languages
   gender (EnumGender) Default: all genders
   sample_rate_hz Default: all sample rates
   foreign_languages Default: all foreign languages
   styles Default: all styles
  Supported proto fields in GetVoicesRequest  

Supported in SynthesisRequest

These fields are supported in SynthesisRequest.

Field in SynthesisRequest Supported and default
voice (voice)
   name Mandatory for plain text, optional for SSML input.
audio_params (AudioParameters)
   audio_format (AudioFormat) Default: PCM 22050 Hz
      pcm Signed 16-bit little endian PCM
      alaw G.711 A-law, 8 kHz
      ulaw G.711 Mu-law, 8 kHz
      ogg_opus Ogg Opus, 16 kHz or 24 kHz
      ogg Opus, 16 kHz or 24 kHz
input (Input)
   text (Text) Either text or SSML is mandatory
   ssml (SSML) Either text or SSML is mandatory
event_params (EventParameters)
   send_bookmark_marker_events Default is false: Do not send
   send_visemes Default is false: Do not send
   suppress_input Default is false: Include input
client_data Default is blank
user_id Default is blank
  Supported proto fields in SynthesisRequest  

Not allowed

These fields are not supported in SynthesisRequest and return an error when included.

Message in SynthesisRequest Fields not allowed
input (Input) text.uri
ssml.uri
tokenized_sequence

Ignored

These fields are ignored. They may be included without error but have no effect.

Item Ignored
UnarySynthesize UnarySynthesisResponse
voice (Voice) model
age_group
language_tls
restricted
versions
enum_gender (EnumGender) NEUTRAL
audio_params (AudioParameters) volume_percentage (set to 80)
speaking_rate_factor (set to 1.0)
audio_chunk_duration_ms
target_audio_length_ms
disable_early_emisssion
ogg_opus (OggOpus) bit_rate_bps
max_frame_duration_ms
complexity
vbr
opus (Opus) max_frame_duration_ms
complexity
vbr
input (Input) resources
lid_params
download_parameters
ssml (SSML) ssml_validation_mode
event_params (EventParameters) send_sentence_marker_events
send_word_marker_events
send_phoneme_marker_events
send_paragraph_marker_events
send_log_events