Tools

Streaming Media Properties (MRCP)

Reference Number: AA-01101 Views: 18170

0 Rating/ Voters

The following parameters are part of the MRCP specification, and can be controlled via the LvMediaServer.config file, located in the config directory of the Windows Lumenvox installation folder. By default, this location is C:\Program Files\Lumenvox\config\.

For Linux installations, you would edit the mediaserver.conf file in /etc/lumenvox/.

They may also be set with the appropriate header as part of a RECOGNITION or SET-PARAMS method. Refer to the MRCPv2 draft, section 9.4, for more information about changing the parameters using recognizer headers. A majority of these parameters are hidden from view in the configuration file since we recommend usage of the headers in SET-PARAMS or RECOGNITION to set these values. However, entering any of these values in the [MRCP] section of the media server config file should work.

See Configuring Parameters for more information about changing various MRCP parameters.

nbest_length

When the recognizer matches audio with a grammar, it may come up with more than one alternative match because of confidence levels in certain words or conversation paths. By default, the Speech Engine will only return the best match above the confidence threshold. The nbest_length parameter determines how many different results to return. All alternatives must still be above the confidence-threshold. A value greater than one does not guarantee that the recognizer will send the requested number of alternatives. The minimum value for this field is 1.

Default: 1

confidence_thrsld

When a recognition resource recognizes an utterance with some portion of the grammar, it associates a confidence level with that conclusion. The confidence-threshold parameter tells the recognizer resource what confidence level should be considered a successful match. This is a number from 0-1 indicating the recognizer's confidence in the recognition. If the recognizer determines that its confidence in all its recognition results is less than the confidence threshold, then it returns no-match as the recognition result.

Default: 0.45

sensitivity_lvl

Controls the sensitivity level for barge-in. Tthe higher the value, the more easily the system will barge-in. This parameter currently maps to the volume sensitivity in the LumenVox API, however, in the LumenVox API, a higher value refers to lower sensitivity. Another similar value to control sensitivity is available in our Vendor-Specific Recognizer Properties as snr_sensitivity.

Range: 0 - 100 (MRCP v1) or 0.0 - 1.0 (MRCP v2)

Default: 50 (MRCP v1) or 0.5 (MRCP v2)

speed_vs_accuracy

Specifies how much accuracy is valued from an SRE decode, at the cost of speed. In MRCP v1, a value of 0 is for low speed and higher accuracy. In MRCP v2 and the LumenVox API, a value of 0 is for high speed and lower accuracy.

Range: 0 - 100 (MRCP v1) or 0.0 - 1.0 (MRCP v2)

Default: 50 (MRCP v1) or 0.5 (MRCP v2)

speech_incomplete_timeout

At this time, this parameter is unused.

no_input_timeout

After recognition has started, if there is no speech detected for a certain period of time, the recognizer will send a RECOGNITION-COMPLETE event to the client and terminate the recognition operation. The no-input-timeout parameter controls the length of this time period. The value is in milliseconds.

Default: 20000

dtmf_termination_timeout

The terminating timeout to use when recognizing DTMF input. When no DTMF input has been detected for the length of time set by this parameter, the system will stop waiting for DTMF input. The value is in milliseconds. The value for this field ranges from 0 to MAXTIMEOUT, where MAXTIMEOUT is platform specific. Refer to the VoiceXML 2.0 specification, Appendix D for detailed information.

Default: 10000

dtmf_inter_digit_timeout

This parameter specifies the inter-digit timeout value to use when recognizing DTMF input. The value is in milliseconds. The value for this field ranges from 0 to MAXTIMEOUT, where MAXTIMEOUT is platform specific. Refer to the VoiceXML 2.0 specification, Appendix D for detailed information.

Default: 5000

recognizer_start_timers

If set to false (the default), the recognizer will not start the timers until the client sends a RECOGNITION-START-TIMERS method to the recognizer.

Default: false

recognition_timeout

After recognition has started and there is no match for a certain period of time, the recognizer can send a RECOGNITION-COMPLETE event to the client and terminate the recognition operation. The recognition-timeout parameter field sets this timeout value. The value is in milliseconds. The value for this field ranges from 0 to MAXTIMEOUT, where MAXTIMEOUT is platform specific.

Default: 10000

dtmf_inter_digit_timeout

Default: 5000

save_waveform

The MRCP server can save the audio stream that was used for a recognition if the client sets this parameter to true. The server records the audio and makes it available to the client in the form of a URI returned in the waveform-uri header field in the RECOGNITION-COMPLETE event. If there is an error in recording the stream, the recognizer returns a blank waveform-uri header field.

Default: false

Version 10.4.100 onward

waveform_url_location

This parameter is used in conjunction with the save-waveform parameter. This tells the MRCP server where to put the recorded audio-stream file on the local filesystem. It is only used if the save_waveform parameter is set to true.

Default: "" (empty string)

waveform_url_prefix

Note: This option was added to Version 10.4 of the LumenVox Media Server.

Specifies prefix of returned wavefile URLs. This prefix will be returned with the filename appended. The default waveform_url_location will be used if this value is not set.

Using this option may be useful when exposing these files via some http/https server, allowing you to specify a different returned URL in place of the actual location of the audio file(s) on the file system.

Default: "" (empty string)

remove_waveform_files

If enabled (1), any waveform files generated on the local filesystem will be erased at the end of the session.

This option extends the MRCP specification, which indicates that any generated waveform files should be erased.

Disabling this option (0) allows these files to persist beyond the lifespan of the session, however in this situation, the Media Server will never clean up these files, and it is therefore the responsibility of some external mechanism to maintain these files if this option is selected.

Possible Values:

0 (disabled) - files will not be erased

1 (enabled) - files will be erased at session end

Default: 1 (enabled)

Streaming Media Properties (MRCP)

nbest_length

confidence_thrsld

sensitivity_lvl

speed_vs_accuracy

speech_incomplete_timeout

no_input_timeout

dtmf_termination_timeout

dtmf_inter_digit_timeout

recognizer_start_timers

recognition_timeout

dtmf_inter_digit_timeout

save_waveform

waveform_url_location

waveform_url_prefix

remove_waveform_files

See Also