For example, when the user replies with a long description that is comprised of Sometimes, the detection of end of utterance occurs too quickly and the user is cut off while VoiceAI Connect to the bot as a separate textual message. Of detected audio silence (or by other means). } Continuous automatic speech recognition (ASR)īy default, the speech-to-text service recognizes the user's end of utterance according to the duration For example, a value of 0.35 indicates that a 35% or above confidence level of a language match must be reached to enable a language switch.Įnables the language switch to an alternate language if languageDetectionMinConfidence value has been reached.įalse: (Default) Disables a language switch even if the languageDetectionMinConfidence value has been reached VoiceAI Connect Enterprise supports this feature from Version 3.4 and later.įalse: (Default) Disables language detection.ĭefines a list of up to three alternative languages (in addition to the current language) that will be used to detect the language spoken.ĭefines the confidence level that a language recognition must reach to enable a language switch. This parameter is applicable only for AWS. Refer to the Voices in Amazon Polly table to check if the specific language voice supports Neural Voice and/or Standard Voice. Note: This string is obtained from the text-to-speech service provider and must be provided to AudioCodes, as discussed in Text-to-speech service information.ĭefines the AWS text-to-speech voice as Neural Voice or Standard Voice. Google: The parameter is configured with the value from the 'Voice name' column in Google's Cloud Text-to-Speech table (e.g., "en-US-Wavenet-A").ĪWS: The parameter is configured with the value from the 'Name/ID' column in Amazon's Polly TTS table (e.g., "Hans").Īlmagu: The parameter is configured with the value from the 'Voice' column in Almagu's TTS table (e.g., "Osnat"). VoiceAI Connect Enterprise supports this parameter from Version 3.2 and later.ĭefines the voice name for the text-to-speech service.Īzure: The parameter is configured with the value from the 'Short voice name' column in Azure's Text-to-Speech table (e.g., "it-IT-ElsaNeural"). The value is obtained from the service provider. If these services use the same language, you can use the language parameter instead. The parameter is required if different languages are used for the text-to-speech and speech-to-text services. If different languages are used for the text-to-speech and speech-to-text services, use the ttsLanguage and sttLanguage parameters instead.ĭefines the language (e.g., "en-ZA" for South African English) of the bot conversation and is used for the speech-to-text service. This string is obtained from the speech-to-text and text-to-speech service provider and must be provided to AudioCodes, as discussed in Text-to-speech service information and Speech-to-text service information. Google: The parameter is configured with the value from the 'Language code' column in Google's Cloud Text-to-Speech table (e.g., "en-US").ĪWS: The parameter is configured with the value from the 'Language' column in Amazon's Polly TTS table (e.g., "de-DE"). Google: The parameter is configured with the value from the 'languageCode' (BCP-47) column in Google's Cloud Speech-to-Text table (e.g., "nl-NL").Īzure: The parameter is configured with the value from the 'Locale' column in Azure's Text-to-Speech table (e.g., "it-IT"). The value is obtained from the service provider.Īzure: The parameter is configured with the value from the 'Locale' column in Azure's Speech-Text table (e.g., "en-GB"). This feature is configured per bot by the Administrator, or dynamically by the bot during conversation:ĭefines the language (e.g., "en-ZA" for South African English) of the bot conversation and is used for speech-to-text and text-to-speech functionality. In addition, you can also integrate VoiceAI Connect with Azure's Custom Neural Voice text-to-speech feature, which allows you to create a customized synthetic voice for your bot. Text-to-speech services also use a selected voice to speak to the user (e.g., female or male). Speech-to-text and text-to-speech services interface with the user using a selected language (e.g., English US, English UK, or German). You can customize various speech features, as discussed in below.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |