Text markup for speech synthesis
Written by
Updated at May 12, 2023
You can control pronunciation during speech synthesis by marking up text to synthesize. Yandex SpeechKit fully supports markup for texts in Russian only. Some pronunciation control features are also supported for other languages.
For Russian and Kazakh, Yandex SpeechKit supports the synthesis of normalized text:
- Abbreviations do not need to be represented phonetically.
- Numbers can be written as Arabic numerals. During speech synthesis, they are converted into numbers pronounced as words.
Note
SpeechKit is designed for natural speech synthesis. Marking up data for speech synthesis helps set up pronunciation of separate words, phrases, and sentences. However, it is not intended for generating separate sounds and silence.
In SpeechKit, there are two markup formats: