Yandex project
© 2023 Yandex.Cloud LLC
Yandex SpeechKit

Supported languages and recognition models

Written by Yandex Cloud, improved by amatol
  • Version tags
  • Supported recognition languages
  • Recognition accuracy

A recognition model is a model that is trained to recognize speech in a specific language. The models are trained on datasets generated by Yandex services and applications. This allows us to continually improve speech recognition quality.

The main supported model for each type of recognition is the general model. It recognizes speech on any topic in a given language, including short and long utterances, names, addresses, dates, and numbers.

Version tags

Three versions of the general model are available at the same time. You can select the desired version by tag:

  • general: The main version of the model.
  • general:rc: A release candidate version that you can test.
  • general:deprecated: The previous version of the model.

Note

Versions available under the general:deprecated tag are no longer supported once new models are released. SpeechKit guarantees two weeks of support for the previous version after the version under the general tag is updated. For the list of updates, see Recognition releases.

In addition, the deferred-general tag is available for asynchronous recognition. Learn more about asynchronous recognition modes.
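As a rough illustration of the tags above, here is a minimal client-side sketch that checks a model tag before it is sent with a request. The function name validate_model_tag and the asynchronous flag are hypothetical helpers introduced for this example and are not part of the SpeechKit API. The only facts taken from this page are the four tag values and that deferred-general applies to asynchronous recognition.

```python
# Tags listed on this page for the general model.
GENERAL_TAGS = {"general", "general:rc", "general:deprecated"}
# Tag that this page describes as available for asynchronous recognition.
ASYNC_ONLY_TAGS = {"deferred-general"}


def validate_model_tag(tag: str, asynchronous: bool = False) -> str:
    """Return the tag unchanged if it is allowed for the recognition mode.

    Hypothetical helper: raises ValueError for unknown tags, or for
    deferred-general when the request is not asynchronous.
    """
    allowed = GENERAL_TAGS | (ASYNC_ONLY_TAGS if asynchronous else set())
    if tag not in allowed:
        raise ValueError(f"Unsupported model tag for this mode: {tag!r}")
    return tag
```

For example, validate_model_tag("general:rc") returns the tag as is, while validate_model_tag("deferred-general") raises an error unless asynchronous=True is passed.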

Supported recognition languages

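Specify the language using one of the codes from the table below. Each code pairs a two-letter ISO 639-1 language code with a region suffix (for example, ru-RU); auto enables automatic language recognition. Language codes are case insensitive.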

Code Language
auto Automatic language recognition
de-DE German
en-US English
es-ES Spanish
fi-FI Finnish
fr-FR French
it-IT Italian
kk-KK Kazakh
nl-NL Dutch
pl-PL Polish
pt-PT Portuguese
pt-BR Brazilian Portuguese (only available in general:rc)
ru-RU Russian (default)
sv-SE Swedish
tr-TR Turkish
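Because language codes are case insensitive, a client may want to normalize user input to the spelling used in the table and catch unsupported codes early. The following is a minimal sketch of such a check. The names SUPPORTED_LANGUAGES, RC_ONLY_LANGUAGES, and normalize_language are hypothetical and only mirror the table above; they are not part of the SpeechKit API, and the service itself does not require this normalization.

```python
# Codes copied from the table on this page.
SUPPORTED_LANGUAGES = {
    "auto": "Automatic language recognition",
    "de-DE": "German",
    "en-US": "English",
    "es-ES": "Spanish",
    "fi-FI": "Finnish",
    "fr-FR": "French",
    "it-IT": "Italian",
    "kk-KK": "Kazakh",
    "nl-NL": "Dutch",
    "pl-PL": "Polish",
    "pt-PT": "Portuguese",
    "pt-BR": "Brazilian Portuguese",
    "ru-RU": "Russian",
    "sv-SE": "Swedish",
    "tr-TR": "Turkish",
}
# Languages the table marks as available only in general:rc.
RC_ONLY_LANGUAGES = {"pt-BR"}

# Lowercase lookup so that input in any letter case matches.
_CANONICAL = {code.lower(): code for code in SUPPORTED_LANGUAGES}


def normalize_language(code: str, model_tag: str = "general") -> str:
    """Return the table spelling of a language code, or raise ValueError."""
    canonical = _CANONICAL.get(code.strip().lower())
    if canonical is None:
        raise ValueError(f"Unsupported language code: {code!r}")
    if canonical in RC_ONLY_LANGUAGES and model_tag != "general:rc":
        raise ValueError(f"{canonical} is only available in general:rc")
    return canonical
```

With this sketch, normalize_language("RU-ru") yields ru-RU, and normalize_language("pt-br") is rejected unless model_tag="general:rc" is passed.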

Recognition accuracy

In Yandex DataSphere, you can assess the recognition quality of a SpeechKit model yourself using your own data.

See also

  • Supported audio formats
  • Supported languages and recognition models
  • Streaming speech recognition
  • Synchronous audio recognition
  • Asynchronous recognition
  • Extending a speech recognition model
