Supported languages
Transcription: Batch, Real-Time. Deployments: All.
This page lists the languages supported by Speechmatics. For more information on how to use them, please refer to the guide on Accuracy and Language.
To automatically identify the language in an audio file, use our Language Identification feature.
Languages
Speechmatics supports the following languages. Which of them you can use depends on the languages covered by your contract.
Speechmatics takes a global-first approach to languages. Each language pack aims to support many different accents and dialects. This simplifies language selection, because you do not need to know in advance which accent is spoken in your audio. With this approach we still achieve very high accuracy compared to accent-specific language packs.
Language | Language Code |
---|---|
Arabic | ar |
Bashkir | ba |
Basque | eu |
Belarusian | be |
Bulgarian | bg |
Cantonese | yue |
Catalan | ca |
Croatian | hr |
Czech | cs |
Danish | da |
Dutch | nl |
English | en |
Esperanto | eo |
Estonian | et |
Finnish | fi |
French | fr |
Galician | gl |
German | de |
Greek | el |
Hindi | hi |
Hungarian | hu |
Indonesian | id |
Interlingua | ia |
Italian | it |
Japanese | ja |
Korean | ko |
Latvian | lv |
Lithuanian | lt |
Malay | ms |
Mandarin | cmn |
Marathi | mr |
Mongolian | mn |
Norwegian | no |
Polish | pl |
Portuguese | pt |
Romanian | ro |
Russian | ru |
Slovakian | sk |
Slovenian | sl |
Spanish | es |
Swedish | sv |
Tamil | ta |
Thai | th |
Turkish | tr |
Ukrainian | uk |
Uyghur | ug |
Vietnamese | vi |
Welsh | cy |
Please note that languages outside this list are not explicitly supported. Each request can process only one language. Each language above has a two-letter (ISO 639-1) or three-letter (ISO 639-3) code, which must be provided in every transcription request.
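
As a minimal sketch of the rule above, the following Python checks a language code against the table before building a request. The helper name `build_transcription_config` is hypothetical, and the config layout (`type`, `transcription_config`, `language`) is an assumption for illustration; check the API reference for the exact schema.

```python
# Language codes copied from the table above.
SUPPORTED_LANGUAGES = {
    "ar", "ba", "eu", "be", "bg", "yue", "ca", "hr", "cs", "da", "nl", "en",
    "eo", "et", "fi", "fr", "gl", "de", "el", "hi", "hu", "ia", "it", "id",
    "ja", "ko", "lv", "lt", "ms", "cmn", "mr", "mn", "no", "pl", "pt", "ro",
    "ru", "sk", "sl", "es", "sv", "ta", "th", "tr", "ug", "uk", "vi", "cy",
}


def build_transcription_config(language: str) -> dict:
    """Return a request config for exactly one supported language.

    Hypothetical helper: the dictionary layout is an assumed config shape.
    """
    code = language.strip().lower()
    if code not in SUPPORTED_LANGUAGES:
        raise ValueError(f"Unsupported language code: {language!r}")
    # One language per request, so the config carries a single code.
    return {"type": "transcription", "transcription_config": {"language": code}}
```

Rejecting unknown codes on the client side gives an early, readable error instead of a failed request.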
Domain Language
The Speechmatics SaaS also supports specialized language packs that optimize the requested transcription language for a particular field. They are particularly useful for improving accuracy in domains with specific terminology. Domain packs build on our global languages to give the best accuracy.
Domain | Supported languages | Description |
---|---|---|
Finance | en | Improves accuracy for audio containing financial terms, such as those found in earnings calls or financial broadcasts |
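
The domain table can be checked in code before a request is sent. This sketch assumes the domain is requested through a `domain` field next to `language` in the transcription config; that field name, the value `"finance"`, and the helper `with_domain` are assumptions for illustration and are not confirmed by this page.

```python
# Domain packs and the languages they support, from the table above.
DOMAIN_LANGUAGES = {"finance": {"en"}}


def with_domain(language: str, domain: str) -> dict:
    """Return transcription settings for a language and domain pack.

    Hypothetical helper: the "domain" key is an assumed field name.
    """
    supported = DOMAIN_LANGUAGES.get(domain)
    if supported is None:
        raise ValueError(f"Unknown domain: {domain!r}")
    if language not in supported:
        raise ValueError(f"Domain {domain!r} does not support {language!r}")
    return {"language": language, "domain": domain}
```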
Translation languages
Translation is supported for the majority of Speechmatics' languages. The supported translation pairs are listed below. For more details, see Translation.
Audio Language | Translation Target Language |
---|---|
English (en) | Bulgarian (bg), Catalan (ca), Mandarin (cmn), Czech (cs), Danish (da), German (de), Greek (el), Spanish (es), Estonian (et), Finnish (fi), French (fr), Galician (gl), Hindi (hi), Croatian (hr), Hungarian (hu), Indonesian (id), Italian (it), Japanese (ja), Korean (ko), Lithuanian (lt), Latvian (lv), Malay (ms), Dutch (nl), Norwegian (no), Polish (pl), Portuguese (pt), Romanian (ro), Russian (ru), Slovakian (sk), Slovenian (sl), Swedish (sv), Turkish (tr), Ukrainian (uk), Vietnamese (vi) |
Bulgarian (bg), Catalan (ca), Mandarin (cmn), Czech (cs), Danish (da), German (de), Greek (el), Spanish (es), Estonian (et), Finnish (fi), French (fr), Galician (gl), Hindi (hi), Croatian (hr), Hungarian (hu), Indonesian (id), Italian (it), Japanese (ja), Korean (ko), Lithuanian (lt), Latvian (lv), Malay (ms), Dutch (nl), Norwegian (no), Polish (pl), Portuguese (pt), Romanian (ro), Russian (ru), Slovakian (sk), Slovenian (sl), Swedish (sv), Turkish (tr), Ukrainian (uk), Vietnamese (vi) | English (en) |
Norwegian Bokmål (no) | Norwegian Nynorsk (nn) |
Translation is not currently supported for the following Speechmatics languages: Arabic, Bashkir, Belarusian, Welsh, Esperanto, Basque, Interlingua, Mongolian, Marathi, Tamil, Thai, Uyghur, Cantonese.
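
The translation pairs in the table reduce to three rules: English to a listed language, a listed language to English, and Norwegian Bokmål to Norwegian Nynorsk. A minimal sketch of that lookup follows; the function name `can_translate` is hypothetical and is not part of any Speechmatics SDK.

```python
# Languages that translate to and from English, from the table above.
ENGLISH_PAIRED = {
    "bg", "ca", "cmn", "cs", "da", "de", "el", "es", "et", "fi", "fr", "gl",
    "hi", "hr", "hu", "id", "it", "ja", "ko", "lt", "lv", "ms", "nl", "no",
    "pl", "pt", "ro", "ru", "sk", "sl", "sv", "tr", "uk", "vi",
}


def can_translate(source: str, target: str) -> bool:
    """Return True if the table lists source -> target as a translation pair."""
    if source == "en":
        return target in ENGLISH_PAIRED
    if target == "en":
        return source in ENGLISH_PAIRED
    # The only pair that does not involve English.
    return (source, target) == ("no", "nn")
```

For example, `can_translate("de", "fr")` is false, because the table lists no direct pair between two non-English languages other than Bokmål to Nynorsk.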