Skip to main content

Supported languages

Transcription:Batch Real-Time Deployments:All

This page lists the range of languages supported by Speechmatics. For more information on how to use these, please refer to the guide on Accuracy and Language

info

To automatically identify the language in an audio file, use our Language Identification feature.

Languages

Speechmatics supports the following languages. Your ability to use any or all of the languages will depend on what languages you are contracted to use.

Speechmatics takes a global-first approach to our languages. In a single language pack, we aim to support many different accents and dialects. This simplifies your workflow when selecting which language to use, not requiring you to know which accent is being spoken in your audio up-front. With this approach we still achieve very high accuracy compared to accent specific language packs.

LanguageLanguage Code
Arabicar
Bashkirba
Basqueeu
Belarusianbe
Bulgarianbg
Cantoneseyue
Catalanca
Croatianhr
Czechcs
Danishda
Dutchnl
Englishen
Esperantoeo
Estonianet
Finnishfi
Frenchfr
Galiciangl
Germande
Greekel
Hindihi
Hungarianhu
Interlinguaia
Italianit
Indonesianid
Japaneseja
Koreanko
Latvianlv
Lithuanianlt
Malayms
Mandarincmn
Marathimr
Mongolianmn
Norwegianno
Polishpl
Portuguesept
Romanianro
Russianru
Slovakiansk
Sloveniansl
Spanishes
Swedishsv
Tamilta
Thaith
Turkishtr
Uyghurug
Ukrainianuk
Vietnamesevi
Welshcy

Please note any languages outside this list are not explicitly supported. Only one language can be processed within each request. Each language above has a two-letter code (ISO639-1) or three-letter code (ISO639-3) that must be provided for any transcription request.

Domain Language

The Speechmatics SaaS also supports specialized language packs that enhance the requested transcription language with optimization for a particular field. This is particularly useful for increasing the accuracy for domains that have specific terminology. The domain packs build on our global languages to give the best accuracy.

DomainSupported languagesDescription
FinanceenImprove accuracy for audio containing financial terms such as those found in earnings calls or financial broadcast

Translation languages

Translation is supported for the majority of Speechmatics' languages. The supported translation pairs are listed below. For more details, see Translation.

Audio LanguageTranslation Target Language
English (en)Bulgarian (bg), Catalan (ca), Mandarin (cmn), Czech (cs), Danish (da), German (de), Greek (el), Spanish (es), Estonian (et), Finnish (fi), French (fr), Galician (gl), Hindi (hi), Croatian (hr), Hungarian (hu), Indonesian (id), Italian (it), Japanese (ja), Korean (ko), Lithuanian (lt), Latvian (lv), Malay (ms), Dutch (nl), Norwegian (no), Polish (pl), Portuguese (pt), Romanian (ro), Russian (ru), Slovakian (sk), Slovenian (sl), Swedish (sv), Turkish (tr), Ukrainian (uk), Vietnamese (vi)
Bulgarian (bg), Catalan (ca), Mandarin (cmn), Czech (cs), Danish (da), German (de), Greek (el), Spanish (es), Estonian (et), Finnish (fi), French (fr), Galician (gl), Hindi (hi), Croatian (hr), Hungarian (hu), Indonesian (id), Italian (it), Japanese (ja), Korean (ko), Lithuanian (lt), Latvian (lv), Malay (ms), Dutch (nl), Norwegian (no), Polish (pl), Portuguese (pt), Romanian (ro), Russian (ru), Slovakian (sk), Slovenian (sl), Swedish (sv), Turkish (tr), Ukrainian (uk), Vietnamese (vi)English (en)
Norwegian Bokmål (no)Norwegian Nynorsk (nn)

Currently unsupported Speechmatics languages: Arabic, Bashkir, Belarusian, Welsh, Esperanto, Basque, Interlingua, Mongolian, Marathi, Tamil, Thai, Uyghur, Cantonese.