Language Identification – Languages
Recognized languages
Languages pre-trained in the default language pack are listed in the table below, each LID generation is a separate column (in the 4th generation we switched to using language tags instead of names):
L4 | L3, XL3 | S2, L2 (deprecated) |
---|---|---|
sq-AL (Albanian) | Albanian | Albanian |
am-ET (Amharic) | Amharic | Amharic |
ar-EG (Arabic_Egypt) | Arabic | |
ar-KW (Arabic_Gulf) | Arabic_Gulf | |
ar-IQ (Arabic_Iraqi) | Arabic_Iraqi | Arabic_Iraqi |
ar-XL (Arabic_Levantine) | Arabic_Levantine | Arabic_Levantine |
ar-MA (Arabic_Maghrebi) | Arabic_Maghrebi | |
arb (Arabic_MSA) | Arabic_MSA | Arabic_MSA |
as-IN (Assamese) | ||
az-AZ (Azerbaijani) | Azerbaijani | Azerbaijani |
bn-BD (Bangla_Bengali) | Bangla_Bengali | Bangla_Bengali |
be-BY (Belarusian) | ||
bg-BG (Bulgarian) | ||
my-MM (Burmese) | Burmese | Burmese |
ceb-PH (Cebuano) | ||
zh-HK (Chinese_Cantonese) | Chinese_Cantonese | Chinese_Cantonese |
zh-CN (Chinese_Mandarin) | Chinese_Mandarin | Chinese_Mandarin |
nan-CN (Chinese_Min_Nan) | Chinese_Dialects | |
wuu-CN (Chinese_Wu) | Chinese_Dialects | |
cv-RU (Chuvash) | ||
Creole | Creole | |
cs-CZ (Czech) | Czech | Czech |
fa-AF (Dari) | Dari | Dari |
nl (Dutch) | ||
en-US (English_American) | English_American | English_American |
en-GB (English_British) | English_British | English_British |
en-IN (English_Indian) | English_Indian | |
fa-IR (Farsi) | Farsi | Farsi |
fr (French) | French | French |
ka-GE (Georgian) | Georgian | Georgian |
de (German) | German | German |
el-GR (Greek) | Greek | Greek |
gn (Guarani) | ||
ht-HT (Haitian_Creole) | ||
ha (Hausa) | Hausa | Hausa |
Hebrew | Hebrew | |
hi-IN (Hindi) | Hindi | Hindi |
hu-HU (Hungarian) | Hungarian | |
id-ID (Indonesian) | Indonesian | Indonesian |
it (Italian) | Italian | Italian |
ja-JP (Japanese) | Japanese | Japanese |
kk-KZ (Kazakh) | ||
km (Khmer) | Khmer | Khmer |
rn-BI (Kirundi_Kinyarwanda) | Kirundi_Kinyarwanda | Kirundi_Kinyarwanda |
ko-KR (Korean) | Korean | Korean |
ku (Kurdish) | ||
lo-LA (Lao) | Lao | Lao |
lt-LT (Lithuanian) | ||
lb-LU (Luxembourgish) | ||
mk-MK (Macedonian) | Macedonian | Macedonian |
nd-ZW (Ndebele) | Ndebele | Ndebele |
om (Oromo) | Afan_Oromo | Afan_Oromo |
ps (Pashto) | Pashto | Pashto |
pl-PL (Polish) | Polish | Polish |
pt (Portuguese) | Portuguese | Portuguese |
pa (Punjabi) | Punjabi | |
ro-RO (Romanian) | ||
ru-RU (Russian) | Russian | Russian |
sh (Serbo-Croat-Bosnian) | Serbian | Serbian |
sh (Serbo-Croat-Bosnian) | Croatian | Croatian |
sh (Serbo-Croat-Bosnian) | Bosnian | Bosnian |
sn (Shona) | Shona | Shona |
sk-SK (Slovak) | Slovak | Slovak |
sl-SI (Slovenian) | ||
so (Somali) | Somali | Somali |
es-XA (Spanish_American) | ||
es-ES (Spanish_European) | Spanish | Spanish |
sw (Swahili) | Swahili | Swahili |
sv-SE (Swedish) | Swedish | |
tl-PH (Tagalog) | Tagalog | |
ta (Tamil) | Tamil | Tamil |
te-IN (Telugu) | ||
th-TH (Thai) | Thai | Thai |
bo (Tibetan) | Tibetan | Tibetan |
ti (Tigrignya) | Tigrigna | Tigrigna |
tpi-PG (Tok_Pisin) | ||
tr-TR (Turkish) | Turkish | Turkish |
uk-UA (Ukrainian) | Ukrainian | Ukrainian |
ur (Urdu) | Urdu | Urdu |
uz-UZ (Uzbek) | Uzbek | Uzbek |
vi-VN (Vietnamese) | Vietnamese | Vietnamese |
zu (Zulu) |