Search: language

60 results

Language Identification (LID)

…Routing particular calls (languages) to human operators (language experts). Scoring and results: The LID language pack defines a set of recognizable languages (represented by language models). When identifying the language of an audio recording (or languageprint), LID does the following: (1) creates a languageprint of the recording (if the input is an audio recording); (2) compares that languageprint with each language model in a…

LID: Terminology and adaptation

…your audio files using the GET /technologies/languageid/extractlp endpoint. Create a new (as yet empty) language model using the POST /technologies/languageid/languagemodels/{name} endpoint. Upload a languageprint or languageprint archive file to the language model using the POST /technologies/languageid/languagemodels/{name}/file endpoint. Repeat this upload for all necessary files; e.g. when creating a completely new language model from your own audio files, this would be hundreds or thousands of files (see…
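
The order of calls in the snippet above can be sketched as a small planning helper. This is only an illustration: the three endpoint paths come from the snippet, while the function name, the `note` strings, and the example file names are hypothetical. How audio or languageprint data is actually attached to each request is not shown in the snippet, so the sketch lists the calls without sending anything.

```python
from urllib.parse import quote

# Endpoint paths as quoted in the search snippet above.
EXTRACT_LP = "/technologies/languageid/extractlp"
LANGUAGE_MODELS = "/technologies/languageid/languagemodels"


def plan_adaptation_requests(model_name, audio_files):
    """Return ordered (method, path, note) steps; hypothetical helper, sends nothing."""
    name = quote(model_name, safe="")  # make the model name safe for use in a URL path
    steps = []
    # 1. Extract a languageprint from each audio file.
    for audio in audio_files:
        steps.append(("GET", EXTRACT_LP, f"extract languageprint from {audio}"))
    # 2. Create the new, still empty, language model.
    steps.append(("POST", f"{LANGUAGE_MODELS}/{name}", "create empty language model"))
    # 3. Upload each languageprint to the model, one request per file.
    for audio in audio_files:
        steps.append(
            ("POST", f"{LANGUAGE_MODELS}/{name}/file", f"upload languageprint of {audio}")
        )
    return steps


if __name__ == "__main__":
    for method, path, note in plan_adaptation_requests("my model", ["a.wav", "b.wav"]):
        print(method, path, "#", note)
```

With hundreds or thousands of files, step 3 dominates the plan, which is presumably why the snippet also mentions uploading a languageprint archive file instead of individual languageprints.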

Language Identification – Languages

Recognized languages. Languages pre-trained in the default language pack are listed in the table below; each LID generation is a separate column (in the 4th generation we switched to using language tags instead of names):
L4 | L3, XL3 | S2, L2 (deprecated)
sq-AL Albanian | Albanian | Albanian
am-ET Amharic | Amharic | Amharic
ar-EG Arabic (Egypt) | Arabic |
ar-KW Arabic (Gulf, Kuwait) | Arabic_Gulf | …

STT: Language Model Customization tutorial

The Language Model Customization tool (LMC) provides a way to improve Speech To Text performance by creating a customized language model. The language model is an important part of Phonexia Speech To Text. In simplified terms, it can be imagined as a large dictionary with multiple statistics. The Speech To Text technology uses this dictionary and statistical model to convert audio…

STT: Adding words to language model on the fly

…following internal linguistic rules for the given STT language. Still, the automatically generated pronunciation may not be in line with expectations, especially for foreign words (due to pronunciation differences between the word’s native language and the STT language). Therefore, it is recommended to define the pronunciations explicitly, to help prevent mistranscriptions caused by incorrectly generated default pronunciations. It is also…

Speech To Text / Keyword Spotting supported languages

Languages supported by Speech To Text and Keyword Spotting
Standard = Maintained until newer generation is released, or end of support is reached. Language generation is specified by the number in “Model name”.
Language (region) | Model name | Released | End of support | Maintenance
Arabic (Gulf, Kuwait) | AR_KW_6 | 2022-04 | 8th gen. | Standard
Arabic (Levantine) | AR_XL_6 | 2021-05 | 8th gen. | Standard
 | AR_XL_5 | 2020-08 | 7th…

Adding new language or technology model (Browser)

…Installation packages with new language models are custom-created and provided to you by your assigned Phonexia contact. As the language models are version-dependent, we need to know the exact version of the Speech Engine you have. The version of the installation package must match the version of your SPE. How to check SPE version: In the Phonexia Browser, right-click…

Q: What languages do you offer?

It depends on the technology. Phonexia Language Identification (LID) is pre-trained for 60+ languages. Phonexia Keyword Spotting (KWS) and Phonexia Speech Transcription (STT) are available for 20+ languages, including English, French, German, Russian, Spanish and many more…

Releases and Changelogs (SPE)

…LID language pack, hash of the file contained in the custom language pack report is incorrectly calculated (occurs mainly in Windows). Fixed: Items builtin_language_models and custom_language_models in the body of POST /technologies/languageid/languagepacks/{name} are now optional. At least one of them must not be empty. Fixed: Better server response message when a language model was not found during creation of a new LID…
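
The rule in the changelog entry above (both items optional, at least one non-empty) can be checked on the client side before the request is sent. The sketch below is a minimal illustration, not the server's actual validation: the two key names come from the snippet, while the function name and the assumption that each item is a list of language model names are hypothetical.

```python
def validate_language_pack_body(body):
    """Check the 'at least one non-empty' rule; hypothetical client-side helper."""
    # Both items are optional, so a missing key is treated like an empty list.
    # Assumption: each item holds a list of language model names.
    builtin = list(body.get("builtin_language_models") or [])
    custom = list(body.get("custom_language_models") or [])
    if not builtin and not custom:
        raise ValueError(
            "at least one of builtin_language_models and "
            "custom_language_models must not be empty"
        )
    return {
        "builtin_language_models": builtin,
        "custom_language_models": custom,
    }
```

Normalizing a missing item to an empty list keeps the returned body uniform, so later code does not have to distinguish between "absent" and "empty".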