Skip to content Skip to main navigation Skip to footer

Search: model L

58 results

Releases and Changelogs (SPE)

…need for command line tools… finally!) New: LID Language Packs allow to store meta-files New: New entity “LID Language Model” (equivalent of *.lpa LanguagePrint Archive) Improved: Updated STT model RU_RU_A to version 4.6.0 of (updated language model) Removed: Support for RLS-enforced licences in command line applications Removed: FeaturePasterRepeat warning on null/empty repeat vector Speech Engine 3.37 Speech Engine 3.37.1, DB…

Release Notes

…XL4 model (compatibility must be explicitly enabled) Speech Engine: Speech to Text (STT) We have several exciting new features relevant to STT and KWS technologies: Czech (Czech Republic) language model updated (tech. model name: CS_CZ_6): We added new words to the language model, so recent frequent words like “COVID” are correctly transcribed. Slovak (Slovakia) language model updated (tech. model name:…

LID: Terminology and adaptation

…use absolute paths paths lead to the correct technological model directory of your choice (l4, l3, xl3, …) Example below assumes that the listfile will be saved to {SPE} directory (hence the relative paths to bsapi/…) and also assumes the “L4” model. You should reflect your setup accordingly. cs-CZ bsapi/lid/lprints/l4/cs-CZ.lpa pl-PL bsapi/lid/lprints/l4/pl-PL.lpa en-GB bsapi/lid/lprints/l4/en-GB.lpa ru-RU bsapi/lid/lprints/l4/ru-RU.lpa MyLanguage bsapi/lid/lprints/l4/MyLanguage.lpa MyOtherLanguage bsapi/lid/lprints/l4/MyOtherLanguage.lpa…

STT: Language Model Customization tutorial

…STT model, put its name in the model parameter, like this: GET /technologies/stt?path=foobar.wav&model=<customized_model_name> Using customized STT model in command line STT To use customized STT model in command line STT, simply specify the new configuration file belonging to the customized STT model in the -config parameter. For example, assuming that original pl_pl_5 model was customized, specifying updated as the model

Understand SPE database

…user), technology model used to create the profile, file with the profile content, hash rest_profile_sid4_metafiles list of files used as SID4 Audio Source Profiles metafiles rest_model_lid list of LID language packs – name, owner (SPE user), technology model to which the language pack belongs (i.e. technology model used to create source languageprints/language models) rest_model_lid_metafiles list of LID language packs metafiles…

Speech to Text (STT)

…data are similar to desired usage of resulting technology model, which is usually spontaneous speech. However as it is complicated to obtain such amount of data of this type, also other sources are used. Adaptation The technology can be adapted in two levels – in the Acoustic Model or the Language Model. Adapting the Acoustic Model to speakers from a…

Understand SPE directory structure

…for individual models settings BSAPI configuration files (*.bs) and optionally manually created user configs (*.bs.usr) There is one exception – LID – which has additional two directories containing pre-built languageprint archives (*.lpa) and language packs: lprints and models. Schemes below show examples of directories for GID (Gender Identification), STT (Speech To Text) and LID (Language Identification): – GID and LID…

SPE and Browser installation: standalone SPE

…nr. 23) 1) Age Estimation [active model: XL5(1x)] 2) Denoiser Technology [active model: EN_US(1x)] 3) Diarization [active model: XL4(1x)] 4) Gender Identification [active model: XL5(1x)] 5) Keyword Spotting [active model: EN_US_6(1x)] 6) Phoneme Recognition [active model: EN_US_6(1x)] 7) Keyword Spotting Stream [active model: EN_US_6(1x)] 8) Language Identification LanguagePrint Comparator [active model: L4(1x)] 9) Language Identification LanguagePrint Extractor [active model: L4(1x)]…

Adding new language or technology model (Browser)

…our example, we are adding new Spanish model (ES_6 technology model) of Speech to Text and Keyword Spotting (with Phoneme Recognizer). When you install new languages or models, they are turned off by default and need to be enabled in Phonexia Browser. To turn new models on, open Phonexia Browser: go to Settings Switch to Speech Engine tab Open STT…

STT: What is Preferred Phrases feature and how to use it

…to use preferred phrases containing such ‘unknown words’, it’s necessary to add these words to the language model first, using LMC – see STT Language Model Customization tutorial then perform the transcription using the customized STT model, specifying the preferred phrases in the POST /technologies/stt or POST /technologies/stt/input_stream REST call. Note: The REST call body does not allow specifying custom…

Phonexia technology models EoL

…6th gen. SID 5th gen. SID L4 2019-02 6th gen. SID 5th gen. SID LID L4 2020-10 6th gen. LID 5th gen. LID XL3 2015-07 5th gen. LID 4th gen. LID L3 2015-07 5th gen. LID 4th gen. LID GID XL4 2021-10 6th gen. GID 5th gen. GID L4 2019-06 6th gen. GID 5th gen. GID XL3 (XL1) 2016-09 5th…

Releases and Changelogs (Browser)

…in SID evaluation wizard [#87] Added information dialog during license checking [#93] Browser could be started in different working directory (CWD) [#95] Added server load and license information to context menu of localhost [#99] Fixed pass correct license file path to embedded SPE if license is not in the Browser’s directory [#105] Fixed filtering by recording length and speech length…

Understand SPE executable files

…that all the technologies/models are available in that SPE installation, this command adds(*) the following to the technologies configuration file: SIDE_STREAM for both L3 and XL3 model, 3 instances of each SIDC_STREAM for both L3 and XL3 model, 3 instances of each SID4E_STREAM for both L4 and XL4 model, 1 instance of each SID4C_STREAM for both L4 and XL4 model,…

Understand SPE technologies configuration file

…technologies.xml file containing the following setup: STT (Speech To Text) with 8 instances of SK_SK_5 model STT_STREAM (Speech To Text for stream processing) with 2 instances of CS_CZ_6 model SID4E (Speaker Identification 4 Voiceprint Extractor) with 2 instances of L4 model 3 instances of XL4 model SID4C (Speaker Identification 4 Voiceprint Comparator) with 2 instances of L4 model 3 instances…

Support Lifecycle Policy (PSP)

…SID 5th gen. SID LID L4 2020-10 6th gen. LID 5th gen. LID XL3 2015-07 5th gen. LID 4th gen. LID L3 2015-07 5th gen. LID 4th gen. LID GID XL4 2021-10 6th gen. GID 5th gen. GID L4 2019-06 6th gen. GID 5th gen. GID XL3 (XL1) 2016-09 5th gen. GID 4th gen. GID AGE L4 2019-06 6th gen….