Search: model%20L

58 results

Releases and Changelogs (SPE)

…– the following technologies/models used multiple threads when they should not: STT/KWS – all 6th generation models LID – L4 model DIAR, GID, SID4 – XL4 model SQE – GENERIC model VAD – GENERIC3 model Speech Engine 3.45.6, DB v1701, BSAPI 3.45.7 (2022-04-14) Fixed: Licensing subsystem fails to get license when multiple applications run under different OS user accounts Speech…

Release Notes

…XL4 model (compatibility must be explicitly enabled) Speech Engine: Speech to Text (STT) We have several exciting new features relevant to STT and KWS technologies: Czech (Czech Republic) language model updated (tech. model name: CS_CZ_6): We added new words to the language model, so recent frequent words like “COVID” are correctly transcribed. Slovak (Slovakia) language model updated (tech. model name:…

STT: Language Model Customization tutorial

…STT model, put its name in the model parameter, like this: GET /technologies/stt?path=foobar.wav&model=<customized_model_name> Using customized STT model in command line STT To use customized STT model in command line STT, simply specify the new configuration file belonging to the customized STT model in the -config parameter. For example, assuming that original pl_pl_5 model was customized, specifying updated as the model…

Understand SPE database

…voiceprints – voiceprint data, technology model used to create the voiceprint, speaker model to which the voiceprint belongs (speaker model voiceprints), calibration set to which the voiceprint belongs (FAR calibration set voiceprints) rest_model_sid_calib_voiceprint SID speaker model voiceprints calibrated to FAR – voiceprint data, speaker model, technology model used to create the voiceprint, max. FAR, calibration set used to calibrate the…

LID: Terminology and adaptation

…languageprints created using model L4 can be combined into languageprint archive and/or language model only with languageprints created using model L4… and language pack for model L4 must consist only from language models created using languageprints/archives of model L4. Adaptation types overview Creating new language model from your own audio files, to add new language not supported out-of-the-box at least…

Speech to Text (STT)

…data are similar to desired usage of resulting technology model, which is usually spontaneous speech. However as it is complicated to obtain such amount of data of this type, also other sources are used. Adaptation The technology can be adapted in two levels – in the Acoustic Model or the Language Model. Adapting the Acoustic Model to speakers from a…

SPE and Browser installation: standalone SPE

…nr. 23) 1) Age Estimation [active model: XL5(1x)] 2) Denoiser Technology [active model: EN_US(1x)] 3) Diarization [active model: XL4(1x)] 4) Gender Identification [active model: XL5(1x)] 5) Keyword Spotting [active model: EN_US_6(1x)] 6) Phoneme Recognition [active model: EN_US_6(1x)] 7) Keyword Spotting Stream [active model: EN_US_6(1x)] 8) Language Identification LanguagePrint Comparator [active model: L4(1x)] 9) Language Identification LanguagePrint Extractor [active model: L4(1x)]…

Adding new language or technology model (Browser)

…our example, we are adding new Spanish model (ES_6 technology model) of Speech to Text and Keyword Spotting (with Phoneme Recognizer). When you install new languages or models, they are turned off by default and need to be enabled in Phonexia Browser. To turn new models on, open Phonexia Browser: go to Settings Switch to Speech Engine tab Open STT…

STT: What is Preferred Phrases feature and how to use it

…a decoder. The decoder uses the information from acoustic model, combines it with information from language model recognition network (which describes the statistics about word grouping and sentences of a given language) and provides the transcription output. (See the Speech To Text article for more details about speech transcription principles) When using preferred phrases, we build additional language model…

Phonexia technology models EoL

…4th generation models, typically marked with a number 1, 2, 3 or 4 in the model name. Other technology models (SID, LID, GID, DIAR, AGE, SQE, VAD, DENOISE) Tech. models supported (generation specified by number in “Tech. model name”). Technology Tech. model name Released End of support Maintenance SID4 XL5 2022-09 6th gen. SID 5th gen. SID XL4 2020-03…

Releases and Changelogs (Browser)

…editor now distinguish KWS/Diar technology models (it is possible to open results for more models at once) [#4979] SID models status indication [#4979] User can prepare SID model/group by context menu [#4980] Show speech length for speaker models [#5041] Fixed processing a lot of files in SID evaluation cause application crash Phonexia Browser v3.8.2, BSAPI 3.12.0 – Jun 29 2017…

Understand SPE directory structure

…with data for XL3 and L4 models – STT with data for 5th generation English language and 6th generation Czech language gid stt lid ├── data ├── data ├── data │ ├── l4 │ ├── models_cs_cz_6 │ ├── l4 │ └── xl3 │ └── models_en_us_5 │ └── xl3 ├── example ├── example ├── example │ ├── l4 │ ├── cs_cz_6…

Understand SPE executable files

…that all the technologies/models are available in that SPE installation, this command adds(*) the following to the technologies configuration file: SIDE_STREAM for both L3 and XL3 model, 3 instances of each SIDC_STREAM for both L3 and XL3 model, 3 instances of each SID4E_STREAM for both L4 and XL4 model, 1 instance of each SID4C_STREAM for both L4 and XL4 model,…

Understand SPE technologies configuration file

…of XL4 model <?xml version=”1.0″?> <technology_subsystem_settings> <technologies> <item> <name>STT</name> <models> <item> <name>SK_SK_5</name> <n_instances>8</n_instances> <config_file /> </item> </models> </item> <item> <name>STT_STREAM</name> <models> <item> <name>CS_CZ_6</name> <n_instances>2</n_instances> <config_file /> </item> </models> </item> <item> <name>SID4E</name> <models> <item> <name>L4</name> <n_instances>2</n_instances> <config_file /> </item> <item> <name>XL4</name> <n_instances>3</n_instances> <config_file /> </item> </models> </item> <item> <name>SID4C</name> <models> <item> <name>L4</name> <n_instances>2</n_instances> <config_file /> </item> <item> <name>XL4</name> <n_instances>3</n_instances> <config_file />…

Support Lifecycle Policy (PSP)

…3 or 4 in the model name. Other technology models (SID, LID, GID, DIAR, AGE, SQE, VAD, DENOISE) Tech. models supported (generation specified by number in “Tech. model name”). Technology Tech. model name Released End of support Maintenance SID4 XL5 2022-09 6th gen. SID 5th gen. SID XL4 2020-03 6th gen. SID 5th gen. SID L4 2019-02 6th gen….