Skip to content Skip to main navigation Skip to footer

Search: model customization

9 results

STT: Language Model Customization tutorial

…STT model, put its name in the model parameter, like this: GET /technologies/stt?path=foobar.wav&model=<customized_model_name> Using customized STT model in command line STT To use customized STT model in command line STT, simply specify the new configuration file belonging to the customized STT model in the -config parameter. For example, assuming that original pl_pl_5 model was customized, specifying updated as the model

Releases and Changelogs (SPE)

…– the following technologies/models used multiple threads when they should not: STT/KWS – all 6th generation models LID – L4 model DIAR, GID, SID4 – XL4 model SQE – GENERIC model VAD – GENERIC3 model Speech Engine 3.45.6, DB v1701, BSAPI 3.45.7 (2022-04-14) Fixed: Licensing subsystem fails to get license when multiple applications run under different OS user accounts Speech…

Release Notes

…XL4 model (compatibility must be explicitly enabled) Speech Engine: Speech to Text (STT) We have several exciting new features relevant to STT and KWS technologies: Czech (Czech Republic) language model updated (tech. model name: CS_CZ_6): We added new words to the language model, so recent frequent words like “COVID” are correctly transcribed. Slovak (Slovakia) language model updated (tech. model name:…

STT: What is Preferred Phrases feature and how to use it

…to use preferred phrases containing such ‘unknown words’, it’s necessary to add these words to the language model first, using LMC – see STT Language Model Customization tutorial then perform the transcription using the customized STT model, specifying the preferred phrases in the POST /technologies/stt or POST /technologies/stt/input_stream REST call. Note: The REST call body does not allow specifying custom…

Understand SPE directory structure

…with data for XL3 and L4 models – STT with data for 5th generation English language and 6th generation Czech language gid stt lid ├── data ├── data ├── data │ ├── l4 │ ├── models_cs_cz_6 │ ├── l4 │ └── xl3 │ └── models_en_us_5 │ └── xl3 ├── example ├── example ├── example │ ├── l4 │ ├── cs_cz_6…

Understand SPE technologies configuration file

…of XL4 model <?xml version=”1.0″?> <technology_subsystem_settings> <technologies> <item> <name>STT</name> <models> <item> <name>SK_SK_5</name> <n_instances>8</n_instances> <config_file /> </item> </models> </item> <item> <name>STT_STREAM</name> <models> <item> <name>CS_CZ_6</name> <n_instances>2</n_instances> <config_file /> </item> </models> </item> <item> <name>SID4E</name> <models> <item> <name>L4</name> <n_instances>2</n_instances> <config_file /> </item> <item> <name>XL4</name> <n_instances>3</n_instances> <config_file /> </item> </models> </item> <item> <name>SID4C</name> <models> <item> <name>L4</name> <n_instances>2</n_instances> <config_file /> </item> <item> <name>XL4</name> <n_instances>3</n_instances> <config_file />

FAQs (PSP)

…details, see KWS technology documentation. in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: What languages are supported by STT? A: Please see List of supported STT Languages. For more details, see STT technology documentation. in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: Can I add words into dictionary? A: Yes, you can use Language Model Customization (LMC). For…

Waveform Denoiser (DENOISER)

Phonexia Waveform Denoiser (DENOISER) ensures automatic dereverberation (removal of echoes caused by sound in the rooms) and automatic noise reduction of the speech signal. The data model is usually trained for various types of noise using the latest generation of algorithms based on neural networks. Automatically removed are mainly noises similar to those that was software trained on. Conversely, the…