
Search: kws

29 results

Releases and Changelogs (SPE)

…of second channel was shifted [#4543] Fixed KWS Pashto needs Dutch data [#4378] Fixed STT ES_AMER1 may return empty transcription [#4377] Updated models STT RU_RU, RU_RU_FIN, RU_RU_A to 4.0.0 [#4306] Updated models STT CS_CZ, CS_CZ_FIN, CS_CZ_ENERGY, CS_CZ_TELCO, CS_CZ_IT to 4.0.0 [#4305] Updated KWS DE_DE model to version 3.0.0 [#4377] Updated KWS RU_RU model to version 4.0.0 [#4306] Updated KWS CS_CZ…

Keyword Spotting (KWS)

Phonexia Keyword Spotting (KWS) identifies occurrences of keywords and/or keyphrases in audio recordings. It can help you to get valuable information from huge quantities of speech recordings. You only need to specify the keywords or phrases you wish to find. This technology identifies all recordings with keyword occurrences and allows you to automatically route important recordings or calls to your…

KWS: Results explained

…sheet demonstrating the sigmoid function: Score-to-Confidence. Score-to-confidence conversion tuning: Starting with SPE/BSAPI 3.24 (October 2019), it’s possible to modify the confidence calculation using the confidence_shift and confidence_sharpness values in the [score_calib:SKeywordScoreCalibrationI] section of the user configuration file. The user configuration file must have the same name as the original configuration file, with an added .usr extension, e.g. kws_en_us_5.bs.usr – see the What is a user configuration…
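To give a feel for how a shift and a sharpness value can reshape a sigmoid-based confidence, here is a minimal sketch. It assumes a generic logistic form, with the shift moving the midpoint and the sharpness scaling the slope. The function name, the default values, and the formula itself are illustrative assumptions; the snippet above does not state the formula Phonexia actually uses.

```python
import math


def score_to_confidence(score, shift=0.0, sharpness=1.0):
    """Map a raw score to a confidence in [0, 1] with a logistic curve.

    ASSUMPTION: generic logistic form 1 / (1 + exp(-sharpness * (score - shift))).
    This is not taken from Phonexia documentation; it only illustrates the idea
    of a shift (midpoint) and a sharpness (slope) parameter.
    """
    x = sharpness * (score - shift)
    # Numerically stable evaluation: avoid exp() of a large positive number.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


if __name__ == "__main__":
    for s in (-2.0, 0.0, 2.0):
        print(s, round(score_to_confidence(s, shift=0.0, sharpness=1.5), 3))
```

Under this assumed form, raising the shift means a higher score is needed to reach the 0.5 midpoint, and raising the sharpness makes the transition around that midpoint steeper.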

Release Notes

…on Windows under certain circumstances (since 3.59.0) Fixed issues with starting technologies in Speech Engine on Windows from UNC path (since 3.58.0) Speech Engine: Speech To Text (STT) and Keyword Spotting (KWS) New and updated STT and KWS languages Bengali model BN_6 added in 3.55.1 with word accuracy (WAcc) up to 55 % Kazakh model KK_KZ_6 added in 3.55.1 with…

Understand SPE database

rest_model_kws KWS keyword lists – keyword list JSON data, keyword list name, owner (SPE user), technology model to which the keyword list belongs Processing results data Tables containing cached processing results (if results caching is enabled): rest_result_age AGE processing results – file, used technology model, results JSON data rest_result_diar DIAR processing results – file, used technology model, used processing parameters,…

Phoneme Recogniser (PHNREC)

…Recogniser is delivered as part of Keyword Spotting (KWS) technology. It can also be used without KWS technology. Typical use cases "search-in-speech" – search for specific information in large call archives (e.g., claims inspection), get custom based pronunciation of word or phrase as customized keyword in keyword spotting technology (better accuracy of KWS technology), get custom based pronunciation of word…

Q: What languages do you offer?

It depends on the technology. Phonexia Language Identification (LID) is pre-trained for 60+ languages. Phonexia Keyword Spotting (KWS) and Phonexia Speech Transcription (STT) for 20+ languages including English, French, German, Russian, Spanish and many more….

Understand SPE directory structure

…at https://download.phonexia.com/docs/spe/ INSTALL.html, INSTALL.txt Quick installation guide in HTML and TXT format UPDATE.txt Quick update instructions and SPE configuration file changes between SPE versions result_versions.txt List of REST API result versions Phonemes_for_STT_and_KWS.pdf List of STT/KWS phonemes, useful e.g. for keyword pronunciations definitions Technology_LID_L4_Language_tags.pdf List of LID L4 language tags and more details about languages they refer to EULA EULA directory…

Key Features (PSP)

…recording, Speech to Text (STT) – several languages supported – converts speech into plain text (words or sentences) automatically, Keyword Spotting (KWS) – several languages supported – detects specific keywords/phrases automatically without conversion to text, Gender identification (GID) – identifies whether a speaker is male or female, Age Estimation (AGE) – estimates the speaker's age group, Voice Activity Detection (VAD)…

Releases and Changelogs (Browser)

…editor now distinguishes KWS/Diar technology models (it is possible to open results for more models at once) [#4979] SID models status indication [#4979] User can prepare SID model/group by context menu [#4980] Show speech length for speaker models [#5041] Fixed processing a lot of files in SID evaluation causes application crash Phonexia Browser v3.8.2, BSAPI 3.12.0 – Jun 29 2017…

What is User configuration file and how to use it

…name User configuration file name stt_cs_cz_5_online.bs stt_cs_cz_5_online.bs.usr kws_nl_nl_5.bs kws_nl_nl_5.bs.usr phnrec_pashto.bs phnrec_pashto.bs.usr vpextract4_xl4.bs vpextract4_xl4.bs.usr During technology initialization (e.g. during Speech Engine startup), the initialization routine checks for the existence of such a user config file. If found, it’s automatically loaded after the main configuration file, and the settings from the user config are automatically applied over the settings from the main configuration file. Usage…
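The naming rule and the "applied over" behaviour described above can be sketched as follows. This is only an illustration: the helper names are hypothetical, the nested dictionaries stand in for already-parsed settings, and the setting names and values in the usage part are made up for the example. How Speech Engine actually parses and merges the files is not covered by this snippet.

```python
from pathlib import Path


def user_config_path(main_config):
    """Return the user config path: the main config name plus '.usr'."""
    p = Path(main_config)
    return p.with_name(p.name + ".usr")


def apply_user_settings(main, user):
    """Overlay user settings on top of main settings, section by section.

    ASSUMPTION: settings are modelled as {section: {key: value}} dictionaries.
    Values from `user` win; everything else in `main` is kept unchanged.
    """
    merged = {section: dict(values) for section, values in main.items()}
    for section, values in user.items():
        merged.setdefault(section, {}).update(values)
    return merged


if __name__ == "__main__":
    print(user_config_path("kws_nl_nl_5.bs"))  # kws_nl_nl_5.bs.usr
    # Hypothetical section and keys, used only to show the overlay.
    main = {"example_section": {"a": "1", "b": "2"}}
    user = {"example_section": {"b": "20"}}
    print(apply_user_settings(main, user))
```

Keeping the overrides in a separate overlay means the main configuration file stays untouched, which matches the load order described in the snippet: main file first, user file applied on top.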

FAQs (PSP)

…looking at the frequency distribution in individual frames. This approach to SNR estimation is based on the article by Chanwoo Kim and Richard M. Stern, “Robust Signal-to-Noise Ratio Estimation Based on Waveform Amplitude Distribution Analysis”, Interspeech 2008. Q: What languages are supported by KWS? A: Please see List of supported KWS Languages. For more…

FAQs (Browser)

…be created from the ./data/phxspe.properties.default template file. Copy the template file to the ./settings directory. Rename it to phxspe.properties. Check the server.enable_authentication_token directive and set it up as needed. Restart phxspe. Basic installation steps are described in the ./doc/INSTALL.html document. Q: What languages are supported by KWS? A: Please see List of supported KWS Languages. For more…

Recommended OS and HW (PSP)

…Intel® Core Processor RAM: 16 GB Storage: 100 GB (depends on your audio retention policy) SSD strongly recommended for superior performance over HDD Configuration includes: STT 6th generation – 2 languages (half load each), KWS 6th generation – 2 languages, LID L4, VAD, SQE Voice Biometrics + Transcription System, basic 100 hours/day package (***) files processing CPU: 14 physical cores,…