Search: keyword spotting

21 results

Keyword Spotting (KWS)

…takes some time – the more pronunciationless keywords in the list, the longer the delay before processing. When the keyword list has pronunciations defined for each keyword, even thousands of defined keywords have no impact on performance. The technology searches the recording and returns the list of found keywords, together with a score and confidence for each found keyword. The score is…

Speech To Text / Keyword Spotting supported languages

Languages supported by Speech To Text and Keyword Spotting. Standard = Maintained until a newer generation is released, or end of support is reached. Language generation is specified by the number in “Model name”. Language (region) | Model name | Released | End of support | Maintenance: Arabic (Gulf, Kuwait) | AR_KW_6 | 2022-04 | 8th gen. | Standard; Arabic (Levantine) | AR_XL_6 | 2021-05 | 8th gen. | Standard; AR_XL_5 | 2020-08 | 7th…

KWS: Results explained

This article aims to give more details about Keyword Spotting outputs and hints on how to tailor Keyword Spotting to best suit your needs. Scoring: Keyword Spotting works by calculating likelihood ratios (LR) that a keyword, or just any other speech, occurs at a given spot, and comparing those two likelihood ratios. The following scheme shows the background model for anything…

Releases and Changelogs (Browser)

…Preview releases 3.51 and 3.52 (see below) Phonexia Browser 3.52 Phonexia Browser 3.52.0, BSAPI 3.52.0 (2022-07-01) New: Added “Keyword Spotting highest confidence” column in Results pane New: Added “Minimum confidence to display” setting for Keyword Spotting in Settings dialog -> Scoring tab (affects number of hits displayed in Results pane) Phonexia Browser 3.51 Phonexia Browser 3.51.0, BSAPI 3.51.0 (2022-06-14) New:…

Releases and Changelogs (SPE)

…to store data to a file and no data was sent [#4295] Fixed unable to find license file if path contains special characters [Windows] [#4145] Added VAD benchmark [#4146] Added SQE benchmark [#4148] Added keyword threshold to keyword list [#3797] Added stream TAE [#4199] Fixed websocket may not be correctly closed in some cases [#4216] Changed result for SQE (see…

SPE and Browser installation: standalone SPE

…nr. 23) 1) Age Estimation [active model: XL5(1x)] 2) Denoiser Technology [active model: EN_US(1x)] 3) Diarization [active model: XL4(1x)] 4) Gender Identification [active model: XL5(1x)] 5) Keyword Spotting [active model: EN_US_6(1x)] 6) Phoneme Recognition [active model: EN_US_6(1x)] 7) Keyword Spotting Stream [active model: EN_US_6(1x)] 8) Language Identification LanguagePrint Comparator [active model: L4(1x)] 9) Language Identification LanguagePrint Extractor [active model: L4(1x)]…

Release Notes

…BROWSER Update We finished small but important improvements: The Age column in Results pane now shows the numeric results instead of age groups; column name changed to Age (±10 years) to emphasize the results tolerance Added the Keyword Spotting highest confidence column in Results pane, showing the highest confidence value of all detected keywords in a recording (allowing to judge…

Phoneme Recogniser (PHNREC)

…Recogniser is delivered as part of Keyword Spotting (KWS) technology. It can also be used without KWS technology. Typical use cases: „search-in-speech“ – search for specific information in large call archives (e.g., claims inspection), get a custom pronunciation of a word or phrase to use as a customized keyword in keyword spotting technology (better accuracy of KWS technology), get custom based pronunciation of word…

Phonexia technology models EoL

Speech to Text (STT) and Keyword Spotting (KWS) models: Languages supported by Speech To Text and Keyword Spotting. Standard = Maintained until a newer generation is released, or end of support is reached. Language generation is specified by the number in “Model name”. Language (region) | Model name | Released | End of support | Maintenance: Arabic (Gulf, Kuwait) | AR_KW_6 | 2022-04 | 8th gen. | Standard; Arabic…

Key Features (PSP)

…recording, Speech to Text (STT) – several languages supported – converts speech into plain text (words or sentences) automatically, Keyword Spotting (KWS) – several languages supported – detects specific keywords/phrases automatically without conversion to text, Gender identification (GID) – identifies whether a speaker is male or female, Age Estimation (AGE) – estimates the speaker's age group, Voice Activity Detection (VAD)…

Download Speech Platform

…models for Speech To Text and Keyword Spotting. Additional supported languages are available upon request. ⓘ Click to show/hide the package content Speech Engine – technologies included: Speech To Text (STT) – model EN_US_6 (US English) Keyword Spotting (KWS) – model EN_US_6 (US English) Phoneme Recognizer (PHNREC) – model EN_US_6 (US English) Speaker Identification 4 (SID4) – model XL5 Diarization…

Understand SPE technologies configuration file

…Diarization GID Gender Identification KWS Keyword Spotting KWS_STREAM Keyword Spotting Stream LIDC Language Identification Languageprint Comparator LIDE Language Identification Languageprint Extractor PHNREC Phoneme Recognition SID4C Speaker Identification 4 Voiceprint Comparator SID4C_STREAM Speaker Identification 4 Voiceprint Stream Comparator SID4CALIB Speaker Identification 4 VoicePrint Calibration SID4E Speaker Identification 4 Voiceprint Extractor SID4E_STREAM Speaker Identification 4 Voiceprint Stream Extractor SQE Speech Quality Estimation…

Adding new language or technology model (Browser)

…on the SPE server -> Server Info. You should see output similar to this: Please share this SPE version number (in the above example, 3.50.5) with your Phonexia contact or through a support ticket. Installation of new models: In our example, we will install the Spanish (ES_6) model of Speech to Text and Keyword Spotting (with Phoneme Recognizer) into an existing installation…

Support Lifecycle Policy (PSP)

…3.7 2017-03-27 2018-09-27 3.8 Public 3.6 2016-12-14 2018-06-14 3.7 Public 3.5 2016-10-04 2018-04-04 3.6 Public 3.4 2016-09-19 2018-03-19 3.5 Public 3.3 2016-07-11 2018-02-11 3.4 Public 3.2 2016-04-22 2017-10-22 3.3 Public 3.1 2016-02-15 2017-08-15 3.2 Public 3.0 2016-02-09 2017-08-09 3.1 Public 2.1 2015-09-16 2017-09-16 2017-09-16 Public 2.0 2015-01-06 2016-07-06 2.1 Public Speech to Text (STT) and Keyword Spotting (KWS) models Languages…

Phonexia Speech Engine

Phonexia Speech Engine (SPE) is the main part of Phonexia Speech Platform. SPE is a server application for 64-bit Linux or Windows, providing a REST API to the entire portfolio of Phonexia speech technologies. SPE capabilities overview: Audio files and stream processing Audio files RTP / HTTP streams Speaker Identification (SID) ✓ ✓ Speech To Text (STT) ✓ ✓ Keyword Spotting (KWS) ✓…