Skip to content Skip to main navigation Skip to footer

Search: phoneme

22 results

Download Speech Platform

…only English models for Speech To Text and Keyword Spotting. Additional supported languages are available upon request. ⓘ Click to show/hide the package content Speech Engine – technologies included: Speech To Text (STT) – model EN_US_6 (US English) Keyword Spotting (KWS) – model EN_US_6 (US English) Phoneme Recognizer (PHNREC) – model EN_US_6 (US English) Speaker Identification 4 (SID4) – model…

Time Analysis Extraction (TAE)

Technology description Time Analysis Extraction (TAE) by Phonexia extracts base information from dialogue in a recording, providing essential knowledge about conversation flow. That makes easy to identify: long reaction time crosstalk responses of speakers in both channels speed of speech measured in phonemes per second Typical usage domain It is typically used in contact centers for indicating weak moments in…

Download Voice Inspector 5.1

…models VIN application (graphical user interface, GUI) with the following technologies in-build Speaker Identification (SID4_XL5) Speaker Diarization (DIAR) Voice Activity Detection (VAD) Speech Quality Estimator (SQE) Phoneme Recogniser (PHNREC) example population sets and audio (in ./examples/) and example report templates (in ./templates/) Hardware requirements minimum – CPU: Intel® Core™ i5, RAM: 4 GB, Required HDD space: 0.5 GB for software…

Understand SPE benchmark

…120.wav ├── 150.wav ├── 180.wav ├── 210.wav ├── 240.wav ├── 270.wav └── 300.wav For majority of technologies, the content of default directory is used for the benchmarking. Benchmarking of the language-specific technologies – STT (Speech To Text) and PHNREC (Phoneme Recognizer) – first tries to find a directory with a name matching the start of the benchmarked model name and…

Video – Speech Analytics technologies

MODULE 4: Speech Analytics technologies (23 min) Common generic rules for CLI, REST and GUI Speech To Text (STT) in CLI, REST and GUI Keyword Spotting (KWS) in CLI, REST and GUI Phoneme Recognizer (PHNREC) in CLI, REST and GUI Time Analysis Extraction (TAE) in CLI, REST and GUI Summary https://www.youtube.com/watch?v=-FAoRywqv7U…

Phonexia technologies introduction

…and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender Identification (GID) Speech Analytics technologies 11:43 Speech Transcription (STT) 12:30 Keyword Spotting (KWS) 13:32 Phoneme Recognition (PHNREC) 13:54 Time Analysis…

Release Notes

…to 81 % Phoneme Recognition technology is now included with Speech To Text This should help users with defining pronunciations in preferred phrases, when adding words to language model, or when creating customized model Added grammar rules for words to numbers conversion in 3.59.0 Pre-defined rules for English, Spanish and Polish were added. For other languages we provide empty definitions…

Understand SPE technologies configuration file

…Diarization GID Gender Identification KWS Keyword Spotting KWS_STREAM Keyword Spotting Stream LIDC Language Identification Languageprint Comparator LIDE Language Identification Languageprint Extractor PHNREC Phoneme Recognition SID4C Speaker Identification 4 Voiceprint Comparator SID4C_STREAM Speaker Identification 4 Voiceprint Stream Comparator SID4CALIB Speaker Identification 4 VoicePrint Calibration SID4E Speaker Identification 4 Voiceprint Extractor SID4E_STREAM Speaker Identification 4 Voiceprint Stream Extractor SQE Speech Quality Estimation…

STT: Language Model Customization tutorial

…containing list if words to be added to the STT language model, one word per line. Note: LMC v3.30.0 (March 2020) or older requires the text file without Byte-Order-Mark (BOM) Each word can be optionally followed by its pronunciation, separated from the word by SPACE or TAB character. Pronunciations must use only phonemes allowed by the corresponding language – see…

STT: What is Preferred Phrases feature and how to use it

…German word like “grüßen” in Czech transcription – or even using different writing script like Cyrillic or Japanese Kana. Such words MUST be accompanied by a pronunciation definition, and that definition must use only phonemes supported by the STT model (i.e. the German word from the previous example would need to have pronunciation defined using Czech phonemes). See also more…

Adding new language or technology model (Browser)

…on the SPE server -> Server Info. You should see the output similar to this: Please share this SPE version number with your Phonexia contact/or through support ticket (in the above example 3.50.5) . Installation of new models In our example, we will install Spanish (ES_6) model of Speech to Text and Keyword Spotting (with Phoneme Recognizer) into existing installation…

Key Features (VIN)

…Speaker Identification, Speaker Diarization, Phoneme Recognizer, Voice Activity Detection, Speech Quality Estimation A search for repetitive sound patterns across all recordings in audio due to the automatic phonemic transcription Input: Questioned recordings (a minimum of 1 recording) Suspected speaker recordings (a minimum of 1 recording) The Population set (a technical minimum of 10 speakers, and a recommended minimum of 50…

Understand SPE directory structure

…at https://download.phonexia.com/docs/spe/ INSTALL.html, INSTALL.txt Quick installation guide in HTML and TXT format UPDATE.txt Quick update instructions and SPE configuration file changes between SPE versions result_versions.txt List of REST API result versions Phonemes_for_STT_and_KWS.pdf List of STT/KWS phonemes, useful e.g. for keyword pronunciations definitions Technology_LID_L4_Language_tags.pdf List of LID L4 language tags and more details about languages they refer to EULA EULA directory…

SPE and Browser installation: standalone SPE

…line type: phxadmin.exe /configure-tech SPE on Linux Open the Terminal window in  /SPE/ directory Type in the terminal: ./phxadmin –configure-tech This will open the list of technologies (and language models) available for you to chose from 1) Age Estimation [disabled] 2) Denoiser Technology [disabled] 3) Diarization [disabled] 4) Gender Identification [disabled] 5) Keyword Spotting [disabled] 6) Phoneme Recognition [disabled] 7)…

Releases and Changelogs (Browser)

…didn’t work Phonexia Browser v3.13.1, BSAPI 3.17.0 – Nov 19 2018 [G#79] Dropdown buttons in SID Evaluation wizard results page can get hidden without user knowing about them [G#90] Fixed synchronize properly items in View menu when layout is reseted [G#88] Fixed graphs in LID detail view may show incorrect values [G#36] Added Phoneme Recognizer [G#62] Use Qt 5 [G#85]…