Search: spe

127 results

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min). Topics: installation; technologies configuration; server and database configuration; users configuration; files processing; synchronous and asynchronous requests, results polling; stream processing. https://youtu.be/4qrB-GfFdWY…

Speech Engine

To create the SPE report:
1) Go to the SPE installation directory.
2) Open a command line/terminal (in Ubuntu Linux, right-click and press E; in Windows, type cmd in the address bar).
3) Run ./phxadmin --report (Linux) or phxadmin.exe /report (Windows).
4) Zip up the created report directory and attach the ZIP file to your issue description.
The Report functionality is not present…
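The report steps above could be scripted roughly as follows. This is a minimal sketch, not an official tool: the function names are hypothetical, the Linux flag is assumed to be a double hyphen (the snippet shows a typographic dash), and the name of the directory that phxadmin creates is not given in the snippet, so the caller must pass it in.

```python
import platform
import shutil
import subprocess
from pathlib import Path


def report_command(spe_dir: Path, system: str) -> list[str]:
    """Build the report command quoted in the snippet for the given OS name."""
    if system == "Windows":
        return [str(spe_dir / "phxadmin.exe"), "/report"]
    # Assumption: the dash shown in the snippet stands for a double hyphen.
    return [str(spe_dir / "phxadmin"), "--report"]


def zip_report(report_dir: Path) -> Path:
    """Zip the report directory next to itself and return the archive path."""
    archive = shutil.make_archive(
        str(report_dir), "zip",
        root_dir=report_dir.parent, base_dir=report_dir.name,
    )
    return Path(archive)


def create_report(spe_dir: Path, report_dir: Path) -> Path:
    """Run phxadmin from the SPE installation directory, then zip the report.

    report_dir is the directory phxadmin creates; its name is not stated in
    the snippet, so it is a caller-supplied assumption here.
    """
    subprocess.run(report_command(spe_dir, platform.system()),
                   cwd=spe_dir, check=True)
    return zip_report(report_dir)
```

The resulting ZIP file is what would be attached to the issue description.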

Speech To Text / Keyword Spotting supported languages

Languages supported by Speech To Text and Keyword Spotting
Standard = Maintained until newer generation is released, or end of support is reached.
Language generation is specified by the number in “Model name”.

Language (region)       Model name   Released   End of support   Maintenance
Arabic (Gulf, Kuwait)   AR_KW_6      2022-04    8th gen.         Standard
Arabic (Levantine)      AR_XL_6      2021-05    8th gen.         Standard
                        AR_XL_5      2020-08    7th…
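Since the language generation is encoded as the number in the model name, it can be read programmatically. The sketch below assumes the generation is always the trailing number after the last underscore, as in the model names listed here; `model_generation` is a hypothetical helper, not part of any Phonexia tool.

```python
import re


def model_generation(model_name: str) -> int:
    """Return the generation number at the end of a model name like AR_KW_6."""
    match = re.search(r"_(\d+)$", model_name)
    if match is None:
        raise ValueError(f"no generation number found in {model_name!r}")
    return int(match.group(1))


for name in ("AR_KW_6", "AR_XL_6", "AR_XL_5"):
    print(name, "is generation", model_generation(name))
```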

Key Features (PSP)

…The Speech Platform includes the following technologies. Technologies are available in the Speech Engine component based on its particular configuration (Voice Biometrics, Transcription System, etc.).
Speaker Identification (SID) – recognizes a speaker automatically based on their voice
Speaker Diarization (DIAR) – separates multiple speakers in mono audio automatically
Language Identification (LID) – detects the language or dialect spoken in a…

Download other languages for Speech platform

This part requires higher (and non-anonymous) access level.

FAQs (Browser)

…debug output of SPE
Linux: Run the PhxBrowser software in a terminal with the command: ./PhxBrowser --spe-debug --spe-output
PhxBrowser will start with an “SPE output” tab, which shows the debug output of SPE.
Q: Why does the system show a high score (>90%) even for non-targets?
A: The score threshold isn’t set up correctly. Adjust speaker…

Releases and Changelogs (VIN)

…Target score distribution
Fixed: Population Set selected correctly even if renamed in the selection window
Improved: Speech length display in the case view: added “Unlimited” option to display the speech length permanently
Improved: SID Evidence score aligned with Speech Engine output of SID score
Removed: Speech length compensation
Voice Inspector 5.1
Voice Inspector 5.1.0, BSAPI 3.60.0 (2023-12-07)
New: A generalized…

STT: What is Preferred Phrases feature and how to use it

…it can help in other applications, too – e.g. when transcribing domain-specific audio, frequently used domain-specific phrases can be boosted.
How preferred phrases work
The picture below shows a simplified standard speech transcription process – the digitized speech signal spectrum is analyzed in the neural network acoustic model (which describes the pronunciations of a given language) and goes into…

STT: Language Model Customization tutorial

…model in Speech Engine using phxadmin
1) Placing the customized STT model in the correct location
In order to be recognized by Speech Engine, the customized STT model must be placed in the correct location. The best location in SPE 3.41 or newer is <SPE_directory>/shared/bsapi/stt (see the Understand SPE directory structure article). In older versions it’s <SPE_directory>/bsapi/stt. The data and settings directories…
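The version-dependent location described above can be expressed as a small helper. This is only a sketch: `stt_model_dir` is a hypothetical function, and representing the SPE version as a `(major, minor)` tuple is an assumption made for illustration.

```python
from pathlib import Path


def stt_model_dir(spe_dir: Path, spe_version: tuple[int, int]) -> Path:
    """Return the STT model directory for a given SPE version.

    SPE 3.41 or newer: <SPE_directory>/shared/bsapi/stt
    Older versions:    <SPE_directory>/bsapi/stt
    """
    if spe_version >= (3, 41):
        return spe_dir / "shared" / "bsapi" / "stt"
    return spe_dir / "bsapi" / "stt"


# The installation path below is a made-up example.
print(stt_model_dir(Path("/opt/spe"), (3, 41)))
print(stt_model_dir(Path("/opt/spe"), (3, 30)))
```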

Keyword Spotting (KWS)

…a numerical expression of the probability that a word was said in a specified time frame.
Keywords
Keywords are not dependent on any dictionary. This makes it possible to define specific, foreign, or even nonexistent words, such as product names. However, only allowed graphemes (symbols) from a supported list can be used to define keywords. This list can be easily obtained by Speech Engine and…
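The grapheme restriction can be checked before a keyword is submitted. The sketch below is illustrative only: the allowed set shown is a made-up placeholder (the real list comes from Speech Engine), `invalid_graphemes` is a hypothetical helper, and it treats every character as one grapheme, which may not hold for every language.

```python
def invalid_graphemes(keyword: str, allowed: set[str]) -> list[str]:
    """Return the sorted characters of keyword that are not in the allowed set."""
    return sorted({ch for ch in keyword if ch not in allowed})


# Placeholder grapheme list; the real one must be obtained from Speech Engine.
allowed = set("abcdefghijklmnopqrstuvwxyz")

for keyword in ("phonexia", "phonexia2"):
    bad = invalid_graphemes(keyword, allowed)
    status = "ok" if not bad else f"unsupported graphemes: {bad}"
    print(keyword, "->", status)
```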

Q: While trying to install SPE3, I get the error for loading libasound.so.2 libraries

Currently I’m trying to install the provided binaries for Linux, but I get the following error when running phxadmin:
./phxadmin: error while loading shared libraries: libasound.so.2: cannot open shared object file: No such file or directory
I’m trying to run this under CentOS 7.
A: Please install the libraries required for working with audio files from the official repository into…

Time Analysis Extraction (TAE)

…to the other: places of the speaker’s longest and shortest reaction (i.e., where this speaker stopped talking and the other speaker started talking), the average reaction times, the number of speaker turns in the particular direction, and details about crosstalk, for example where the other speaker is talking “over” this speaker.
Segmentation
This section is optional and needs to be explicitly turned on…

SID4 performance on Intel® Xeon® Platinum 8124M

…Customers can usually only describe the data set of recordings captured during a specified time period with numbers like: total number of recordings, average file size of a captured recording, or total number of captured hours. Customers usually don’t have any information about the ratio between speech signal and technical/silence parts of recordings at the beginning. The speech / non-speech ratio is detected…

Key Features (VIN)

…Speaker Identification, Speaker Diarization, Phoneme Recognizer, Voice Activity Detection, Speech Quality Estimation
A search for repetitive sound patterns across all recordings in audio due to the automatic phonemic transcription
Input:
Questioned recordings (a minimum of 1 recording)
Suspected speaker recordings (a minimum of 1 recording)
The Population set (a technical minimum of 10 speakers, and a recommended minimum of 50…

Download Voice Inspector 5.2

…models
VIN application (graphical user interface, GUI) with the following technologies built in:
Speaker Identification (SID4_XL5)
Speaker Diarization (DIAR)
Voice Activity Detection (VAD)
Speech Quality Estimator (SQE)
Phoneme Recogniser (PHNREC)
example population sets and audio (in ./examples/) and example report templates (in ./templates/)
Hardware requirements
minimum – CPU: Intel® Core™ i5, RAM: 4 GB, Required HDD space: 0.5 GB for software…