Formats supported directly and natively are: WAVE (*.wav) container including any of: unsigned 8-bit PCM (u8) unsigned 16-bit PCM (u16le) IEEE float 32-bit (f32le) A-law (alaw) µ-law (mulaw) ADPCM FLAC codec inside FLAC (*.flac) container OPUS codec inside OGG (*.opus) container Other audio formats must be converted to one of those natively supported using external tools. SPE server can be…
Search: Phonexia Browser
57 results
MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY…
Our technologies are prepared to run on both Windows and Linux OS. For more details of the supported operating systems as well as recommended HW setup, see Recommended OS and HW…
A: Please proceed by doing the following: Check your Internet connectivity using standard browser against https://www.phonexia.com. Check you are not running more instances than allowed by the license file (using -j parameter in command line). In rare cases your SW copy may have accidentally crashed. If this is the case, please wait for automatic license renewal period (60 minutes after…
MODULE 4: Speech Analytics technologies (23 min) Common generic rules for CLI, REST and GUI Speech To Text (STT) in CLI, REST and GUI Keyword Spotting (KWS) in CLI, REST and GUI Phoneme Recognizer (PHNREC) in CLI, REST and GUI Time Analysis Extraction (TAE) in CLI, REST and GUI Summary https://www.youtube.com/watch?v=-FAoRywqv7U…
A: Check your license file (license.dat) by opening it in Notepad. Make sure the license contains records for all required modules. See Licensing article for additional information…
A: Please see List of supported LID Languages. For more details, see LID technology documentation….
A: Please see List of supported STT Languages. For more details, see STT technology documentation….
MODULE 2: Filtering and supporting technologies (22 min) Common generic rules for CLI, REST and GUI Filtering, sorting, pre-/post-processing overview Speech Quality Estimation (SQE) in CLI, REST and GUI Voice Activity Detection (VAD) in CLI, REST and GUI Diarization (DIAR) in CLI, REST and GUI Age Estimation (AGE) in CLI, REST and GUI Denoiser (DENOISER) in CLI, REST and GUI…
A: These abbreviations mean the following: LR – likelihood ratio, result from statistical test for two models comparison. It returns a number which expresses how many times more likely the data are under one model than the other. LR meets numbers in interval <0;+inf). LLR – abbreviation for log-likelihood ratio statistic, logarithmic function of LR. LLR meets numbers in interval…
A: Threshold for score isn’t set up correctly. Adjust speaker score sharpness value to calibrate the recalculation. Please see Calibration in technology documentation….
MODULE 3: Voice Biometrics technologies (23 min) Common generic rules for CLI, REST and GUI Speaker Identification (SID) in CLI, REST and GUI Language Identification (LID) in CLI, REST and GUI Gender Identification (GID) in CLI, REST and GUI Summary https://www.youtube.com/watch?v=AyEoPfYVel8…
…complete. If it doesn’t, power it on manually. After the machine boots, a login screen should appear with Phonexia logo and obtained IP address. Step 4 – access Orbis web interface Open a web browser and use the IP address obtained in the previous step as an address. Approve the default SSL certificate. Orbis web interface should then appear. The…
…the original one exactly. This typically involves changing the networking settings according to your environment. Step 6 – final checks and cleaning up Click Begin installation in the upper left corner. The virtual machine should start automatically when the process is complete. If it doesn’t, power it on manually. After the machine boots, a login screen should appear with Phonexia…
For evaluating the real life scenario of Phonexia Speaker Identification technology, the system needs to be calibrated by SID dataset. SID dataset (minimum requirements): To measure SID performance precisely, it’s important to prepare evaluation recordings set very carefully. The requirements are: 50+ known speakers, 200+ recordings in total (i.e. 3 to 5 recordings per speaker*) 1+ minute of net speech…