
Search: speech intelligence resolver

121 results

Releases and Changelogs (SPE)

…recordings Fixed: Unable to initialize technologies when SPE is launched using UNC path on Windows Speech Engine 3.57 (Public release) Speech Engine 3.57.0, DB v1901, BSAPI 3.57.0 (2023-02-01) New: AGE XL4 and XL5 models (for compatibility with SID4 XL4 and XL5 voiceprints) Speech Engine 3.56 (Public release) Speech Engine 3.56.0, DB v1901, BSAPI 3.56.0 (2022-12-15) New: GID XL5 model (for…

Release Notes

Speech Platform release 3.60 New features and fixes Previous Releases Speech Platform Public Release Fall 2022 (SPE v3.55) Speech Platform public release Spring 2022 (SPE v3.50) Speech Platform public release Fall 2021 (SPE v3.45) Speech Platform release 3.60 Here is a summary of the most important new features and fixes since the last Public Release 3.55. New features…

FAQs (PSP)

…granted during off-line transcription, the speech engine can correct the result before it is printed out by also taking the subsequent segments into account. The beginning of the recording can then be recognized with high accuracy too. Q: How do you calculate SNR in Speech Quality Estimation? A: Signal-to-Noise Ratio (SNR) is an important metric of whether…

Key Features (PSP)

…– detects the audio part that contains voice, Speech Quality Estimation (SQE) – measures the quality of speech, Phoneme Recognizer (PHNREC) – several languages supported – converts speech into phonemes (written characters representing pronunciation), Waveform Denoiser (DENOISER) – automatically improves the audibility of speech for human listeners. Supported Languages The LID, STT and KWS technologies support various languages as listed…

Speech to Text (STT)

…n-grams. Using this, the user can adapt a language model to a specific domain to get better results. Result types During transcription, there are always several alternatives for a given speech segment. The technology can provide one or more results. The 1-best result type provides only the result with the highest score. Speech is returned in…

FAQs (Browser)

…Browser. Q: What languages do you offer? A: It depends on the technology. Phonexia Language Identification (LID) is pre-trained for 60+ languages. Phonexia Keyword Spotting (KWS) and Phonexia Speech Transcription (STT) are pre-trained for 20+ languages including English, French, German, Russian, Spanish and many more. Q: What…

SPE and Browser installation: standalone SPE

…to start processing your recordings with Phonexia Speech Technologies. 1. Download Evaluation package Download the Phonexia Evaluation package from https://partner.phonexia.com/kb/sp/speech-platform/evaluation-package/ Simply unzip the package to your desired location. Ideally, avoid C:/Program Files, as you may face issues with privileges later on. 2. Save license.dat file Copy the license.dat file to the /SPE/ directory. Make sure the license.dat file is not…

Download Speech Platform

…issues and malfunctions, please take the free RAM requirement seriously. See also the additional information on the Recommended OS and HW page. While downloading, you can check the updates: Speech Engine changes and Browser changes. Speech Platform 3.60.1 for Windows 64-bit 4 GB Download Speech Platform 3.60.1 for Linux 64-bit 4 GB Download To keep the download size reasonable, the package includes…

Measuring of a software processing speed – what is the FtRT (Faster than Real Time)

…noise, technical signals like ringing, DTMF tones, etc.). This metric is useful for assessing performance on the actual audio data coming into the audio processing pipeline. [Figure: Regular recording with Voice and Silence segments in waveform] Net Speech based FtRT is a conservative, purely technical number. It is calculated only from spoken speech data, i.e. with all non-speech parts (silence, noise, DTMF tones, etc.)…

Phonexia Speech Engine

Phonexia Speech Engine (SPE) is the main part of Phonexia Speech Platform. SPE is a server application for 64-bit Linux or Windows, providing a REST API to the entire portfolio of Phonexia speech technologies. SPE capabilities overview: Audio files and stream processing Audio files RTP / HTTP streams Speaker Identification (SID) ✓ ✓ Speech To Text (STT) ✓ ✓ Keyword Spotting (KWS) ✓…

STT: Language Model Customization tutorial

…copy of the word list file, as a backup) – see below for the best location for usage in Speech Engine Using customized STT model in Speech Engine STT To use a customized STT model in Speech Engine STT, it’s necessary to place the customized model in the correct location, so that Speech Engine can find it register and enable the customized…

Understand SPE technologies, instances and workers

Configuring Speech Engine to effectively utilize the full power of the underlying hardware can get challenging – one can easily get lost in all the strange terms like technologies, instances, slots, or workers… This article should shed some light on it. Speech Engine is like a post office Thinking about Speech Engine, there is actually a very nice analogy with a post office…

Releases and Changelogs (Browser)

Phonexia Browser is a tool for testing Phonexia speech technologies available via Speech Engine API. Releases
Version  Release Date  End of Support  Maintained Until  Release type
3.60     2023-12-05    2025-06-01      n/a               Public
3.59     2023-06-20    2025-01-01      n/a               Public
3.58     2023-04-03    2024-10-01      n/a               Public
3.57     2023-02-02    2024-08-01      n/a               Public
3.56     2022-12-15    2024-06-01      n/a               Public
3.55     2022-10-03    2024-04-01      3.60              Public
3.52     2021-07-01    2021-09-30      3.55…

SID4 performance on Intel® Xeon® Platinum 8124M

…Customers can usually only describe the data set of recordings captured during a specified time period with numbers like the total number of recordings, the average file size of a captured recording, or the total number of captured hours. Customers usually don’t have any information about the ratio between the speech signal and the technical/silence parts of recordings in the beginning. The speech / non-speech ratio is detected…

Understand SPE benchmark

…SPE in the {SPE}/data/benchmark directory. The second option uses a single audio file of your choice uploaded to SPE storage, specified by the path parameter. The set of audio files supplied with SPE contains recordings of various lengths (from 30 seconds to 5 minutes) and with various speech/non-speech ratios. This is to account for the fact that both the length of…