Skip to content Skip to main navigation Skip to footer

Search: speech quality estimation

18 results

Video – Filtering and supporting technologies

MODULE 2: Filtering and supporting technologies (22 min) Common generic rules for CLI, REST and GUI Filtering, sorting, pre-/post-processing overview Speech Quality Estimation (SQE) in CLI, REST and GUI Voice Activity Detection (VAD) in CLI, REST and GUI Diarization (DIAR) in CLI, REST and GUI Age Estimation (AGE) in CLI, REST and GUI Denoiser (DENOISER) in CLI, REST and GUI…

FAQs (Voice Verify)

…if speech quality is not good (audible) enough. The speech quality can be optionally indicated by Audio Quality Estimation score. More info about this feature is available here. During the verification part of process, if the user cannot be authenticated, the agent should ask the user to move to a quieter environment if possible. In case verification is not possible,…

Key Features (VIN)

…Speaker Identification, Speaker Diarization, Phoneme Recognizer, Voice Activity Detection, Speech Quality Estimation A search for repetitive sound patterns across all recordings in audio due to the automatic phonemic transcription Input: Questioned recordings (a minimum of 1 recording) Suspected speaker recordings (a minimum of 1 recording) The Population set (a technical minimum of 10 speakers, and a recommended minimum of 50…

Phonexia technologies introduction

…and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender Identification (GID) Speech Analytics technologies 11:43 Speech Transcription (STT) 12:30 Keyword Spotting (KWS) 13:32 Phoneme Recognition (PHNREC) 13:54 Time Analysis…

Audio Quality Estimation

…the PESQ (Perceptual Evaluation of Speech Quality) metric, the values can range from -0,5 to 4,5. To help interpret the data, there is also a binary verdict present. When the value is below 2, we consider the audio quality bad. Value over 2 means good audio quality of the speech. Audio Quality estimation is turned off by default. The reason…

Understand SPE technologies configuration file

…SQE_STREAM Speech Quality Estimation Stream STT Speech To Text STT_STREAM Speech To Text Stream TAE Time Analysis Extraction TAE_STREAM Time Analysis Extraction Stream VAD Voice Activity Detection VAD_STREAM Voice Activity Detection Stream SIDC Speaker Identification Voiceprint Comparator (legacy) SIDC_STREAM Speaker Identification Voiceprint Stream Comparator (legacy) SIDCALIBSET Speaker Identification VoicePrint Calibration (legacy) SIDCALIBSET_STREAM Speaker Identification VoicePrint Stream Calibration (legacy) SIDE Speaker…

General

…”payload”: { “stream_uuid”: “<uuid>”, “external_id”: “<external_id>”, “result”: “<verdict>”, “speech_length“: <speech_length>, “score”: <score> } } Where <verdict> can state one of the following: there is not enough net speech to make the verification yet voiceprint with external_id does not exist verified not verified not sure speech_length returns the number of seconds of net speech present in the enrollment, <score> returns the

Phonexia Speech Engine

Phonexia Speech Engine (SPE) is main part of Phonexia Speech Platform. SPE is a server application for 64-bit Linux or Windows, providing REST API to entire portfolio of Phonexia speech technologies. SPE capabilities overview: Audio files and stream processing Audio files RTP / HTTP streams Speaker Identification (SID) ✓ ✓ Speech To Text (STT) ✓ ✓ Keyword Spotting (KWS) ✓…

Download Speech Platform

…XL5 Diarization (DIAR) – model XL4 Language Identification (LID) – model L4 Gender Identification (GID) – model XL5 Age Estimation (AGE) ) – model XL5 Voice Activity Detection (VAD) – model GENERIC_3 and SID4_XL5 Speech Quality Estimation (SQE) Time Analysis Extraction (TAE) Waveform Denoiser (DENOISER) Phonexia Browser example audio (in ./BROWSER/example/ and ./SPE/bsapi/{technology}/example/) Step #2 – First start To get…

FAQs (PSP)

…granted during off-line transcription, speech engine can correct result before it is printed out by taking into account also the subsequent segments. The beginning of the recording can then be recognized with high accuracy too. in FAQ Speech Platform Permalink Q: How do you calculate SNR in Speech Quality Estimation? A: Signal-to-Noise Ratio (SNR) is an important metric of whether…

Key Features (PSP)

…– detects the audio part that contains voice, Speech Quality Estimation (SQE) – measures the quality of speech, Phoneme Recognizer (PHNREC) – several languages supported – converts speech into phonemes (written characters representing pronunciation), Waveform Denoiser (DENOISER) – automatically improves the audibility of speech for human listeners. Supported Languages The LID, STT and KWS technologies support various languages as listed…

SPE and Browser installation: standalone SPE

…10) Speaker Identification 4 VoicePrint Extractor [active model: XL5(1x)] 11) Speaker Identification 4 VoicePrint Comparator [active model: XL5(1x)] 12) Speaker Identification 4 VoicePrint Calibration [active model: XL5(1x)] 13) Speaker Identification 4 VoicePrint Stream Extractor [active model: XL5(1x)] 14) Speaker Identification 4 VoicePrint Stream Comparator [active model: XL5(1x)] 15) Speech Quality Estimation [active model: GENERIC(1x)] 16) Speech Quality Estimation Stream [active…

Release Notes

Table of Contents Toggle Speech Platform release 3.60 New features and fixes Previous Releases Speech Platform Public Release Fall 2022 (SPE v3.55) Speech Platform public release Spring 2022 (SPE v3.50) Speech Platform public release Fall 2021 (SPE v3.45) Speech Platform release 3.60 Here is a summary of most important new features and fixes since last Public Release 3.55. New features…