
Search: gender

18 results

FAQs (Voice Verify)

…calibration and pulling results to Call Center SW is estimated to take several weeks. The customer’s internal processes can affect the timeframe considerably. Additional information, including timelines, can be found here. Q: Does Voice Verify provide information about gender, age or language used by a speaker during a verification? A: No, Voice Verify does…

Download Speech Platform

…XL5 Diarization (DIAR) – model XL4 Language Identification (LID) – model L4 Gender Identification (GID) – model XL5 Age Estimation (AGE) – model XL5 Voice Activity Detection (VAD) – model GENERIC_3 and SID4_XL5 Speech Quality Estimation (SQE) Time Analysis Extraction (TAE) Waveform Denoiser (DENOISER) Phonexia Browser example audio (in ./BROWSER/example/ and ./SPE/bsapi/{technology}/example/) Step #2 – First start To get…

Phonexia Speech Engine

…✓ Voice Activity Detection (VAD) ✓ ✓ Time Analysis Extraction (TAE) ✓ ✓ Speech Quality Estimation (SQE) ✓ ✓ Language Identification (LID) ✓ Gender Identification (GID) ✓ Age Estimation (AGE) ✓ Speaker Diarization (DIAR) ✓ Results caching Processing results can be optionally stored in results cache database to speed up eventual re-processing of the same recordings by the same technology…

Releases and Changelogs (SPE)

…see /technologies/genderid) New: Added ‘per_channel’ parameter to Diarization for processing multi-channel recordings New: Added configuration option to not start SPE if some technology doesn’t start (server.require_all_configured_technologies) Fixed: Random SIGSEGV crashes in CS_CZ_5 STT Fixed: KWS CS_CZ_5 ignores keyword thresholds Fixed: Duplicated output from KWS Fixed: KWS online configurations for models CS_CZ_5 and NL_NL_5 Fixed: phxadmin increases number of instances in…

Speaker Identification (SID)

…possible (device, acoustic channel, distance from microphone, language, gender, etc.). Although the system is robust to such factors, this calibration will provide even better results and stronger evidence. However, the main purpose of FAR calibration is to ensure that the system produces only a specific amount of False Acceptances (see below) with the given data. To perform the FAR calibration,…

Video – Voice Biometrics technologies

MODULE 3: Voice Biometrics technologies (23 min) Common generic rules for CLI, REST and GUI Speaker Identification (SID) in CLI, REST and GUI Language Identification (LID) in CLI, REST and GUI Gender Identification (GID) in CLI, REST and GUI Summary https://www.youtube.com/watch?v=AyEoPfYVel8…

Q: What are the requirements for SID evaluation dataset?

…in each recording (i.e. usually 2+ minutes recording length); only one speaker in each recording; a wide variety of gender and age is recommended; recordings should be as similar to the target use case as possible (device, channel, distance from mic, language distribution); audio files should be mono, lin16 format, 8 kHz+ sample rate. *Note: splitting a single recording into multiple shorter…

Phonexia technologies introduction

…and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender Identification (GID) Speech Analytics technologies 11:43 Speech Transcription (STT) 12:30 Keyword Spotting (KWS) 13:32 Phoneme Recognition (PHNREC) 13:54 Time Analysis…

Key Features (PSP)

…recording, Speech to Text (STT) – several languages supported – converts speech into plain text (words or sentences) automatically, Keyword Spotting (KWS) – several languages supported – detects specific keywords/phrases automatically without conversion to text, Gender Identification (GID) – identifies whether a speaker is male or female, Age Estimation (AGE) – estimates the speaker’s age group, Voice Activity Detection (VAD)…

Understand SPE directory structure

…for individual models settings BSAPI configuration files (*.bs) and optionally manually created user configs (*.bs.usr) There is one exception – LID – which has two additional directories containing pre-built languageprint archives (*.lpa) and language packs: lprints and models. The schemes below show examples of directories for GID (Gender Identification), STT (Speech To Text) and LID (Language Identification): – GID and LID…

LID: Terminology and adaptation

…with linear coding 16bit/8bit, sampling rate 8kHz+ Wide variety of speakers (50+) of various age and gender is required, to ensure rich variety of “language sounds” Only single language in the dataset NOTE: mixing in a different language negatively affects the resulting recognition accuracy Audio length: ideally between 1 and 5 minutes of speech signal NOTE: it is not possible…

Understand SPE technologies configuration file

…Diarization GID Gender Identification KWS Keyword Spotting KWS_STREAM Keyword Spotting Stream LIDC Language Identification Languageprint Comparator LIDE Language Identification Languageprint Extractor PHNREC Phoneme Recognition SID4C Speaker Identification 4 Voiceprint Comparator SID4C_STREAM Speaker Identification 4 Voiceprint Stream Comparator SID4CALIB Speaker Identification 4 VoicePrint Calibration SID4E Speaker Identification 4 Voiceprint Extractor SID4E_STREAM Speaker Identification 4 Voiceprint Stream Extractor SQE Speech Quality Estimation…

SID: Speaker Identification: Results Enhancement

Speaker Identification (SID) Results Enhancement is a process that adjusts the score threshold for detecting/rejecting speakers by removing the effect of speech length and audio quality. This is achieved by using Audio Source Profiles, which represent as closely as possible the source of the speech recording (device, acoustic channel, distance from microphone, language, gender, etc.). Although the out-of-the-box system…

SPE and Browser installation: standalone SPE

…line type: phxadmin.exe /configure-tech SPE on Linux Open the Terminal window in /SPE/ directory Type in the terminal: ./phxadmin --configure-tech This will open the list of technologies (and language models) available for you to choose from 1) Age Estimation [disabled] 2) Denoiser Technology [disabled] 3) Diarization [disabled] 4) Gender Identification [disabled] 5) Keyword Spotting [disabled] 6) Phoneme Recognition [disabled] 7)…

Release Notes

…Other technologies New Gender Identification (GID) model XL5 (since 3.56.0) This enables GID to use voiceprints created by the brand new Speaker Identification 4 model XL5 New Age Estimation (AGE) models XL4 and XL5 (since 3.57.0) This enables AGE to use voiceprints created by the Speaker Identification 4 model XL4 and XL5 New Voice Activity Detection (VAD) model SID4_XL5 (since…