Search: diarization

19 results

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY…

Download Speech Platform

…XL5 Diarization (DIAR) – model XL4 Language Identification (LID) – model L4 Gender Identification (GID) – model XL5 Age Estimation (AGE) – model XL5 Voice Activity Detection (VAD) – model GENERIC_3 and SID4_XL5 Speech Quality Estimation (SQE) Time Analysis Extraction (TAE) Waveform Denoiser (DENOISER) Phonexia Browser example audio (in ./BROWSER/example/ and ./SPE/bsapi/{technology}/example/) Step #2 – First start To get…

Releases and Changelogs (VIN)

…to make it clearer Voice Inspector 4.0.2, BSAPI 3.35.5 (2021-03-25) Fixed: One more issue in the detection of certain USB license tokens Voice Inspector 4.0.1, BSAPI 3.35.3 (2020-11-25) Fixed: Detection of certain USB license tokens Voice Inspector 4.0.0, BSAPI 3.23.0 (2019-12-11) New: Speaker Identification L4 model New: Added Diarization Technology (available in waveform editor) New: Added possibility to set the…

Download Voice Inspector 5.1

…models VIN application (graphical user interface, GUI) with the following technologies built in Speaker Identification (SID4_XL5) Speaker Diarization (DIAR) Voice Activity Detection (VAD) Speech Quality Estimator (SQE) Phoneme Recognizer (PHNREC) example population sets and audio (in ./examples/) and example report templates (in ./templates/) Hardware requirements minimum – CPU: Intel® Core™ i5, RAM: 4 GB, Required HDD space: 0.5 GB for software…

Speaker change on enrollments

Speaker change on enrollments is a feature based on Phonexia’s own diarization technology. It allows customers to be notified about speaker changes during enrollment, which helps prevent situations where the enrollment is bad or compromised. Please note that this feature is currently available to single-server deployments only. To enable it, call endpoint /api/v2/maintenance/technologies in maintenance mode with the…

Phonexia Speech Engine

…✓ Voice Activity Detection (VAD) ✓ ✓ Time Analysis Extraction (TAE) ✓ ✓ Speech Quality Estimation (SQE) ✓ ✓ Language Identification (LID) ✓ Gender Identification (GID) ✓ Age Estimation (AGE) ✓ Speaker Diarization (DIAR) ✓ Results caching Processing results can be optionally stored in results cache database to speed up eventual re-processing of the same recordings by the same technology…

Video – Speech Analytics technologies

MODULE 4: Speech Analytics technologies (23 min) Common generic rules for CLI, REST and GUI Speech To Text (STT) in CLI, REST and GUI Keyword Spotting (KWS) in CLI, REST and GUI Phoneme Recognizer (PHNREC) in CLI, REST and GUI Time Analysis Extraction (TAE) in CLI, REST and GUI Summary https://www.youtube.com/watch?v=-FAoRywqv7U…

Video – Voice Biometrics technologies

MODULE 3: Voice Biometrics technologies (23 min) Common generic rules for CLI, REST and GUI Speaker Identification (SID) in CLI, REST and GUI Language Identification (LID) in CLI, REST and GUI Gender Identification (GID) in CLI, REST and GUI Summary https://www.youtube.com/watch?v=AyEoPfYVel8…

Phonexia technologies introduction

…and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender Identification (GID) Speech Analytics technologies 11:43 Speech Transcription (STT) 12:30 Keyword Spotting (KWS) 13:32 Phoneme Recognition (PHNREC) 13:54 Time Analysis…

Key Features (PSP)

…The Speech Platform includes the following technologies. Technologies are available in the Speech Engine component based on its particular configuration (Voice Biometrics, Transcription System, etc.) Speaker Identification (SID) – recognizes a speaker automatically based on their voice, Speaker Diarization (DIAR) – separates multiple speakers in mono audio automatically, Language Identification (LID) – detects the language or dialect spoken in a…

Release Notes

…is a completely new model in STT/KWS family. STT word accuracy (WAcc) is up to 93.1 %. Speech Engine: Other technologies Speaker Diarization (DIAR XL4) – We solved the issue of an extreme (exponential) processing time increase on long recordings (the processing time increase is now linear). Automatic configuration of some Speech Engine (SPE) settings values – SPE is now…

Understand SPE technologies configuration file

Diarization GID Gender Identification KWS Keyword Spotting KWS_STREAM Keyword Spotting Stream LIDC Language Identification Languageprint Comparator LIDE Language Identification Languageprint Extractor PHNREC Phoneme Recognition SID4C Speaker Identification 4 Voiceprint Comparator SID4C_STREAM Speaker Identification 4 Voiceprint Stream Comparator SID4CALIB Speaker Identification 4 VoicePrint Calibration SID4E Speaker Identification 4 Voiceprint Extractor SID4E_STREAM Speaker Identification 4 Voiceprint Stream Extractor SQE Speech Quality Estimation…

Video – Filtering and supporting technologies

MODULE 2: Filtering and supporting technologies (22 min) Common generic rules for CLI, REST and GUI Filtering, sorting, pre-/post-processing overview Speech Quality Estimation (SQE) in CLI, REST and GUI Voice Activity Detection (VAD) in CLI, REST and GUI Diarization (DIAR) in CLI, REST and GUI Age Estimation (AGE) in CLI, REST and GUI Denoiser (DENOISER) in CLI, REST and GUI…

Key Features (VIN)

…Speaker Identification, Speaker Diarization, Phoneme Recognizer, Voice Activity Detection, Speech Quality Estimation A search for repetitive sound patterns across all recordings in audio due to the automatic phonemic transcription Input: Questioned recordings (a minimum of 1 recording) Suspected speaker recordings (a minimum of 1 recording) The Population set (a technical minimum of 10 speakers, and a recommended minimum of 50…

SPE and Browser installation: standalone SPE

…line type: phxadmin.exe /configure-tech SPE on Linux Open the Terminal window in the /SPE/ directory Type in the terminal: ./phxadmin –configure-tech This will open the list of technologies (and language models) available for you to choose from 1) Age Estimation [disabled] 2) Denoiser Technology [disabled] 3) Diarization [disabled] 4) Gender Identification [disabled] 5) Keyword Spotting [disabled] 6) Phoneme Recognition [disabled] 7)…