Search: diar

21 results

Speaker Diarization (DIAR)

Speaker Diarization labels segments of the same voice(s) in one mono-channel audio recording based on the individual speaker's voice. It is a language-, domain- and channel-independent technology. It segments not only speakers but also technical signals and silence. The outputs of the technology can be both log files with labels and/or split audio files/one new…

Releases and Changelogs (SPE)

…via Webhook Fixed: Some log messages contain format placeholder instead of numbers Fixed: Missing <silence/> label in STT confusion network output Fixed: STT confusion network contains <silence/> tags with confidence greater than 1.0 Fixed: Diarization crashes during processing Fixed: Diarization XL4 crashes on file with no speech Fixed: SID voiceprint extraction on stream is affected by previous run Fixed: Incorrect…

Releases and Changelogs (Browser)

…path to temporary directory contains certain accented characters Fixed: Licensing errors not visible before exiting application Phonexia Browser 3.18.0, BSAPI 3.22.0 (2019-10-03) New: Waveform editor can now process stereo file by Diarization in per-channel mode New: Added Gender balance and Score sharpness in Settings -> Scoring New: Multiple columns in Result pane can be turned on/off at once using context…

Phonexia technology models EoL

…2020-10 6th gen. DIAR 5th gen. DIAR L1 (Beta) 2015-08 N/A On project basis S1 (Beta) 2014-10 N/A On project basis O1 (Beta) 2014-10 N/A On project basis DENOISER EN_US1 (Beta) 2015-08 N/A On project basis CS_CZ1 (Beta) 2018-03 N/A On project basis Deprecated tech. models (not supported, after end-of-life). Technology Tech. model name Release Date End of Support GID…

Release Notes

…is a completely new model in STT/KWS family. STT word accuracy (WAcc) is up to 93.1 %. Speech Engine: Other technologies Speaker Diarization (DIAR XL4) – We solved the issue of an extreme (exponential) processing time increase on long recordings (the processing time increase is now linear). Automatic configuration of some Speech Engine (SPE) settings values – SPE is now…

Video – Filtering and supporting technologies

MODULE 2: Filtering and supporting technologies (22 min) Common generic rules for CLI, REST and GUI Filtering, sorting, pre-/post-processing overview Speech Quality Estimation (SQE) in CLI, REST and GUI Voice Activity Detection (VAD) in CLI, REST and GUI Diarization (DIAR) in CLI, REST and GUI Age Estimation (AGE) in CLI, REST and GUI Denoiser (DENOISER) in CLI, REST and GUI…

SPE and Browser installation: standalone SPE

…line type: phxadmin.exe /configure-tech SPE on Linux Open the Terminal window in /SPE/ directory Type in the terminal: ./phxadmin –configure-tech This will open the list of technologies (and language models) available for you to choose from 1) Age Estimation [disabled] 2) Denoiser Technology [disabled] 3) Diarization [disabled] 4) Gender Identification [disabled] 5) Keyword Spotting [disabled] 6) Phoneme Recognition [disabled] 7)…

Phonexia Speech Engine

…✓ Voice Activity Detection (VAD) ✓ ✓ Time Analysis Extraction (TAE) ✓ ✓ Speech Quality Estimation (SQE) ✓ ✓ Language Identification (LID) ✓ Gender Identification (GID) ✓ Age Estimation (AGE) ✓ Speaker Diarization (DIAR) ✓ Results caching Processing results can be optionally stored in results cache database to speed up eventual re-processing of the same recordings by the same technology…

Download Speech Platform

…XL5 Diarization (DIAR) – model XL4 Language Identification (LID) – model L4 Gender Identification (GID) – model XL5 Age Estimation (AGE) – model XL5 Voice Activity Detection (VAD) – model GENERIC_3 and SID4_XL5 Speech Quality Estimation (SQE) Time Analysis Extraction (TAE) Waveform Denoiser (DENOISER) Phonexia Browser example audio (in ./BROWSER/example/ and ./SPE/bsapi/{technology}/example/) Step #2 – First start To get…

Support Lifecycle Policy (PSP)

…AGE 5th gen. AGE XL3 (XL1) 2016-09 N/A 4th gen. AGE L3 2015-07 N/A 4th gen. AGE VAD GENERIC_3 2021-10 5th gen. VAD 4th gen. VAD GENERIC / DEFAULT N/A N/A 3rd gen. VAD TANALYSIS GENERIC / DEFAULT N/A N/A N/A SQE GENERIC / DEFAULT N/A N/A N/A DIAR XL4 2020-10 6th gen. DIAR 5th gen. DIAR L1 (Beta) 2015-08…

Phonexia technologies introduction

…and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender Identification (GID) Speech Analytics technologies 11:43 Speech Transcription (STT) 12:30 Keyword Spotting (KWS) 13:32 Phoneme Recognition (PHNREC) 13:54 Time Analysis…

Understand SPE database

…rest_model_kws KWS keyword lists – keyword list JSON data, keyword list name, owner (SPE user), technology model to which the keyword list belongs Processing results data Tables containing cached processing results (if results caching is enabled): rest_result_age AGE processing results – file, used technology model, results JSON data rest_result_diar DIAR processing results – file, used technology model, used processing parameters,…

Key Features (VIN)

…Speaker Identification, Speaker Diarization, Phoneme Recognizer, Voice Activity Detection, Speech Quality Estimation A search for repetitive sound patterns across all audio recordings, enabled by automatic phonemic transcription Input: Questioned recordings (a minimum of 1 recording) Suspected speaker recordings (a minimum of 1 recording) The Population set (a technical minimum of 10 speakers, and a recommended minimum of 50…

Download Voice Inspector 5.2

…models VIN application (graphical user interface, GUI) with the following technologies built in Speaker Identification (SID4_XL5) Speaker Diarization (DIAR) Voice Activity Detection (VAD) Speech Quality Estimator (SQE) Phoneme Recogniser (PHNREC) example population sets and audio (in ./examples/) and example report templates (in ./templates/) Hardware requirements minimum – CPU: Intel® Core™ i5, RAM: 4 GB, Required HDD space: 0.5 GB for software…

Key Features (PSP)

…The Speech Platform includes the following technologies. Technologies are available in the Speech Engine component based on its particular configuration (Voice Biometrics, Transcription System, etc.) Speaker Identification (SID) – recognizes a speaker automatically based on their voice, Speaker Diarization (DIAR) – separates multiple speakers in mono audio automatically, Language Identification (LID) – detects the language or dialect spoken in a…