Skip to content Skip to main navigation Skip to footer

Search: speaker identification

27 results

Speaker Identification (SID)

Phonexia Speaker Identification uses the power of voice biometry to recognize speakers by their voice… i.e. to decide whether the voice in two recordings belongs to the same person or two different people. Our goal as a regular participant of the NIST Speaker Recognition Evaluations (SRE) series is to contribute to the direction of research efforts and the calibration of…

SID: Speaker Identification: Results Enhancement

Speaker Identification (SID) Results Enhancement is a process that adjusts the score threshold for detecting/rejecting speakers by removing the effect of speech length and audio quality. This is achieved by use of Audio Source Profiles, that represent as closely as possible the source of the speech recording (device, acoustic channel, distance from microphone, language, gender, etc.). Although the out-of-the-box system…

SID: TUTORIAL: Speaker Identification – How to Do a Basic Test

Phonexia Speaker Identification is a voice biometry tool for recognition of speakers by their voice. In this video, we will show you how to start using this technology! You will learn how to create a “Speaker Model” to identify a speaker in a set of data. Ready to test it? Start with our video: What else is needed? 1. Phonexia…

Understand SPE technologies configuration file

…Diarization GID Gender Identification KWS Keyword Spotting KWS_STREAM Keyword Spotting Stream LIDC Language Identification Languageprint Comparator LIDE Language Identification Languageprint Extractor PHNREC Phoneme Recognition SID4C Speaker Identification 4 Voiceprint Comparator SID4C_STREAM Speaker Identification 4 Voiceprint Stream Comparator SID4CALIB Speaker Identification 4 VoicePrint Calibration SID4E Speaker Identification 4 Voiceprint Extractor SID4E_STREAM Speaker Identification 4 Voiceprint Stream Extractor SQE Speech Quality Estimation…

SPE and Browser installation: standalone SPE

…Keyword Spotting Stream [disabled] 8) Language Identification LanguagePrint Comparator [disabled] 9) Language Identification LanguagePrint Extractor [disabled] 10) Speaker Identification 4 VoicePrint Extractor [disabled] 11) Speaker Identification 4 VoicePrint Comparator [disabled] 12) Speaker Identification 4 VoicePrint Calibration [disabled] 13) Speaker Identification 4 VoicePrint Stream Extractor [disabled] 14) Speaker Identification 4 VoicePrint Stream Comparator [disabled] 15) Speech Quality Estimation [disabled] 16) Speech…

Releases and Changelogs (VIN)

…the Phonexia sales representative. Phonexia Voice Inspector 5.0 brings a Speaker Identification model XL5, that provides more accurate results for telephony data in comparison with previous generations of Speaker Identification models such as SID4 XL4. Users can observe that the SID4 XL5 model returns different values of LLR scores which are used for evidence calculation. Therefore Speaker Identification score distribution…

Releases and Changelogs (SPE)

…Fixed: Different/incorrect output in STT for empty streams (all 6. gen models) Fixed: Time Analysis Extraction does not return correct total length of audio Fixed: Time Analysis Extraction returns crosstalks even for channel that is reported to not contain speech Deprecated: Legacy Speaker Identification technology, i.e. all enpdoints under /technologies/speakerid Speech Engine 3.60 (Public release) Speech Engine 3.60.1, DB v1901,…

Release Notes

…Other technologies New Gender Identification (GID) model XL5 (since 3.56.0) This enables GID to use voiceprints created by the brand new Speaker Identification 4 model XL5 New Age Estimation (AGE) models XL4 and XL5 (since 3.57.0) This enables AGE to use voiceprints created by the Speaker Identification 4 model XL4 and XL5 New Voice Activity Detection (VAD) model SID4_XL5 (since…

Key Features (VIN)

Phonexia Voice Inspector software offers several features that strongly support the work of voice forensic experts: A standalone application with a complete easy-to-use Graphical User Interface (GUI) Automatic comparison of questioned recording (unknown speaker recording or voiceprint) against a suspected reference speaker (group of recordings or voiceprints) with a known speaker i.e. 1:1 identification and 1:N identification. Implemented speech technologies:…

Key Features (PSP)

…The Speech Platform includes the following technologies. Technologies are available in the Speech Engine component based on its particular configuration (Voice Biometrics, Transcription System, etc.) Speaker Identification (SID) – recognizes a speaker automatically based on their voice, Speaker Diarization (DIAR) – separates multiple speakers in mono audio automatically, Language Identification (LID) – detects the language or dialect spoken in a…

Releases and Changelogs (Browser)

…in wrong order Fixed: Browser doesn’t allow to set user calibration values below Phonexia Browser 3.35.2, BSAPI 3.35.2 (2020-10-21) Fixed: Speaker identification dialog in WaveEditor which did not work for SID4 Fixed: detection of certain USB license tokens Phonexia Browser 3.35.0, BSAPI 3.35.0 (2020-10-02) New: Compatibility with SPE 3.35 Phonexia Browser 3.31 Phonexia Browser 3.31.3, BSAPI 3.30.11 (2020-08-20) Fixed: Transcription…

Q: What are the requirements for SID evaluation dataset?

For evaluating the real life scenario of Phonexia Speaker Identification technology, the system needs to be calibrated by SID dataset. SID dataset (minimum requirements): To measure SID performance precisely, it’s important to prepare evaluation recordings set very carefully. The requirements are: 50+ known speakers, 200+ recordings in total (i.e. 3 to 5 recordings per speaker*) 1+ minute of net speech…

Phonexia Speech Engine

…✓ Voice Activity Detection (VAD) ✓ ✓ Time Analysis Extraction (TAE) ✓ ✓ Speech Quality Estimation (SQE) ✓ ✓ Language Identification (LID) ✓ Gender Identification (GID) ✓ Age Estimation (AGE) ✓ Speaker Diarization (DIAR) ✓ Results caching Processing results can be optionally stored in results cache database to speed up eventual re-processing of the same recordings by the same technology…

Gender Identification (GID)

Gender Identification is a language-, domain- and channel-independent technology that uses the acoustic characteristics of the recording to determine the gender of the speaker in question. This technology is able to distinguish between two genders: Male (M) and Female (F). Minimum of speech signal for identification: 7+ sec recommended (with XL4 and L4 model (9+ sec for previous generation of…