Skip to content Skip to main navigation Skip to footer

Search: speaker%20identification

43 results

Speaker Identification (SID)

Phonexia Speaker Identification uses the power of voice biometry to recognize speakers by their voice… i.e. to decide whether the voice in two recordings belongs to the same person or two different people. Our goal as a regular participant of the NIST Speaker Recognition Evaluations (SRE) series is to contribute to the direction of research efforts and the calibration of…

Releases and Changelogs (SPE)

…/server/info [#5135] Added groups which speaker model is member of to resource /technologies/speakerid/speakermodels/{name} [#5133] Login of a user can contain any characters except these: /:*?”<>| [#5150] Fixed connection to MySQL database may be lost in case of hight load [#5191] Fixed SID Stream requires calibration technology even if parameter ‘calibset’ was not specified [#5203] Fixed premature access to the result…

Understand SPE database

…by SPE users: rest_model_sid list of SID speaker models – name, owner (SPE user), modification timestamp rest_model_sid_sources list of files used as sources for SID speaker models creation rest_model_sid_metafiles list of files used as SID speaker models metafiles rest_group_sid list of SID speaker groups – name, owner (SPE user) rest_group_sid_models associations between SID speaker groups and speaker models rest_voiceprint SID…

Releases and Changelogs (VIN)

…for Expert and Organization Voice Inspector 5.0 Voice Inspector 5.0.0, BSAPI 3.57.0 (2023-06-20) New: Speaker Identification XL5 technology model New: Data in lists/tables are now sorted alphabetically New: Enlarge the initial set of speakers included in examples; some of the speakers are multilingual ❗❗❗ Voice Inspector 5.0 requires a new license. To upgrade from version 4 or 3, please contact…

SID: Speaker Identification: Results Enhancement

…– recordings from different speakers representing the source data, minimum 60 seconds net speech in each. The set must not contain duplicates or target speaker recordings. With FAR Calibration, the system is calibrated to a specific False Acceptance Rate (e.g., FAR = 1%) for each reference voiceprint (speaker model). Only one side (the enroll) is calibrated, using data representing the…

SID: TUTORIAL: Speaker Identification – How to Do a Basic Test

Phonexia Speaker Identification is a voice biometry tool for recognition of speakers by their voice. In this video, we will show you how to start using this technology! You will learn how to create a “Speaker Model” to identify a speaker in a set of data. Ready to test it? Start with our video: What else is needed? 1. Phonexia…

Understand SPE technologies configuration file

…Diarization GID Gender Identification KWS Keyword Spotting KWS_STREAM Keyword Spotting Stream LIDC Language Identification Languageprint Comparator LIDE Language Identification Languageprint Extractor PHNREC Phoneme Recognition SID4C Speaker Identification 4 Voiceprint Comparator SID4C_STREAM Speaker Identification 4 Voiceprint Stream Comparator SID4CALIB Speaker Identification 4 VoicePrint Calibration SID4E Speaker Identification 4 Voiceprint Extractor SID4E_STREAM Speaker Identification 4 Voiceprint Stream Extractor SQE Speech Quality Estimation…

Releases and Changelogs (Browser)

…score in LID and SID result information widgets [G#41] Calibrated speaker VP status indication in speaker view [G#43] Prevent closing graph in SID evaluator by Alt+F4 Phonexia Browser v3.11.0, BSAPI 3.15.0 – Mar 15 2018 [G#29] Fixed unable to open short recordings in waveform editor [G#5] Fixed wrong user information causes the browser to show loading indefinitely [G#30] Fixed playback…

Release Notes

…which can be edited by users. Speech Engine: Speaker Identification (SID4) New “floating window” feature for realtime stream processing (since 3.60.0) This new floating_window parameter allows to identify speaker or extract voiceprint from only last X seconds (default 5) of speech in the realtime stream… as opposed to using speech from entire stream audio without using this parameter. Speech Engine:…

Speaker Diarization (DIAR)

Speaker Diarization labels segments of the same voice(s) in one mono-channel audio record based by the individual speaker´s voice. It is a language-, domain- and channel-independent technology. It performs not only the segmentation of speakers but of technical signals and silence as well. The outputs of the technology can be both log files with labels and/or split audio files/one new…

SPE and Browser installation: standalone SPE

…Keyword Spotting Stream [disabled] 8) Language Identification LanguagePrint Comparator [disabled] 9) Language Identification LanguagePrint Extractor [disabled] 10) Speaker Identification 4 VoicePrint Extractor [disabled] 11) Speaker Identification 4 VoicePrint Comparator [disabled] 12) Speaker Identification 4 VoicePrint Calibration [disabled] 13) Speaker Identification 4 VoicePrint Stream Extractor [disabled] 14) Speaker Identification 4 VoicePrint Stream Comparator [disabled] 15) Speech Quality Estimation [disabled] 16) Speech…

Key Features (VIN)

Speaker Identification, Speaker Diarization, Phoneme Recognizer, Voice Activity Detection, Speech Quality Estimation A search for repetitive sound patterns across all recordings in audio due to the automatic phonemic transcription Input: Questioned recordings (a minimum of 1 recording) Suspected speaker recordings (a minimum of 1 recording) The Population set (a technical minimum of 10 speakers, and a recommended minimum of 50…

FAQs (PSP)

…performance precisely, it’s important to prepare evaluation recordings set very carefully. The requirements are: 50+ known speakers, 200+ recordings in total (i.e. 3 to 5 recordings per speaker*) 1+ minute of net speech in each recording (i.e. usually 2+ minutes recording length) only one speaker in each recording wide variety of gender and age is recommended recordings should be as…

Time Analysis Extraction (TAE)

…to the other: places of speaker’s longest and shortest reaction, i.e., where this speaker stopped talking and the other speaker started talking the average reaction times number of speaker-turns in the particular direction and details about crosstalk, for example where the other speaker is talking “over” this speaker Segmentation This section is optional and need to be explicitly turned on….