Search: voiceprint

16 results

Releases and Changelogs (SPE)

…GID and AGE technologies accept also SID voiceprint as an input [#60] Getting voiceprints for all speaker models for given speaker group [#23] Minimum speech length for extracting SID calibration voiceprint is 60s for newly created calibration sets [#83] Lower case keyword cause error with some models (cs_CZ) [BSAPI] Added a new STT and KWS PL_PL (Polish) model version 5.0.0…

Speaker Identification (SID)

…existing enrollment voiceprints and the system returns a score for each comparison. The score is produced by comparing two voiceprints using Probabilistic Linear Discriminant Analysis (PLDA). Scoring and conversion to percentage Score produced by comparing two voiceprints is an estimate of the probability (P), that we get the given evidence (the compared voiceprints) if the speakers in the two voiceprints…

Understand SPE database

…voiceprints – voiceprint data, technology model used to create the voiceprint, speaker model to which the voiceprint belongs (speaker model voiceprints), calibration set to which the voiceprint belongs (FAR calibration set voiceprints) rest_model_sid_calib_voiceprint SID speaker model voiceprints calibrated to FAR – voiceprint data, speaker model, technology model used to create the voiceprint, max. FAR, calibration set used to calibrate the…

SID: Speaker Identification: Results Enhancement

…normalization. The Audio Source Profile includes: a set of calibration voiceprints extracted from the source files NOTE: these voiceprints are not compatible with standard voiceprints created using the /technologies/speakerid4/extractvp REST API endpoint or the vpextract4 command-line tool! information about profile version, creation date and version of voiceprints hash of all data in the profile and hash of its parent profile…

Understand SPE technologies configuration file

…Diarization GID Gender Identification KWS Keyword Spotting KWS_STREAM Keyword Spotting Stream LIDC Language Identification Languageprint Comparator LIDE Language Identification Languageprint Extractor PHNREC Phoneme Recognition SID4C Speaker Identification 4 Voiceprint Comparator SID4C_STREAM Speaker Identification 4 Voiceprint Stream Comparator SID4CALIB Speaker Identification 4 VoicePrint Calibration SID4E Speaker Identification 4 Voiceprint Extractor SID4E_STREAM Speaker Identification 4 Voiceprint Stream Extractor SQE Speech Quality Estimation…

SPE and Browser installation: standalone SPE

…Keyword Spotting Stream [disabled] 8) Language Identification LanguagePrint Comparator [disabled] 9) Language Identification LanguagePrint Extractor [disabled] 10) Speaker Identification 4 VoicePrint Extractor [disabled] 11) Speaker Identification 4 VoicePrint Comparator [disabled] 12) Speaker Identification 4 VoicePrint Calibration [disabled] 13) Speaker Identification 4 VoicePrint Stream Extractor [disabled] 14) Speaker Identification 4 VoicePrint Stream Comparator [disabled] 15) Speech Quality Estimation [disabled] 16) Speech…

Release Notes

…Other technologies New Gender Identification (GID) model XL5 (since 3.56.0) This enables GID to use voiceprints created by the brand new Speaker Identification 4 model XL5 New Age Estimation (AGE) models XL4 and XL5 (since 3.57.0) This enables AGE to use voiceprints created by the Speaker Identification 4 model XL4 and XL5 New Voice Activity Detection (VAD) model SID4_XL5 (since…

Age Estimation (AGE)

…coding), A-law or Mu-law, PCM, 8kHz+ sampling Voiceprints: AGE L4 model supports SID4 L4 voiceprints; legacy AGE models support voiceprints created by AGE itself Output Log file with processed information (age estimate) Processing speed Approx. 20x faster than real-time processing on 1 CPU core i.e. standard 8 CPU core server processes 3,840 hours of audio in 1 day of computing…

Designing and Developing Application

…specific hardware (mainly CPU, virtualized infrastructure vs. HW) or are you going to buy specific HW for customer? What is short/long time storage requirements (ie. audio and results availability, desktop vs. distributed system)? Is there any synchronization required (ie. voiceprint database to clients)? What is the topology of the solution/app (ie. where to store audio, voiceprints, results, …)? How to…

Understand SPE technologies, instances and workers

…(or bank branch): Post office is a place providing different kinds of services – one can go there to send letters, send or pick up packages, get a POBox, get some financial services, insurance, etc.). Speech Engine has various speech technologies configured – one can analyze the audio quality, extract voiceprints from recordings, compare voiceprints, transcribe audio to text, etc….

Key Features (VIN)

Phonexia Voice Inspector software offers several features that strongly support the work of voice forensic experts: A standalone application with a complete easy-to-use Graphical User Interface (GUI) Automatic comparison of questioned recording (unknown speaker recording or voiceprint) against a suspected reference speaker (group of recordings or voiceprints) with a known speaker i.e. 1:1 identification and 1:N identification. Implemented speech technologies:…

Releases and Changelogs (Browser)

…v3.12.0, BSAPI 3.16.0 – Aug 17 2018 [#56] Added support for Denoiser technology * process recording from context menu in test table [#53] Fixed error “Authorisation failed” during starting embedded SPE server [#52] Fixed wrong displaying of time of very long length audio files [#3] Changed minimum speech length for calibrated voiceprint to 60 s [#51] Fixed application crash when…

Releases and Changelogs (VIN)

…to ‘affricate’ phoneme ‘D’ changed from ‘fricative to ‘plosive’ phoneme ‘T’ changed from ‘fricative to ‘plosive’ phoneme ‘c’ changed from ‘plosive’ to ‘affricate’ Voice Inspector v3.2.1, BSAPI 3.15.0 (2018-03-16) Export of Speakers/Populations allows export only voiceprints Wave editor’s Spectrum settings allow to set up smaller values for Window length Added generic label panel in Wave editor The new version of…

LID: Terminology and adaptation

This article describes various ways of Language Identification adaptation. Basic terminology Languageprint (*.lp file) – numeric representation of the audio, extracted from audio file for language identification purpose of (similar to “voiceprint”, but representing sound of the spoken language, not sound of the speaking person) Languageprint archive (*.lpa file) – multiple languageprints combined into single archive Languageprint archives come pre-created…

Measuring of a software processing speed – what is the FtRT (Faster than Real Time)

…in our example is 36 seconds. After stripping silence, it gets 14 seconds – this means that original audio contains 38% of net speech and 62% of silence. Phonexia speech technologies analyze the entire recording, but pick only the speech segments for AI processing, i.e. the absolute processing time will be practically the same… Creating voiceprint by Speaker Identification took:…