Search Results for: ROM

Results 1 - 10 of 65 Page 1 of 7
Results per-page: 10 | 20 | 50 | 100

SPE3 – Releases and Changelogs

     Posted on: 2019-08-13

Speech Engine (SPE) is developed as RESTfull API on top of Phonexia BSAPI. SPE was formerly known as BSAPI-rest (up to v2.x) or as Phonexia Server (up to v3.2.x). This page lists changes in SPE releases. Releases Changelogs == SPE v3.17.x == Speech Engine 3.17.2 (08/02/2019) - DB v1200, BSAPI 3.21.2 [G_BSAPI#300] Fixed: KWS stream results are displayed with a delay Speech Engine 3.17.1 (07/22/2019) - DB v1200, BSAPI 3.21.1 Added 5th generation of ES_ES of STT/Dictate/KWS/PHNREC NOTE: STT output format has changed in 5th generation: _DELETE_ token was changed to <null/> _SILENCE_ and <sil/> tokens were changed to <silence/> <s>…

Phonexia Workflow

     Posted on: 2019-08-06

About Phonexia Workflow combines Phonexia technologies into scenarios, which can be easily configured and deployed. Phonexia Workflow uses Phonexia Speech Engine internally. Provided Phonexia Workflow scenarios: SalEssentials - Speech Analytics Essentials filter out low quality audio files, provides demographic information, age estimation and speech to text processing. VbsEssentials - Voice Biometrics Essentials filter out low quality audio files, provides gender identification, age estimation and speaker identification. Our team can help you implementing your custom scenario. The scenario is a tiny Java application which interacts with Phonexia technologies and optionally can use your service or database. First steps Installation Go through…

Workflow – Releases and Changelogs

     Posted on: 2019-08-06

Phonexia Workflow combines Phonexia technologies into scenarios. It uses Phonexia Speech Engine internally. This page lists changes in Workflow releases. Releases n/a Changelogs == Phonexia Workflow v1 == Phonexia Workflow 1.4.0 - SPE 3.16 - 3.17 Support for SID4. Rapid filtering component enhanced by more options on channel selection Phonexia Workflow 1.3.0 - SPE 3.13 - 3.17 Internal stuff. Phonexia Workflow 1.2.0 - SPE 3.13 - 3.14 * Scenarios support more repositories. * parameter `repositoryType` renamed to `repositoryTypes` * e.g.: repositoryTypes: [file, memory] * Scenarios no longer support parameter `repositoryFormat`. Phonexia Workflow 1.1.0 - SPE 3.12 new type of Phonexia…

Browser3 – Releases and Changelogs

     Posted on: 2019-07-03

Phonexia Browser v3 (Browser3) is developed as client on top of Phonexia Speech Engine v3. Phonexia Browser is a successor of Phonexia Speech Intelligence Resolver v1 (SIR1). This page lists changes in Browser releases. Releases Changelogs Phonexia Browser v3.17.0, BSAPI 3.21.0 - Jul 01 2019 [G#106] Added possibility to activate/deactivate created filter rules [G#125] Running Browser in "embedded SPE" mode now creates SPE log file (phxspe.browser.log located in SPE log directory) Phonexia Browser v3.16.1, BSAPI 3.20.1 - May 17 2019 [G#112] Fixed Denoiser which created duplicate recordings under specific circumstances [G#127] Fixed comparison of SID Evaluation sets using Audio Source…

Voice Inspector – supporting technologies

     Posted on: 2019-06-28

Automatic Speaker Identification (SID) is the most important but not the only Phonexia technology that is implemented in Voice Inspector (VIN). Apart from SID, forensic experts, users of VIN, can benefit from automatic Signal-to-Noise Ratio calculation, Voice Activity detection, Phoneme search, and a Wave editor which incorporates the waveform, spectrum and power panel. Let's have a look on how to utilize individual technologies. Signal-to-Noise Ratio Recording quality can strongly influence the reliability of SID results and so the outcome of a forensic case. Therefore, VIN uses a module of Phonexia Speech Quality Estimation (SQE) to calculate the Signal-to-Noise Ratio (SNR)…

Voice Inspector – Interpretation of results

     Posted on: 2019-06-24

Introduction Phonexia Voice Inspector (VIN) is a tool for forensic automatic speaker identification, compliant with the Methodological Guidelines for Best Practice in Forensic Semiautomatic and Automatic Speaker Recognition, published by the European Network of Forensic Science Institutes.  This post explains individual SID score types and ways to visualize the results in a speaker identification case implemented in Voice Inspector. Evidence In VIN, the term evidence has two meanings. In general, it refers to any SID score that the system calculates for any pair of recordings in the case. These scores are the output of the Phonexia SID technology which runs…

Speaker Identification (SID)

     Posted on: 2019-06-13

Phonexia Speaker Identification uses the power of voice biometry to recognize speakers by their voice... i.e. to decide whether the voice in two recordings belongs to the same person or two different people. High accuracy of Speaker Identification, the Phonexia's flagship technology, has been validated in a NIST Speaker Recognition Evaluations. Basic use cases and application areas The technology can be used for various speaker recognition tasks. One basic distinction is based on the kind of question we want to answer. Speaker Identification is the case when we are asking "Whose voice is this?", such as in fake emergency calls.…

Keyword Spotting results explained

     Posted on: 2019-06-12

This article aims on giving more details about Keyword Spotting outputs and hints on how to tailor Keyword Spotting to suit best your needs. Scoring and results explanation Keyword Spotting works by calculating likelihoods that at a given spot occurs a keyword or just any other speech, and comparing those two likelihoods. The following scheme shows Background model for anything before the keyword (1), the Keyword model (2) and a Background model of any speech parallel with the keyword model (3). Models 2 and 3 produce two likelihoods – Lkw and Lbg (any speech = background). Raw score is calculated…

Keyword Spotting

     Posted on: 2019-06-03

Phonexia Keyword Spotting (KWS) identifies occurrences of keywords and/or keyphrases in audio recordings. It can help you to get valuable information from huge quantities of speech recordings. You only need to specify the keywords or phrases you wish to find. This technology identifies all recordings with keyword occurrences and allows you to automatically route important recordings or calls to your experts. Typical use cases Call centers increase operator and supervisor efficiency by searching calls identify inappropriate expressions from operators check marketing campaigns with automatic script-compliance control Mass media and web search servers index and search multimedia by keyword route multimedia…

Speaker Identification: Results Enhancement

     Posted on: 2019-05-29

Speaker Identification (SID) Results Enhancement is a process that adjusts the score threshold for detecting/rejecting speakers by removing the effect of speech length and audio quality. This is achieved by use of Audio Source Profiles, that represent as closely as possible the source of the speech recording (device, acoustic channel, distance from microphone, language, gender, etc.). Although the out-of-the-box system is robust in such factors, several result enhancement procedures can provide even better results and stronger evidence. Audio Source Profile An Audio Source Profile is a representation of the speech source, e.g., device, acoustic channel, distance from microphone, language, gender,…