Search Results for: speaker

Results 21 - 40 of 43 Page 2 of 3
Results per-page: 10 | 20 | 50 | 100

Speech Quality Estimation

Relevance: 9%      Posted on: 2018-04-02

Speech Quality Estimation is a language-, domain- and channel-independent technology that serves to quantify the quality of an audio recording. 2 most important statistics that it bases its score on are SNR (Speech-to-noise ratio) and bitrate of the recording. SQE is usually part of rapid filtration process in deployment. SQE also measures over 20 other properties of the recording, all of which can be found in the output file and further processed. See description in SPE documentation. Typical use cases are: verification of recordings' quality on the input, searching based on quality of the recording, noise of environment or speaker's…

Phonexia Browser

Relevance: 9%      Posted on: 2017-05-18

About Phonexia Browser v3 (Browser v3) software that combines the power of speech technologies in a single desktop application. The application automatically  performs visualization of records as well as effective filtration of speech metadata uncovered from the user´s records. Speech technologies implemented: Speaker Identification (SID) Language Identification (LID) Gender identification (GID) Voice Activity Detection (VAD) Speaker Diarization (DIAR) Keyword Spotting (KWS, 10+ languages available) Speech Quality Estimator (SQE) Speech to Text (STT, 10+ languages available) Age Estimation (AGE) Browser v3 is a client application cooperating with Speech Engine v3 (SPE3). It is possible to use it as a client -…

DIAR

Relevance: 9%      Posted on: 2018-02-01

Phonexia Speaker Diarization

Voice Inspector

Relevance: 9%      Posted on: 2017-05-18

About Phonexia Voice Inspector v3 (VIN3) provides police forces and forensic experts with a highly accurate speaker identification tool during investigation of criminal matters. It uses the power of voice biometry to automatically recognize speakers by their voice. Main features of the VIN3 application: Automatic speaker identification tool to strengthen results of the standard phonetics-based approaches Scoring in likelihood ratio (LR) – Result from statistical test for two models comparison. It gives back number which expresses how many times more likely the data are under one model than the other. LnLR or LogLR meets numbers in interval <-∞;+∞>...), and verbal…

Speech To Text results explained

Relevance: 9%      Posted on: 2019-05-27

This article aims on giving more details about Speech To Text outputs and hints on how to tailor Speech To Text to suit best your needs. In the process of transcribing speech, the Speech To Text technology usually identifies multiple alternatives for individual speech segments, as multiple phrases can have similar pronunciations, possibly with different word boundaries, e.g. “eight tea machines” vs. “eighty machines”. The technology provides several types of output to show only one or more transcription alternatives. One-best output 1-best output provides transcription containing only the highest-scoring words. Each segment provides information about the transcribed word itself, the…

SID

Relevance: 9%      Posted on: 2018-02-01

Phonexia Speaker Identification, multiple generations available marked by version like SIDv2 or SIDv3

Software Vetting

Relevance: 9%      Posted on: 2018-04-06

The purpose of this document is to help client to satisfy their high security standards during integration of Phonexia software to their critical infrastructure. The vetting ensures that Phonexia software is not dangerous to the client’s infrastructure in any way. It means there are no backdoors, viruses, worms, Trojan horses, spyware, adware, critical bugs, unwanted functionality, no information is sent outside the client’s infrastructure. Vetting context Speech technology is a very dynamic area with a very fast development. For example the speaker identification error rate decreases to half between each two evaluations organized by National Institute of Standards and Technology,…

Speech Intelligence Resolver v1

Relevance: 9%      Posted on: 2017-05-18

About Phonexia Speech Intelligence Resolver v1 (SIR1) combines the power of speech technologies within a single application. The application automatically performs visualization of the record as well as filtering the speech metadata uncovered from your records effectively. Speech technologies implemented: Phonexia Speaker Identification (SID2) Phonexia Language Identification (LID2) Phonexia Gender identification (GID) Phonexia Voice Activity Detection (VAD) Phonexia Speaker Diarization (DIAR) Phonexia Keyword Spotting (KWS) Phonexia Speech Quality Estimator (SQE) Phonexia Speech Transcription (STT) SIR is a client application cooperating with REST servers. It can be used as a standalone application due to the integrated local REST server. It was…

Voice Inspector – Interpretation of results

Relevance: 9%      Posted on: 2019-06-24

Introduction Phonexia Voice Inspector (VIN) is a tool for forensic automatic speaker identification, compliant with the Methodological Guidelines for Best Practice in Forensic Semiautomatic and Automatic Speaker Recognition, published by the European Network of Forensic Science Institutes.  This post explains individual SID score types and ways to visualize the results in a speaker identification case implemented in Voice Inspector. Evidence In VIN, the term evidence has two meanings. In general, it refers to any SID score that the system calculates for any pair of recordings in the case. These scores are the output of the Phonexia SID technology which runs…

VP

Relevance: 9%      Posted on: 2018-02-01

Voice Print – output from spoken speech extraction process of SID. Unique mathematical representation of the specific speaker or recording is created in form of the iVector (for SID generation 3) or xVector (Deep Embeddings for SID generation 4).

Browser3 – Releases and Changelogs

Relevance: 9%      Posted on: 2019-10-09

Phonexia Browser v3 (Browser3) is developed as client on top of Phonexia Speech Engine v3. Phonexia Browser is a successor of Phonexia Speech Intelligence Resolver v1 (SIR1). This page lists changes in Browser releases. Releases Changelogs Phonexia Browser v3.18.0, BSAPI 3.22.0 - Oct 03 2019 New: Waveform editor can now process stereo file by Diarization in per-channel mode New: Added Gender balance and Score sharpness in Settings -> Scoring New: Multiple columns in Result pane can be turned on/off at once using context menu New: Minimum speech length changed to 7 seconds Fixed: LID results information chart is not updated…

Phonexia Voice Inspector v1

Relevance: 9%      Posted on: 2017-05-18

About Phonexia Voice Inspector v1 (VIN1) provides police forces and forensic experts with highly accurate speaker identification tools to be used during the investigation of criminal matters. It utilizes the power of voice biometry to automatically recognize the speaker by their voice. Main features of the VIN1 application: An automatic speaker identification tool to strengthen the results of the standard phonetic based approaches Scoring of the likelihood ratio (LR), log-likelihood ratio (LLR), and an option of a verbal presentation of the results Graphic presentation of the likelihood ratio (LR), probability density function and Tippett plot Generating detailed reports (expert opinion…

Voice Inspector – supporting technologies

Relevance: 9%      Posted on: 2019-06-28

Automatic Speaker Identification (SID) is the most important but not the only Phonexia technology that is implemented in Voice Inspector (VIN). Apart from SID, forensic experts, users of VIN, can benefit from automatic Signal-to-Noise Ratio calculation, Voice Activity detection, Phoneme search, and a Wave editor which incorporates the waveform, spectrum and power panel. Let's have a look on how to utilize individual technologies. Signal-to-Noise Ratio Recording quality can strongly influence the reliability of SID results and so the outcome of a forensic case. Therefore, VIN uses a module of Phonexia Speech Quality Estimation (SQE) to calculate the Signal-to-Noise Ratio (SNR)…

SPE configuration

Relevance: 9%      Posted on: 2018-02-02

Basic explanation of configuration directives for SPE with hints & tips. Overview of phxspe.properties for beginners.

VIN3 – Releases and Changelogs

Relevance: 9%      Posted on: 2018-04-08

Phonexia Voice Inspector v3 (VIN) is developed as a desktop application on top of Phonexia BSAPI. This page lists changes in VIN releases. Releases Changelogs Voice Inspector v3.2.2, BSAPI 3.15.0 - Jun 5 2018 - Fixed possible application crash on Windows - Added phoneme type 'affricate' and fixed phoneme types: * phoneme 'C' changed from 'fricative' to 'affricate' * phoneme 'D' changed from 'fricative to 'plosive' * phoneme 'T' changed from 'fricative to 'plosive' * phoneme 'c' changed from 'plosive' to 'affricate' Voice Inspector v3.2.1, BSAPI 3.15.0 - Mar 16 2018 - Export of Speakers/Populations allows export only voiceprints -…

Voice Biometrics Course (technical training)

Relevance: 9%      Posted on: 2017-05-18

The Voice Biometrics course consist of the following modules. Please ask your Phonexia contact for detailed description. (YES = this part is mandatory for course)   VBS course Required time [h] Block name Block description YES 0,5 Intro & Phonexia Portfolio Intro & Phonexia Portfolio YES 0,5 Project focus - Explain basic needs Partner project related discussion focused mainly to finalizing training topics and agenda YES 0,75 Apps Designing and Developing - Licensing Gives trainee knowledge about type of licensing, and how to use the license file YES 0,75 Technologies - Data gathering and Quality measurement - basic Data gathering…

Product Portfolio

Relevance: 9%      Posted on: 2018-04-02

Phonexia Speech Platform is an umbrella concept for all Phonexia’s products and services related to speech technologies. It gives us the ability to customize various products to a wide range of customer needs. Platform Edition is an encapsulation of specific setup of speech technologies, modules, applications, utilities and services designed for a specific market segment. We distinguish Speech Analytics (SAL) and Voice Biometrics (VBS) as most common domain of usage. It is also a tool for marketing and sales. Voice Biometrics is focused more on identifying speaker, gender, language spoken and more. Speech Analytics focuses on gathering information about content…

Age Estimation

Relevance: 9%      Posted on: 2018-04-12

Phonexia Age Estimation (AGE) estimates the age of a speaker from audio recording. The process of voiceprint extraction is similar to the extraction of SID, but as a result different features get extracted; therefore, the voiceprints extracted from AGE and SID are not mutually compatible. Technology Trained with emphasis on spontaneous telephony conversation The technology is language-, accent-, text-, and channel- independent Compatibility with the widest range of audio sources possible (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc. Input Input format for processing: WAV or RAW (8 or 16 bits linear coding), A-law or Mu-law, PCM, 8kHz+ sampling…

Speech Analytics Course (technical training)

Relevance: 9%      Posted on: 2017-05-18

The Speech Analytics course consists of the following modules. Please ask your Phonexia contact for detailed description. (YES = this part of the course is obligatory)   SAL course Required time [h] Block name Block description YES 0,5 Intro & Phonexia Portfolio Intro & Phonexia Portfolio YES 0,5 Project focus – Explain basic needs Discussion of partner project focused mainly on finalizing the training topics and agenda. YES 0,75 Application Design & Development – Licensing Presentation of types of licensing, and how to use the license file. YES 0,75 Technologies – Data gathering and Quality measurement – basic Description of…

Measuring of a software processing speed – what is the FtRT (Faster than Real Time)

Relevance: 9%      Posted on: 2019-10-30

Faster Than Real Time (FTRT) is metrics developed for defining software performance reference point. Using this metric you can collect "benchmark" data of real processing speed for reviewed software, which should be find - and reproduce - on exactly defined HW. Then, comparing various benchmarks result, you can compare performance of the specified software and its parts on different HW configurations. And vice versa using the same metric you can compare software of different vendors on the same HW configuration and for the same processing task. We are recognizing two measurable metrics: Recording based FTRT is calculated from real recordings…