Search Results for: voice activity detection

Results 1 - 10 of 53 Page 1 of 6
Results per-page: 10 | 20 | 50 | 100

Voice Activity Detection – Essential

Relevance: 100%      Posted on: 2018-04-04

Phonexia Voice Activity Detection (VAD) identifies parts of audio recordings with speech content vs. nonspeech content. Technology Trained with emphasis on spontaneous telephony conversation The technology is language-, accent-, text-, and channel- independent Compatibility with the widest range of audio sources possible (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc. Input Input format for processing: WAV or RAW (8 or 16 bits linear coding), A-law or Mu-law, PCM, 8kHz+ sampling Output Log file with processed information (speech vs. nonspeech segments) Segmentation The section Segmentation describes the results of VAD, which are segments of detected voice and silence. Segments are…

Voice Activity Detection

Relevance: 100%      Posted on: 2018-04-02

Voice Activity Detection is a language-, domain- and channel-independent technology that identifies parts of audio recordings with speech content vs. non-speech content. It creates labels for speech and other signals in the recording; this can then serve as a decision point whether to process the recording by other technologies or not. VAD is usually part of rapid filtration process in deployment. Typical use cases are: detection of present or absent human speech for voice processing, filtering non-speech parts of the recording, filtering out recordings with not enough net speech to be processed by other technologies voice activated process, etc. The…

Phonexia Voice Inspector v3

Relevance: 39%      Posted on: 2018-04-02

About Phonexia Voice Inspector v3 (VIN3) provides police forces and forensic experts with a highly accurate speaker identification tool during investigation of criminal matters. It uses the power of voice biometry to automatically recognize speakers by their voice. Main features of the VIN3 application: Automatic speaker identification tool to strengthen results of the standard phonetics-based approaches Scoring in likelihood ratio (LR) – Result from statistical test for two models comparison. It gives back number which expresses how many times more likely the data are under one model than the other. LnLR or LogLR meets numbers in interval <-∞;+∞>...), and verbal…

Phonexia Voice Inspector v1

Relevance: 39%      Posted on: 2017-05-18

About Phonexia Voice Inspector v1 (VIN1) provides police forces and forensic experts with highly accurate speaker identification tools to be used during the investigation of criminal matters. It utilizes the power of voice biometry to automatically recognize the speaker by their voice. Main features of the VIN1 application: An automatic speaker identification tool to strengthen the results of the standard phonetic based approaches Scoring of the likelihood ratio (LR), log-likelihood ratio (LLR), and an option of a verbal presentation of the results Graphic presentation of the likelihood ratio (LR), probability density function and Tippett plot Generating detailed reports (expert opinion…

Voice Biometrics

Relevance: 39%      Posted on: 2018-04-07

Overview Phonexia Voice Biometrics is a special edition of Phonexia Speech Platform which allows you to understand the nature of audio without having to listen to it. The product helps people to utilize the power of voice biometrics to verify speaker or identify crimes. The technologies reveals automatically WHO, what GENDER, what LANGUAGE is speaking, and many other metadata. Voice Biometrics - Typical Use-Cases Use case Speaker Verification is tailored to banks/insurance companies/money lending companies and others, where is needed to confirm if caller/voice in audio file is the same person who is known to the customer. For this use…

Voice Inspector

Relevance: 39%      Posted on: 2017-05-18

About Phonexia Voice Inspector v3 (VIN3) provides police forces and forensic experts with a highly accurate speaker identification tool during investigation of criminal matters. It uses the power of voice biometry to automatically recognize speakers by their voice. Main features of the VIN3 application: Automatic speaker identification tool to strengthen results of the standard phonetics-based approaches Scoring in likelihood ratio (LR) – Result from statistical test for two models comparison. It gives back number which expresses how many times more likely the data are under one model than the other. LnLR or LogLR meets numbers in interval <-∞;+∞>...), and verbal…

Voice Biometrics Course (technical training)

Relevance: 33%      Posted on: 2017-05-18

The Voice Biometrics course consist of the following modules. Please ask your Phonexia contact for detailed description. (YES = this part is mandatory for course)   VBS course Required time [h] Block name Block description YES 0,5 Intro & Phonexia Portfolio Intro & Phonexia Portfolio YES 0,5 Project focus - Explain basic needs Partner project related discussion focused mainly to finalising training topics and agenda YES 0,75 Apps Designing and Developing - Licensing Gives trainee knowledge about type of licensing, and how to use the license file YES 0,75 Technologies - Data gathering and Quality measurement - basic Data gathering…

Phonexia Voice Inspector EoL

Relevance: 30%      Posted on: 2018-07-19

Information about release dates, support and maintenance periods of Phonexia Voice Inspector.

Prefiltering

Relevance: 9%      Posted on: 2018-03-23

Prefiltering is a very important part of basically any speech technology architecture. These 2 technologies are very fast and can significantly decrease the load and increase the precision of the following technologies (the exact number depends on the type of your data), thanks to sorting out the files with unacceptable quality or not enough net speech. The 2 technologies in question are Speech Quality Estimation (SQE) and Voice Activity Detection (VAD).  

Speech Intelligence Resolver v1

Relevance: 9%      Posted on: 2017-05-18

About Phonexia Speech Intelligence Resolver v1 (SIR1) combines the power of speech technologies within a single application. The application automatically performs visualization of the record as well as filtering the speech metadata uncovered from your records effectively. Speech technologies implemented: Phonexia Speaker Identification (SID2) Phonexia Language Identification (LID2) Phonexia Gender identification (GID) Phonexia Voice Activity Detection (VAD) Phonexia Speaker Diarization (DIAR) Phonexia Keyword Spotting (KWS) Phonexia Speech Quality Estimator (SQE) Phonexia Speech Transcription (STT) SIR is a client application cooperating with REST servers. It can be used as a standalone application due to the integrated local REST server. It was…