Search Results for: speaker

Results 1 - 10 of 39 Page 1 of 4
Results per-page: 10 | 20 | 50 | 100

Speaker Diarization (DIAR)

Relevance: 100%      Posted on: 2017-06-26

About DIAR Phonexia Speaker Diarization (DIAR) enables segmentation of voices in one monochannel audio record. Technology Trained with emphasis on spontaneous telephony conversation The technology is language-, accent-, text-, and channel- independent Compatibility with the widest range of audio sources possible (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc. Input Input format for processing: WAV or RAW (8 or 16 bits linear coding), A-law or Mu-law, PCM, 8kHz+ sampling Output Log file with processed information (segmentation of speech, silence, and technical signals – ie. elimination of phone lines beeps, DTMF tones, music, pauses, etc.) Audio file extracted for each…

Speaker Identification (SID)

Relevance: 100%      Posted on: 2019-06-13

Phonexia Speaker Identification uses the power of voice biometry to recognize speakers by their voice... i.e. to decide whether the voice in two recordings belongs to the same person or two different people. High accuracy of Speaker Identification, the Phonexia's flagship technology, has been validated in a NIST Speaker Recognition Evaluations. Basic use cases and application areas The technology can be used for various speaker recognition tasks. One basic distinction is based on the kind of question we want to answer. Speaker Identification is the case when we are asking "Whose voice is this?", such as in fake emergency calls.…

Speaker Identification: Results Enhancement

Relevance: 100%      Posted on: 2019-05-29

Speaker Identification (SID) Results Enhancement is a process that adjusts the score threshold for detecting/rejecting speakers by removing the effect of speech length and audio quality. This is achieved by use of Audio Source Profiles, that represent as closely as possible the source of the speech recording (device, acoustic channel, distance from microphone, language, gender, etc.). Although the out-of-the-box system is robust in such factors, several result enhancement procedures can provide even better results and stronger evidence. Audio Source Profile An Audio Source Profile is a representation of the speech source, e.g., device, acoustic channel, distance from microphone, language, gender,…

Speaker Diarization

Relevance: 100%      Posted on: 2018-04-02

Speaker Diarization labels segments of the same voice(s) in one mono channel audio record based by the individual speaker´s voice. It is a language-, domain- and channel-independent technology. It performs not only the segmentation of speakers, but of technical signals and silence as well. The outputs of the technology can be both log file with labels and/or split audio files/one new multichannel audio file. The correct speaker diarization is still research task nowadays. Typical use cases: Preprocessing for other speech recognition technologies, labeling the parts of the utterance according to the speakers, splitting telephone conversation recorded in mono into several…

Phonexia Voice Inspector v3

Relevance: 9%      Posted on: 2018-04-02

About Phonexia Voice Inspector v3 (VIN3) provides police forces and forensic experts with a highly accurate speaker identification tool during investigation of criminal matters. It uses the power of voice biometry to automatically recognize speakers by their voice. Main features of the VIN3 application: Automatic speaker identification tool to strengthen results of the standard linguistics- and phonetics-based approach Scoring in Likelihood Ratio (LR) – result from a statistical test for a comparison of two hypotheses. The system returns a number from the interval <0, +∞>, which expresses how many times more likely the data are under one hypothesis than the…

Phonexia Speech Platform for Commerce

Relevance: 9%      Posted on: 2017-05-18

Phonexia Speech Analytics is a special edition of Phonexia Speech Platform COM which allows you to boost analysis of your call traffic. It is effective solution for commercial, telecom, utilities, financial sector, and other contact centers. It provides 4 main parts: Dialog Analysis, Demographic Information, Script Alignment, Speech Transcription (automatic).   Phonexia Voice Biometrics is a special edition of Phonexia Speech Platform COM which allows you to boost security and enhance customer experience with voice biometrics technologies. It is effective solution for commercial and financial sectors, especially for banks, insurance companies, and call centers. It covers both usecases: Fraud Detection…

SPE3 – Releases and Changelogs

Relevance: 9%      Posted on: 2019-08-22

Speech Engine (SPE) is developed as RESTfull API on top of Phonexia BSAPI. SPE was formerly known as BSAPI-rest (up to v2.x) or as Phonexia Server (up to v3.2.x). This page lists changes in SPE releases. Releases Changelogs == SPE v3.17.x == Speech Engine 3.17.3 (08/22/2019) - DB v1200, BSAPI 3.21.3 [G_#191] Fixed: KWS getting phonemes/graphemes in specific circumstances returns unknown error [G_BSAPI#413] Fixed: duplicated output from KWS Speech Engine 3.17.2 (08/02/2019) - DB v1200, BSAPI 3.21.2 [G_BSAPI#300] Fixed: KWS stream results are displayed with a delay Speech Engine 3.17.1 (07/22/2019) - DB v1200, BSAPI 3.21.1 Added 5th generation of…

Software Vetting (Best Practice)

Relevance: 9%      Posted on: 2017-06-15

The purpose of this document is to help client to satisfy their high security standards during integration of Phonexia software to their critical infrastructure. The vetting ensures that Phonexia software is not dangerous to the client’s infrastructure in any way. It means there are no backdoors, viruses, worms, Trojan horses, spyware, adware, critical bugs, unwanted functionality, no information is sent outside the client’s infrastructure. Vetting context Speech technology is a very dynamic area with a very fast development. For example the speaker identification error rate decreases to half between each two evaluations organized by National Institute of Standards and Technology,…

Speech Analytics

Relevance: 9%      Posted on: 2018-04-06

Overview Phonexia Speech Analytics allows you to understand the  content of audio without having to listen to it. The results help both commercial entities and security/defense forces for immediate precise decision and response. The technologies reveal automatically WHAT content, TOPIC and KEY PHRASES are spoken, and many other metadata.   Speech Analytics - Typical Use-Cases Speech transcription is used in various application. Knowledge of content of whole call is bringing business value to the customer, comparing to listening the audio files by analytic or supervisor. Reading the text is also faster than listening the audio. Speech Analytics output is often…

Phonexia Speech Platform for Government

Relevance: 9%      Posted on: 2017-05-18

Phonexia Voice Biometrics GOV is a special edition of Phonexia Speech Platform for Government which allows you to understand the nature of audio without having to listen to it. The product helps people to utilize the power of voice biometrics to filter audio and prevent or identify crimes. The technologies reveal automatically WHO, what GENDER, what LANGUAGE is speaking, and many other metadata. The product can be used typically for investigation support, SIGINT or other types of operations. It serves 4 main use-cases: Voice Biometrics - Speaker Search in Archive (Investigation) Voice Biometrics - Speaker Spotting Tactical Voice Biometrics -…