Search Results for: ROM

Time Analysis

     Posted on: 2018-04-15

Time Analysis Extraction (TAE) by Phonexia extracts basic information from the dialogue in a recording, providing essential knowledge about the conversation flow. This makes it easy to identify long reaction times, crosstalk, or responses of speakers in both channels. The technology is only meaningful for recordings with 2 channels. As its TAE result, SPE returns a JSON/XML file that includes general information about the technology and details of the time analysis. The technology can work with either a closed recording or a stream. Monologue: describes the statistics of a recording related to one channel. channel…
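To make the terms "crosstalk" and "reaction time" concrete, here is a minimal sketch that derives both from per-channel speech segments. It is an illustration only: the `(start, end)` segment representation and the function names are assumptions, not the TAE output format or Phonexia's method, and segments within one channel are assumed not to overlap each other.

```python
from typing import List, Tuple

# Hypothetical representation: one (start_sec, end_sec) pair per speech segment.
Segment = Tuple[float, float]


def crosstalk_seconds(ch0: List[Segment], ch1: List[Segment]) -> float:
    """Total time during which both channels contain speech."""
    total = 0.0
    for s0, e0 in ch0:
        for s1, e1 in ch1:
            overlap = min(e0, e1) - max(s0, s1)
            if overlap > 0:
                total += overlap
    return total


def reaction_times(speaker: List[Segment], responder: List[Segment]) -> List[float]:
    """Gap between the end of each speaker segment and the responder's next start."""
    responder_starts = sorted(start for start, _ in responder)
    gaps = []
    for _, end in speaker:
        next_start = next((s for s in responder_starts if s >= end), None)
        if next_start is not None:
            gaps.append(next_start - end)
    return gaps


if __name__ == "__main__":
    channel_0 = [(0.0, 2.0), (5.0, 7.0)]
    channel_1 = [(2.5, 5.5), (8.0, 9.0)]
    print(crosstalk_seconds(channel_0, channel_1))
    print(reaction_times(channel_0, channel_1))
```

Both calculations need speech activity from two separate channels, which is consistent with the note above that the analysis is only meaningful for 2-channel recordings.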

Age Estimation

     Posted on: 2018-04-12

Phonexia Age Estimation (AGE) estimates the age of a speaker from an audio recording. The voiceprint extraction process is similar to that of SID, but different features are extracted; therefore, voiceprints extracted by AGE and SID are not mutually compatible. Technology: trained with emphasis on spontaneous telephony conversation; language-, accent-, text-, and channel-independent; compatible with the widest possible range of audio sources (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc. Input: input format for processing: WAV or RAW (8 or 16 bits linear coding), A-law or Mu-law, PCM, 8kHz+ sampling…
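The input requirements listed above can be checked before a file is submitted for processing. The sketch below uses Python's standard `wave` module to read a WAV header and test the sample rate and sample width. The function names and thresholds are assumptions drawn from the listing, not part of any Phonexia API. The `wave` module reads linear PCM WAV only, so A-law, Mu-law, and RAW files are outside its scope.

```python
import wave

MIN_SAMPLE_RATE_HZ = 8000        # "8kHz+ sampling" from the listing above
ALLOWED_SAMPLE_WIDTHS = (1, 2)   # bytes per sample: 8 or 16 bits linear coding


def describe_wav(path: str) -> dict:
    """Read basic header fields from a linear PCM WAV file."""
    with wave.open(path, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),
        }


def meets_input_requirements(path: str) -> bool:
    """Hypothetical pre-check against the listed input format."""
    info = describe_wav(path)
    return (
        info["sample_rate_hz"] >= MIN_SAMPLE_RATE_HZ
        and info["sample_width_bytes"] in ALLOWED_SAMPLE_WIDTHS
    )
```

A caller could run `meets_input_requirements("call.wav")` (hypothetical file name) and resample or reject files that fail the check.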

VIN – Releases and Changelogs

     Posted on: 2018-04-08

Phonexia Voice Inspector (VIN) is developed as a desktop application for forensic speaker comparison. This page lists changes in VIN releases.

Releases Changelogs

Voice Inspector v4.0.0, BSAPI 3.23.0 - Dec 11 2019
- VIN is available with L4 technology model
- Other technology models (S2, L2, L3, XL3) are no longer supported
- Added Diarization Technology (available in waveform editor)
- Population Sets structure changed
- Reworked dialog for population set management
- Added possibility to set type of estimation of the Target distribution
- Using population set to estimate Target distribution allows 1:1 comparison
- Bug fixes

Voice Inspector…

Speech Analytics

     Posted on: 2018-04-06

Overview: Phonexia Speech Analytics allows you to understand the content of audio without having to listen to it. The results help both commercial entities and security/defense forces make immediate, precise decisions and responses. The technologies automatically reveal WHAT content, TOPIC and KEY PHRASES are spoken, along with many other metadata. Speech Analytics - Typical Use-Cases: Speech transcription is used in various applications. Knowing the content of a whole call brings business value to the customer compared with having an analyst or supervisor listen to the audio files. Reading the text is also faster than listening to the audio. Speech Analytics output is often…

Software Vetting

     Posted on: 2018-04-06

The purpose of this document is to help clients meet their high security standards when integrating Phonexia software into their critical infrastructure. The vetting ensures that Phonexia software is not dangerous to the client’s infrastructure in any way. This means there are no backdoors, viruses, worms, Trojan horses, spyware, adware, critical bugs, or unwanted functionality, and no information is sent outside the client’s infrastructure. Vetting context: Speech technology is a very dynamic area with very fast development. For example, the speaker identification error rate halves between consecutive evaluations organized by the National Institute of Standards and Technology,…

Open Source Acknowledgement

     Posted on: 2018-04-06

This page collects information about Open Source code and licenses. You may want to ask your Phonexia contact which parts of the page are relevant to your project.

BSAPI 3 dependencies

Name            Version       License          Link type
ADVobfuscator   1.1           link             static
boost           1.70          Boost License    static
botan           2.7.0         Simplified BSD   static
duktape         2.5.0         MIT              static
FLAC            1.3.2         BSD license      static
fmt             5.2.1         MIT              static
glibc           -             GNU LGPL         dynamic (Linux)
minizip         1.2.11        link             static
mkl             2019.1.144    ISSL             static
nowide          0.1.1         Boost License    static
Open Fst        1.6.9         Apache license   static
ogg             1.3.3         BSD license      static
onnxruntime     1.1.0         MIT              static
opus            1.2.1…

Voice Activity Detection – Essential

     Posted on: 2018-04-04

Phonexia Voice Activity Detection (VAD) identifies the parts of audio recordings that contain speech as opposed to nonspeech content. Technology: trained with emphasis on spontaneous telephony conversation; language-, accent-, text-, and channel-independent; compatible with the widest possible range of audio sources (applies channel compensation techniques): GSM/CDMA, 3G, VoIP, landlines, etc. Input: input format for processing: WAV or RAW (8 or 16 bits linear coding), A-law or Mu-law, PCM, 8kHz+ sampling. Output: log file with processed information (speech vs. nonspeech segments). Segmentation: The section Segmentation describes the results of VAD, which are segments of detected voice and silence. Segments are…

Speech Quality Estimator – Essential

     Posted on: 2018-04-04

Phonexia’s Speech Quality Estimator (SQE) quantifies the acoustic quality of recordings. This helps the user quickly determine whether the acoustic quality of a recording is good enough for processing with other speech technologies. As its SQE result, SPE returns a JSON/XML file that includes general information about the technology and statistics for all (one or two) channels. The per-channel statistics cover many aspects of recording quality, together with an overall global score. Technology: language-, accent-, text-, and channel-independent; compatible with the widest possible range of audio sources (applies…

Keyword pronunciation

     Posted on: 2018-04-04

The pronunciation of each keyword is generated automatically (G2P, grapheme to phoneme), taken from the lexicon of known words, or converted from audio (phoneme transcription). It can be edited manually for each word (Phonexia does not limit the number of pronunciations per keyword or phrase).

Product Portfolio

     Posted on: 2018-04-02

Phonexia Speech Platform is an umbrella concept for all of Phonexia’s products and services related to speech technologies. It gives us the ability to customize various products to a wide range of customer needs. A Platform Edition is an encapsulation of a specific setup of speech technologies, modules, applications, utilities and services designed for a specific market segment. We distinguish Speech Analytics (SAL) and Voice Biometrics (VBS) as the most common domains of use. It is also a tool for marketing and sales. Voice Biometrics focuses on identifying the speaker, gender, language spoken and more. Speech Analytics focuses on gathering information about content…