Search Results for: sqe

Results 1 - 20 of 21 Page 1 of 2
Results per-page: 10 | 20 | 50 | 100

SPE3 – Releases and Changelogs

     Posted on: 2020-12-14

Speech Engine (SPE) is developed as RESTfull API on top of Phonexia BSAPI. SPE was formerly known as BSAPI-rest (up to v2.x) or as Phonexia Server (up to v3.2.x). This page lists changes in SPE releases. Releases Changelogs Speech Engine 3.35.4, DB v1601, BSAPI 3.35.4 (2020-12-14) Public release Fixed: STT/KWS model AR_XL_5 has incorrect name and does not start Fixed: Missing KWS model AR_XL_5 Fixed: Processing of some short recordings causes TwoGmmCalibThreshold is not finite error Fixed: STT preferred phrases "out of vocabulary" (OOV) warning message is now more verbose Speech Engine 3.36.0, DB v1601, BSAPI 3.35.3 (2020-12-01) Non-public Feature…

Phonexia Speech Engine

     Posted on: 2020-11-19

About Phonexia Speech Engine v3 (SPE3) is a main executive part of the Phonexia Speech Platform. It is a server application with REST API interface through which you can access all available speech technologies. Both, Linux 64bit and Windows 64bit operating systems are supported. Phonexia Speech Engine (SPE3) is adjustable server component which houses all speech technologies. SPE3 provides RESTfull application programming interface to access various technologies. Aside from technologies themselves the SPE has implemented other various functionality supporting work with speech technologies, recordings and streams, and others. Features Main purpose of SPE is to work as processing unit for…

Browser3 – Releases and Changelogs

     Posted on: 2020-10-23

Phonexia Browser v3 (Browser3) is developed as client on top of Phonexia Speech Engine v3. Phonexia Browser is a successor of Phonexia Speech Intelligence Resolver v1 (SIR1). This page lists changes in Browser releases. Releases Changelogs Phonexia Browser v3.35.2, BSAPI 3.35.2 - Oct 21 2020 Public release Fixed: Speaker identification dialog in WaveEditor which did not work for SID4 Fixed detection of certain USB license tokens Phonexia Browser v3.35.0, BSAPI 3.35.0 - Oct 02 2020 Public release New: Compatibility with SPE 3.35 Phonexia Browser v3.30.12, BSAPI 3.30.11 - Aug 20 2020 Public release Fixed: Transcription results intermittently displays words in wrong…

Technical Training Essentials

     Posted on: 2019-09-27

Core objective: Understanding technical essentials of using Phonexia technologies and products Duration: ~94 minutes (7 + 19 + 22 + 23 + 23 min chapters) intended for product architects or developers assumes you have already watched Phonexia technologies introduction video assumes understanding of working in command line REST API principles processing JSON or XML Introduction (7 min) technologies recap CLI, REST and GUI interfaces overview https://youtu.be/xzrHyyIl01s MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY MODULE 2: Filtering and supporting…

Voice Inspector – supporting technologies

     Posted on: 2019-06-28

Automatic Speaker Identification (SID) is the most important but not the only Phonexia technology that is implemented in Voice Inspector (VIN). Apart from SID, forensic experts, users of VIN, can benefit from automatic Signal-to-Noise Ratio calculation, Voice Activity detection, Phoneme search, and a Wave editor which incorporates the waveform, spectrum and power panel. Let's have a look on how to utilize individual technologies. Signal-to-Noise Ratio Recording quality can strongly influence the reliability of SID results and so the outcome of a forensic case. Therefore, VIN uses a module of Phonexia Speech Quality Estimation (SQE) to calculate the Signal-to-Noise Ratio (SNR)…

Phonexia technologies introduction

     Posted on: 2019-01-25

Core objective: Basic understanding of Phonexia speech technologies and products; typical use cases, implementations and deployment topologies Duration: 35 minutes intended for idea makers and product designers assumes generic knowledge of Phonexia and speech technologies in general Content 00:00 Introduction What information can we get from speech? Overview of basic use cases Phonexia Speech Platform brief 4:21 Phonexia technologies overview and their usages Filtering and supporting technologies 04:32 Speech Quality Estimation (SQE) 05:27 Voice Activity Detection (VAD) 06:37 Diarization (DIAR) 07:41 Age Estimation (AGE) 08:14 Waveform Denoiser Voice Biometrics technologies 08:56 Speaker Identification (SID) 10:18 Language Identification (LID) 11:10 Gender…

Phonexia technology models EoL

     Posted on: 2018-07-11

Information about release dates, support and maintenance periods of Phonexia technology models.

Voice Biometrics

     Posted on: 2018-04-07

Overview Phonexia Voice Biometrics is a special edition of Phonexia Speech Platform which allows you to understand the nature of audio without having to listen to it. The product helps people to utilize the power of voice biometrics to verify speaker or identify crimes. The technologies reveals automatically WHO, what GENDER, what LANGUAGE is speaking, and many other metadata. Voice Biometrics - Typical Use-Cases Use case Speaker Verification is tailored to banks/insurance companies/money lending companies and others, where is needed to confirm if caller/voice in audio file is the same person who is known to the customer. For this use…

Speech Analytics

     Posted on: 2018-04-06

Overview Phonexia Speech Analytics allows you to understand the  content of audio without having to listen to it. The results help both commercial entities and security/defense forces for immediate precise decision and response. The technologies reveal automatically WHAT content, TOPIC and KEY PHRASES are spoken, and many other metadata.   Speech Analytics - Typical Use-Cases Speech transcription is used in various application. Knowledge of content of whole call is bringing business value to the customer, comparing to listening the audio files by analytic or supervisor. Reading the text is also faster than listening the audio. Speech Analytics output is often…

Speech Quality Estimator – Essential

     Posted on: 2018-04-04

Phonexia’s Speech Quality Estimator quantifies the acoustic quality of recordings. This helps the user to quickly determine whether the acoustic quality of a recording is good for processing with other speech technologies or not. As an answer for SQE, the SPE returns a json/xml file. This file includes general information about the technology and statistics of all (one or two) channels. The statistics of all channels include the numbers for many aspects of recording quality, and the overall global score. Technology The technology is language-, accent-, text-, and channel- independent Compatibility with the widest range of audio sources possible (applies…

Speech Quality Estimation

     Posted on: 2018-04-02

Speech Quality Estimation (SQE) is a language-, domain- and channel-independent technology that quantifies the quality of an audio recording. 2 most important statistics used in the calculation of the SQE score are SNR (signal-to-noise ratio) and the bitrate of the recording. SQE is usually part of the rapid filtration process in deployments. SQE also measures over 20 other properties of the recording, all of which can be found in the output file and further processed. See description in SPE documentation. Typical use cases are: verification of recording quality on the input, searching based on quality of the recording, noise of…

Prefiltering

     Posted on: 2018-03-23

Prefiltering is a very important part of basically any speech technology architecture. These 2 technologies are very fast and can significantly decrease the load and increase the precision of the following technologies (the exact number depends on the type of your data), thanks to sorting out the files with unacceptable quality or not enough net speech. The 2 technologies in question are Speech Quality Estimation (SQE) and Voice Activity Detection (VAD).  

SPE configuration

     Posted on: 2018-02-02

Basic explanation of configuration directives for SPE with hints & tips. Overview of phxspe.properties for beginners.

Sizing of the computing units for speech technologies

     Posted on: 2018-02-02

Best practices for good sizing of Phonexia technologies depend on a few facts: Intense work with large data sets requires good performance and bandwidth between RAM and CPU. It all depends on the size of the files with technological models data, usually loaded into RAM and used intensively for computing operations Always think only about physical cores of CPU (HT, VT features can't help in performance) Also seek for CPUs with a large L3 cache. And the better CPUs are those with higher l3_cache_size/#_of_physical_CPU_cores ratio. We currently assume that CPUs from the current Intel Xeon Family in the 4th generation…

SQE

     Posted on: 2018-02-01

Phonexia Speech Quality Estimator

Speech Analytics Course (technical training)

     Posted on: 2017-05-18

The Speech Analytics course consists of the following modules. Please ask your Phonexia contact for detailed description. (YES = this part of the course is obligatory)   SAL course Required time [h] Block name Block description YES 0,5 Intro & Phonexia Portfolio Intro & Phonexia Portfolio YES 0,5 Project focus – Explain basic needs Discussion of partner project focused mainly on finalizing the training topics and agenda. YES 0,75 Application Design & Development – Licensing Presentation of types of licensing, and how to use the license file. YES 0,75 Technologies – Data gathering and Quality measurement – basic Description of…

Voice Biometrics Course (technical training)

     Posted on: 2017-05-18

The Voice Biometrics course consist of the following modules. Please ask your Phonexia contact for detailed description. (YES = this part is mandatory for course)   VBS course Required time [h] Block name Block description YES 0,5 Intro & Phonexia Portfolio Intro & Phonexia Portfolio YES 0,5 Project focus - Explain basic needs Partner project related discussion focused mainly to finalizing training topics and agenda YES 0,75 Apps Designing and Developing - Licensing Gives trainee knowledge about type of licensing, and how to use the license file YES 0,75 Technologies - Data gathering and Quality measurement - basic Data gathering…

Speech Intelligence Resolver v1

     Posted on: 2017-05-18

About Phonexia Speech Intelligence Resolver v1 (SIR1) combines the power of speech technologies within a single application. The application automatically performs visualization of the record as well as filtering the speech metadata uncovered from your records effectively. Speech technologies implemented: Phonexia Speaker Identification (SID2) Phonexia Language Identification (LID2) Phonexia Gender identification (GID) Phonexia Voice Activity Detection (VAD) Phonexia Speaker Diarization (DIAR) Phonexia Keyword Spotting (KWS) Phonexia Speech Quality Estimator (SQE) Phonexia Speech Transcription (STT) SIR is a client application cooperating with REST servers. It can be used as a standalone application due to the integrated local REST server. It was…

Phonexia Browser

     Posted on: 2017-05-18

About Phonexia Browser v3 (Browser v3) software that combines the power of speech technologies in a single desktop application. The application automatically  performs visualization of records as well as effective filtration of speech metadata uncovered from the user´s records. Speech technologies implemented: Speaker Identification (SID) Language Identification (LID) Gender identification (GID) Voice Activity Detection (VAD) Speaker Diarization (DIAR) Keyword Spotting (KWS, 10+ languages available) Speech Quality Estimator (SQE) Speech to Text (STT, 10+ languages available) Age Estimation (AGE) Browser v3 is a client application cooperating with Speech Engine v3 (SPE3). It is possible to use it as a client -…

Phonexia Speech Platform for Government

     Posted on: 2017-05-18

Phonexia Voice Biometrics GOV is a special edition of Phonexia Speech Platform for Government which allows you to understand the nature of audio without having to listen to it. The product helps people to utilize the power of voice biometrics to filter audio and prevent or identify crimes. The technologies reveal automatically WHO, what GENDER, what LANGUAGE is speaking, and many other metadata. The product can be used typically for investigation support, SIGINT or other types of operations. It serves 4 main use-cases: Voice Biometrics - Speaker Search in Archive (Investigation) Voice Biometrics - Speaker Spotting Tactical Voice Biometrics -…