Skip to content Skip to main navigation Skip to footer

Search: use%20cases

104 results

Designing and Developing Application

…What are real benefits for customer (finding the needle in haystack, approaching new information, processing only few data with highest possible accuracy)? How the solution match the current processes and infrastructure of the customer? How many false alarms are acceptable by customer? Etc. Solution requirements Is there high availability (HA) required for the solution? Does customer need to use a…

Credentials


This part requires higher (and non-anonymous) access level.
How to solve this situation:

  1. Log in here if you are not logged in.
  2. Register here. It takes just a few clicks and it’s free.

Password Reset

To reset your password, please enter your email address or username below. Only fill in if you are not human…

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY…

Q: How do you calculate SNR in Speech Quality Estimation?

A: Signal-to-Noise Ratio (SNR) is an important metric of whether a recording is worth further processing by other speech technologies, so it is part of our Speech Quality Estimation. However, calculating SNR automatically is not a trivial task. We use the fact that the statistical distribution of the frequencies in the waveform of speech has Gamma distribution. In contrast, noise…

SID: TUTORIAL: Speaker Identification – How to Do a Basic Test

…Evaluation Package Evaluation package (download page) is consisting of Phonexia Browser and Phonexia Speech Engine including all necessary technologies. 2. Data We prepared the dataset for your testing. Package contains data for speaker model creation and speaker spotting too. The process of testing is the same for the data set collected by the user himself. Dataset is available to download…

STT: How to properly convert Confusion Network results to One-best

Confusion Network output is the most detailed Speech Engine STT output as it provides multiple word alternatives for individual timeslots of processed speech signal. Therefore many applications want use it as the main source of speech transcription and perform eventual conversion to less verbose output formats internally. This article provides the recommended way to do the conversion. Time slots and…

Understand SPE technologies, instances and workers

…initialized at the start of Speech Engine, taking up a memory, regardless of being actually used for some processing. Customers are served by post office workers at counter desks providing a particular service. Requests are served by processing workers assigned to an instance of particular technology.   Speech Engine workers are like post office workers Similar analogy exists between the…

Q: What are the requirements for SID evaluation dataset?

…in each recording (i.e. usually 2+ minutes recording length) only one speaker in each recording wide variety of gender and age is recommended recordings should be as similar to the target use case as possible (device, channel, distance from mic, languages distribution) audio files should be mono, lin16 format, 8 kHz+ sample rate *Note: splitting single recording into multiple shorter…

Understand SPE processing queue

…SPE does not hoard the results, consuming memory… Picking up the result is possible: using GET /pending/{ID} request (which responds with HTTP status code 303, redirecting to /done/{ID}) using GET /done/{ID} request ⓘ Depending on the used development framework (and/or its configuration), you may not get the HTTP status code 303 response because the framework handles the redirect internally. This…

Critical Issue

The system is inoperative, and it has a critical effect on the EndUser’s operations which can’t be solved by the End user’s or Partner’s IT/technical administrator. This condition is generally characterized by system instability and requires immediate correction. Phonexia’s software function is stopped due to its internal error, and it fails again on a different data input after Phonexia’s software…

SPE and Browser installation: embedded SPE

…window. Alternatively, you can add files also by right-clicking in the main window or choosing the button from the top bar. Select the technologies you want to process your recordings with from the top bar. The background color of the selected technologies will change to indicate they have been selected for use. Click on the Start button to start the…

Major Issue

An Issue that renders the Product partially functional, the use of which in a production environment is substantially reduced. The Issue contains an error that impairs the ability of the system to process a majority of audio files or audio streams, or that renders the setup and maintenance of the system inoperable….