Search: results

43 results

Terms of Service

…liability for consequential or incidental damages, in such states or jurisdictions PHONEXIA's and its affiliates' liability shall be limited to the full extent permitted by law. 13. Force Majeure 13.1. Force Majeure. PHONEXIA shall not be responsible for any delay or failure in performance that results from causes beyond its reasonable control (“Force Majeure Events”), whether or not foreseeable by…

Releases and Changelogs (VIN)

…the Phonexia sales representative. Phonexia Voice Inspector 5.0 brings the Speaker Identification model XL5, which provides more accurate results for telephony data than previous generations of Speaker Identification models such as SID4 XL4. Users can observe that the SID4 XL5 model returns different values of LLR scores, which are used for evidence calculation. Therefore, Speaker Identification score distribution…

Understand SPE administration and backup

…where the temporary results are stored; see Understand SPE database for details. Backup: System backup should be performed before any update or upgrade of the SPE. It is strongly recommended to back up mainly the following components of the system: SPE database – the technology models, SPE user accounts, etc. are stored here; SPE configuration file (usually /settings/phxspe.properties); technologies configuration…

Phonexia End User License Agreement

…including, but not limited to, any (if any) implied warranties, duties or conditions of merchantability, of fitness for a particular purpose, of reliability or availability, of accuracy or completeness of responses, of results, of workmanlike effort, and of lack of viruses, all with regard to the Software, and the provision of or failure to provide support or other services, information,…

STT: Language Model Customization tutorial

…as a source and creates a new STT model with your customizations included as a target. To see the results of the customizations, you need to use the new STT model for the transcription. Currently supported language model customizations are: adding new words and/or pronunciations. This is intended for adding client-, domain- or product-specific words like company names, product names, component…

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min): Installation; Technologies configuration; Server and database configuration; Users configuration; Files processing; Synchronous and asynchronous requests, results polling; Stream processing. https://youtu.be/4qrB-GfFdWY…

STT: Configuring word detection parameters for stream transcription

One of the improvements implemented since Speech Engine 3.24 is a neural-network-based VAD, used for word and segment detection. This article describes the segmenter configuration parameters and how they affect the real-time stream STT results. The default segmenter parameters are as shown below: [vad.online_segmenter:SOnlineVoiceActivitySegmenterI] backward_extensions_length_ms=150 forward_extensions_length_ms=750 speech_threshold=0.5 Backward and forward extensions are intervals in milliseconds, which extend the part…
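As a rough illustration of how the defaults quoted in this snippet are laid out, the sketch below reads them with Python's standard `configparser`. It assumes the settings live in an INI-style file, one `key=value` pair per line under the bracketed section, which is what the bracket syntax suggests but which the snippet does not confirm. The function name `read_segmenter_defaults` is hypothetical and not part of any Phonexia tool.

```python
import configparser

# Default values exactly as quoted in the snippet; the one-pair-per-line
# layout is an assumption based on the INI-like section header.
DEFAULT_SEGMENTER_CONFIG = """\
[vad.online_segmenter:SOnlineVoiceActivitySegmenterI]
backward_extensions_length_ms=150
forward_extensions_length_ms=750
speech_threshold=0.5
"""

SECTION = "vad.online_segmenter:SOnlineVoiceActivitySegmenterI"


def read_segmenter_defaults(text: str = DEFAULT_SEGMENTER_CONFIG) -> dict:
    """Parse the segmenter section and return its three values with proper types."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    section = parser[SECTION]
    return {
        "backward_extensions_length_ms": section.getint("backward_extensions_length_ms"),
        "forward_extensions_length_ms": section.getint("forward_extensions_length_ms"),
        "speech_threshold": section.getfloat("speech_threshold"),
    }


if __name__ == "__main__":
    for name, value in read_segmenter_defaults().items():
        print(f"{name} = {value}")
```

Reading the values into typed fields like this makes it easier to compare a tuned configuration against the defaults before testing it on a stream.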

STT: What is Words-To-Numbers feature and how to use it

This article explains the details of the new STT feature for native transcription of numbers and dates in n-best output and gives some tips for fine-tuning the results. NOTE: The feature works out-of-the-box in the following STT languages and models: English – EN_US_6 and EN_US_A_6; Spanish – ES_6; Polish – PL_PL_6; Czech – CS_CZ_5 and CS_CZ_6; Slovak – SK_SK_5 and SK_SK_6. You…

FAQs (Browser)

…recordings coming from different audio environments or even different times of the day, additional details can be analyzed, leading to better results. Warning: Any human error in evaluation set preparation (in speaker uniqueness, placing recordings into the wrong folder, etc.) affects the evaluation results, so it’s very important to prepare the data carefully. See SID Evaluation for more details in…

Waveform Denoiser (DENOISER)

…software cannot remove unwanted speech or music in the background. Denoiser is used to remove noise from the recording and at the same time to amplify the speech signal for: better intelligibility for human listeners (recommended use); achieving better results with automatic speech recognition technologies (necessary to test on customer data first). Input: audio file (format details – see…

Phoneme Recogniser (PHNREC)

Phonexia Phoneme Recogniser (PHNREC) converts speech signals into pronunciation characters (so-called phonemes). After the conversion, the pronunciation (text) can be easily indexed and searched by third-party text data mining tools. The technology is optimized for noisy recordings and colloquial speech, can process audio files as well as audio streams, and can provide results in several output formats. Phoneme…

Understand SPE processing queue

…SPE does not hoard the results, consuming memory… Picking up the result is possible: using a GET /pending/{ID} request (which responds with HTTP status code 303, redirecting to /done/{ID}), or using a GET /done/{ID} request. Note: Depending on the development framework used (and/or its configuration), you may not get the HTTP status code 303 response because the framework handles the redirect internally. This…
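The pick-up flow described in this snippet can be sketched as a small polling loop. This is a minimal sketch under stated assumptions, not SPE client code: `pick_up_result` and the injected `fetch` callable are hypothetical, `fetch` is assumed to return `(status, headers, body)` without following redirects on its own, and any status other than 303 on `/pending/{ID}` is treated as "not ready yet", which the snippet does not specify.

```python
import time
from typing import Callable, Mapping, Tuple

# Hypothetical response shape: (HTTP status code, headers, raw body).
Response = Tuple[int, Mapping[str, str], bytes]
Fetch = Callable[[str], Response]


def pick_up_result(fetch: Fetch, request_id: str,
                   attempts: int = 10, delay_s: float = 1.0) -> bytes:
    """Poll /pending/{ID} until it answers 303, then fetch the result it points to."""
    pending_path = f"/pending/{request_id}"
    for attempt in range(attempts):
        status, headers, _ = fetch(pending_path)
        if status == 303:
            # Follow the redirect manually; fall back to /done/{ID} if no
            # Location header is present (assumption).
            done_path = headers.get("Location", f"/done/{request_id}")
            done_status, _, body = fetch(done_path)
            if done_status != 200:
                raise RuntimeError(f"GET {done_path} returned {done_status}")
            return body
        # Assumption: anything other than 303 means the result is not ready.
        if attempt < attempts - 1:
            time.sleep(delay_s)
    raise TimeoutError(f"request {request_id} still pending after {attempts} attempts")
```

If the HTTP client in use follows redirects automatically, the 303 branch never fires; in that case either disable automatic redirects for this call or treat the response to the pending request as the final result.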

Understand SPE multithreaded technologies initialization

…of single-threaded initialization is that it may take longer to fully initialize the whole system, depending on the actual technologies configuration (number of initialized technologies and instances). In multi-threaded configuration, instances of each technology are initialized in multiple parallel threads, one separate thread for each technology–model combination. This, in general, results in faster initialization of the whole system. On…