
Search: engine channels

64 results

STT: Configuring word detection parameters for stream transcription

One of the improvements implemented since Speech Engine 3.24 is neural-network-based VAD, used for word and segment detection. This article describes the segmenter configuration parameters and how they affect the real-time stream STT results. The default segmenter parameters are as shown below: [vad.online_segmenter:SOnlineVoiceActivitySegmenterI] backward_extensions_length_ms=150 forward_extensions_length_ms=750 speech_threshold=0.5 Backward and forward extensions are intervals in milliseconds, which extend the part…
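The snippet is cut off before it explains exactly what the extensions apply to, so the following is only a minimal sketch under stated assumptions: that the VAD produces one speech probability per frame, that a frame counts as speech when its probability reaches `speech_threshold`, and that each detected speech region is padded backward and forward by the configured intervals. The function name, the frame length and the merging of overlapping regions are hypothetical, not taken from Speech Engine.

```python
from dataclasses import dataclass


@dataclass
class SegmenterParams:
    # Defaults copied from the configuration shown in the snippet.
    backward_extensions_length_ms: int = 150
    forward_extensions_length_ms: int = 750
    speech_threshold: float = 0.5


def segments_from_probs(probs, frame_ms, params):
    """Turn per-frame speech probabilities into padded (start_ms, end_ms) segments.

    Assumption: padding is applied around every speech frame and
    overlapping padded regions are merged into one segment.
    """
    segments = []
    for i, p in enumerate(probs):
        if p < params.speech_threshold:
            continue  # frame treated as non-speech
        start = max(0, i * frame_ms - params.backward_extensions_length_ms)
        end = (i + 1) * frame_ms + params.forward_extensions_length_ms
        if segments and start <= segments[-1][1]:
            # Overlaps the previous segment: extend it instead of starting a new one.
            segments[-1] = (segments[-1][0], max(segments[-1][1], end))
        else:
            segments.append((start, end))
    return segments


if __name__ == "__main__":
    probs = [0.1, 0.1, 0.9, 0.9] + [0.1] * 16 + [0.7]
    print(segments_from_probs(probs, frame_ms=100, params=SegmenterParams()))
```

Under these assumptions, a larger forward extension bridges longer pauses between words, so fewer and longer segments come out; a higher threshold makes fewer frames count as speech.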

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min). Topics: installation; technologies configuration; server and database configuration; users configuration; files processing; synchronous and asynchronous requests, results polling; stream processing. https://youtu.be/4qrB-GfFdWY…

Q: How to fix Error 1007: Unsupported audio format?

…ffmpeg utility, powerful and well documented. Please find your distribution package at http://ffmpeg.org Then continue as described below: Using Phonexia Browser with embedded SPE Open the Browser configuration dialog by clicking the “Settings” button located in the tool ribbon. Select the “Speech Engine” tab and configure SPE as described in the documentation. Don’t forget to select the “Enable audio converter” checkbox. Using SPE as service/daemon…
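As an illustration of converting a file with ffmpeg outside of SPE, the sketch below builds and runs an ffmpeg command from Python. The target format (mono, 8 kHz, 16-bit PCM WAV) is an assumption chosen for the example, not a format confirmed by this snippet; check the Supported audio formats FAQ for the real requirements. The helper names are hypothetical; the ffmpeg options used (`-y`, `-i`, `-ac`, `-ar`, `-c:a pcm_s16le`) are standard.

```python
import subprocess


def build_ffmpeg_command(src, dst, sample_rate_hz=8000, channels=1):
    """Build an ffmpeg command that converts src to 16-bit PCM WAV.

    Assumption: sample rate and channel count are example values only.
    """
    return [
        "ffmpeg",
        "-y",                       # overwrite the output file if it exists
        "-i", src,                  # input file in any format ffmpeg can decode
        "-ac", str(channels),       # number of output channels
        "-ar", str(sample_rate_hz), # output sample rate in Hz
        "-c:a", "pcm_s16le",        # 16-bit little-endian PCM audio
        dst,
    ]


def convert(src, dst):
    """Run the conversion; raises CalledProcessError if ffmpeg fails."""
    subprocess.run(build_ffmpeg_command(src, dst), check=True)


if __name__ == "__main__":
    print(" ".join(build_ffmpeg_command("call.mp3", "call.wav")))
```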

Manuals


Waveform Denoiser (DENOISER)

…Speech Engine documentation); stream not supported, technology model name to be used for processing. Output: audio file (WAV or RAW), together with an xml/json report (in SPE only). Fig.: Comparison of the original recording (david_noisy.wav, top half of the image) and the same recording processed by Denoiser (david_denoised.wav, bottom half of the image). Typical Questions Q: What do you recommend for deploying this technology?…

Understand SPE database scripts

This article explains the details and usage of the SQL database scripts stored in the /data/database subdirectory of the SPE installation directory. These scripts are intended for setup and maintenance of the SPE database for the supported database types, currently SQLite and MariaDB (from SPE 3.46) / MySQL (up to SPE 3.45). Script types For each database type, there are two directories with two types of…

Phoneme Recogniser (PHNREC)

…user can add to the language model of speech-to-text technology (better accuracy of KWS technology). Input audio file (format details – see Speech Engine documentation); stream not supported, technology model name (i.e. language code) to be used for phoneme transcription. Output In the process of transcribing speech to phonemes, the Phoneme Recogniser usually identifies individual speech segments and converts them to pronunciation. Example…

Terms of Service

…modify, reuse, disassemble, decompile or reverse engineer any aspect of the Services, or transmit the PHONEXIA software used for the provision of the Services to any third-party unless PHONEXIA has given written permission to do so. Additionally, PHONEXIA reserves the right to prevent Member from using non-PHONEXIA software to access the Services. 2.3. Transfer of License. Member may not transfer,…

Understand SPE audio converter

SPE directly supports a limited list of audio formats (codecs and containers), see Supported audio formats FAQ. Other audio formats must be converted using external tools. This conversion can be done either completely outside of SPE, before passing the files to SPE, or you can set up SPE to convert the files automatically. Then, depending on the capabilities of the conversion…

Time Analysis Extraction (TAE)

…dialogue. This can be used to improve calls between operators and callers or to indicate potential stress points in phone calls (for example, a change of speech speed during the conversation). Input TAE can process both audio files and streams (for format details see Speech Engine documentation). By its nature, TAE is usable mainly on two-channel phone call recordings, where…

What is User configuration file and how to use it

…name | User configuration file name
stt_cs_cz_5_online.bs | stt_cs_cz_5_online.bs.usr
kws_nl_nl_5.bs | kws_nl_nl_5.bs.usr
phnrec_pashto.bs | phnrec_pashto.bs.usr
vpextract4_xl4.bs | vpextract4_xl4.bs.usr
During technology initialization (e.g. during Speech Engine startup), the initialization routine checks for the existence of such a user config file. If found, it’s automatically loaded after loading the main configuration file, and the settings from the user config are automatically applied over the settings from the main configuration file. Usage…
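The "user settings applied over main settings" behaviour can be sketched with Python's standard `configparser`. This is only an illustration, assuming the configuration files are INI-like (as the `[section]` and `key=value` excerpt in the first search result suggests); it is not Speech Engine's actual loader, and the function name and the override value 500 are hypothetical.

```python
import configparser

# Hypothetical contents of a main config and of its ".usr" companion file.
MAIN_CONFIG = """
[vad.online_segmenter:SOnlineVoiceActivitySegmenterI]
backward_extensions_length_ms=150
forward_extensions_length_ms=750
speech_threshold=0.5
"""

USER_CONFIG = """
[vad.online_segmenter:SOnlineVoiceActivitySegmenterI]
forward_extensions_length_ms=500
"""


def load_with_user_overrides(main_text, user_text=None):
    """Load the main config, then apply user settings over it if present."""
    cfg = configparser.ConfigParser()
    cfg.read_string(main_text)
    if user_text is not None:
        # A later read overrides keys already loaded and keeps the rest.
        cfg.read_string(user_text)
    return cfg


if __name__ == "__main__":
    cfg = load_with_user_overrides(MAIN_CONFIG, USER_CONFIG)
    section = cfg["vad.online_segmenter:SOnlineVoiceActivitySegmenterI"]
    for key, value in section.items():
        print(f"{key}={value}")
```

Only the keys present in the user file change; everything else keeps the value from the main configuration file.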

Understand SPE benchmark

The SPE benchmark feature is a great tool for quick and simple evaluation of processing speed directly on your hardware and using your audio files – simply call the …/benchmark endpoint corresponding to the technology you want to benchmark and wait for the result. The benchmark result summarizes the length of the processed speech, the processing time and the resulting Faster-than-Realtime…
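How the three reported quantities relate can be shown with a small calculation. The sketch assumes the usual definition of a faster-than-realtime factor, namely processed speech length divided by processing time; the snippet is truncated before it confirms the exact formula, and the function and parameter names are hypothetical rather than fields of the benchmark response.

```python
def faster_than_realtime(speech_length_s, processing_time_s):
    """Return how many seconds of speech are processed per second of compute.

    Assumption: FtRT = processed speech length / processing time.
    """
    if processing_time_s <= 0:
        raise ValueError("processing_time_s must be positive")
    return speech_length_s / processing_time_s


if __name__ == "__main__":
    # Example values: 600 s of speech processed in 30 s of wall-clock time.
    print(faster_than_realtime(600.0, 30.0))
```

A value above 1 means processing runs faster than the audio plays; a value below 1 means it cannot keep up with real time.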

Q: What types of integration do you offer?

Phonexia Speech Engine with its technologies is distributed as a REST API interface. For evaluation and testing purposes, a graphical user interface (GUI) called Phonexia Browser is provided. Upon request, technologies can also be provided in the form of a command-line interface (CLI). REST API documentation: https://download.phonexia.com/docs/spe/…

Phonexia Partner Program for Government Partners

…and cooperation history as part of the decision to grant/extend Gold partnership. Contact your Phonexia representative for more details. How long do I keep my Gold partner status? Phonexia understands that the sales cycle in the government segment can be long, therefore, your partner status is reviewed once a year in July. How can I test the Phonexia Speech Engine

Understand SPE administration and backup

Each Partner has their own administration and backup policy. Here, we highlight the most important SPE components to be administered and backed up. Administration It is strongly recommended to describe your own administration approach with the following components: SPE users (accounts) – Partner should maintain a list of SPE users (accounts). There should be only a few persons with the “admin” role…