Search: configuration

40 results

Speech Engine update

…up to you, based on the actual content of the directory and your new package. NOTE: If you created any user configuration files, or made any changes to configuration files, make sure to keep the respective .bs.usr or .bs files! If you created any customized STT language models using LMC, the recommended practice is to recreate the STT model using the…

KWS: Results explained

…sheet demonstrating the sigmoid function: Score-to-Confidence. Score-to-confidence conversion tuning Starting with SPE/BSAPI 3.24 (October 2019), it is possible to modify the confidence calculation using the confidence_shift and confidence_sharpness values in the [score_calib:SKeywordScoreCalibrationI] section of the user configuration file. The user configuration file must have the same name as the original configuration file, with an added .usr extension, e.g. kws_en_us_5.bs.usr – see the What is a user configuration…

Understand SPE processing queue

…can be handled simultaneously is defined by the server.n_workers setting (audio file processing) and the server.n_realtime_workers setting (realtime stream processing) in the SPE configuration file. By default, this is set automatically, based on your hardware and software configuration – see the How to configure Speech Engine workers article. The picture below demonstrates the queue processing (for the sake of simplicity, technologies assignments to…

Understand SPE audio converter

…phxspe.exe. FFmpeg: https://ffmpeg.org/download.html SoX: https://sourceforge.net/projects/sox/files/sox/ (FFmpeg is a somewhat ‘cleaner’ choice on Windows, since it is also available as a single-executable static build, unlike SoX, whose 10+ DLLs clutter up the SPE directory.) SPE configuration As a next step, it is necessary to enable and set up the converter in the SPE configuration file (settings/phxspe.properties). Set audio_converter.enabled to true to enable…

Releases and Changelogs (Browser)

…wizard can’t create a report if server doesn’t support Diarization [G#21] Unified SID terminology Phonexia Browser v3.10.1, BSAPI 3.14.0 – Dec 6 2017 [#5068] Speed up preparing of calibration set [#5036] Use own configuration file for local SPE – original configuration file of SPE is not changed anymore [#4542] Better error message when calibration set contains invalid recordings [#5195] Added…

Understand SPE multithreaded technologies initialization

…of single-threaded initialization is that it may take longer to fully initialize the whole system, depending on the actual technologies configuration (number of initialized technologies and instances). In a multi-threaded configuration, instances of each technology are initialized in multiple parallel threads, one separate thread for each technology–model combination. This generally results in faster initialization of the whole system. On…

Understand SPE administration and backup

…where the temporary results are stored, see Understand SPE database for details. Backup System backup should be performed before any update or upgrade of the SPE. It is strongly recommended to back up mainly the following components of the system: SPE database – the technology models, SPE user accounts, etc. are stored here SPE configuration file (usually /settings/phxspe.properties) technologies configuration…

STT: Language Model Customization tutorial

…create a customized model using a “cumulative word list” – that’s where the word list backup copied to the target model directory comes in handy. Basic LMC usage is: phxcmd lmc -config {configuration_files} -add-words {wordlist_file} -model-suffix {model_name_suffix} -out-model-dir {directory_to_place_customized_output} Where: {configuration_files} is one or more space-separated path(s) to the *.bs config file(s) of the existing/source model to be customized {wordlist_file} is the word…

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min) Installation Technologies configuration Server and database configuration Users configuration Files processing Synchronous and asynchronous requests, results polling Stream processing https://youtu.be/4qrB-GfFdWY…

Measuring of a software processing speed – what is the FtRT (Faster than Real Time)

…configurations. And vice versa – using the same metric, you can compare software from different vendors on the same HW configuration and for the same processing task. We recognize two measurable metrics: Audio-based FtRT is calculated from the actual audio in its original form, i.e. containing parts with spoken speech and also parts with silence or other non-speech signal (background…

STT: Configuring word detection parameters for stream transcription

…being a speech“, i.e. the end of a segment is detected less frequently. These values can be modified using a “user configuration file” – see the What is a user configuration file article for additional details. In short, create an appropriately named user config file with the modified parameters alongside the standard config file in the <SPE directory>/bsapi/stt/settings directory and restart SPE – it will automatically…

Understand SPE technologies, instances and workers

…post office staffing and the Speech Engine workers configuration: Some post office workers are trained only for certain types of services (e.g. postal services), while others are trained for other services (e.g. financial services). Speech Engine has separate workers for file processing and for realtime stream processing. They cannot provide other types of services than those which they were trained…

Key Features (PSP)

Phonexia Speech Platform is provided as a set of several components: The Speech Engine (SPE) component is a REST API that includes technologies for the automated processing of audio files and audio streams. This component is usually provided in a specific configuration that meets the customer’s use case. The Phonexia Browser component is an expert-level application (on the top of…

SPE and Browser installation: standalone SPE

…package. In other words, merge the contents of the /bsapi/ directory with the /SPE/bsapi/ 4. Configure Speech Engine In order to configure the Speech Engine, we have to navigate to the /SPE/ directory and start the configuration utility called phxadmin. SPE on Windows In the /SPE/ directory, type cmd in the Address bar to open the Command line. In the command…

FAQs (PSP)

…for automated audio conversion. As a prerequisite, install an external tool for audio conversion. The recommended tool is the ffmpeg utility, which is powerful and well documented. Please find your distribution package at http://ffmpeg.org Then continue as described below: Using Phonexia Browser with embedded SPE Open the Browser configuration dialog by clicking the “Settings” button located in the tool ribbon. Select the “Speech Engine” tab and configure SPE as…