Skip to content Skip to main navigation Skip to footer

Search: spe configuration

40 results

FAQs (Browser)

…debug output of SPE Linux: Run PhxBrowser software in terminal with command: ./PhxBrowser –-spe-debug –-spe-output PhxBrowser software will start with ” SPE output” tab which shows debug output of SPE in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: Why does the system show high score (>90%) even for non-targets? A: Threshold for score isn’t set up correctly. Adjust speaker

Understand SPE administration and backup

…the system: SPE database – the technology models, SPE user accounts, etc. are stored here SPE configuration file (usually /settings/phxspe.properties) technologies configuration file (usually /settings/ technologies.xml, or see phxspe.properties for details) licensing file (license.dat, usually stored along to phxspe.exe, or see phxspe.properties for details) Optimally, Partner should backup also the following entire SPE directory [optional], with all subdirectories (/bsapi/, etc.)…

Understand SPE processing queue

This article explains details about SPE asynchronous requests processing queue, the processing task lifecycle and its handling states. When SPE receives an asynchronous API request, a task is created and put in a queue. Size of the queue is defined by server.n_task_limit setting in SPE configuration file, default value is 1000 tasks. Tasks in queue are then handled according to…

Understand SPE home directory

SPE home directory is an analogy of user home directory in operating systems (e.g. /home/ in *nix, /Users/ in macOS or Windows, etc.) – it is the place where SPE stores data for users configured in SPE. Default SPE home directory location is {SPE_installation_directory}/home/. This location can be changed using server.user.home setting in phxspe.properties SPE configuration file. Changing the home…

Recommended OS and HW (PSP)

…tested by Phonexia on these systems. (**) Speech Platform components (e.g. Speech Engine) are known to be successfully deployed on these systems. Recommended hardware Required HW resources depend on set of technologies (i.e. SPE configuration) and the load that should be processed per day (or during a peak hour). Additionally, your own application built on top of SPE (including eventual…

What is User configuration file and how to use it

…working state. User configuration files provide a way to override processing parameters without modifying original BSAPI configuration files. WARNING: Inappropriate configuration changes may cause serious issues! Make sure you really know what you are doing. User configuration file is a plain text file with the same name as main configuration file, with additional extension .usr. For example: Main configuration file…

Phonexia Speech Engine

Phonexia Speech Engine (SPE) is main part of Phonexia Speech Platform. SPE is a server application for 64-bit Linux or Windows, providing REST API to entire portfolio of Phonexia speech technologies. SPE capabilities overview: Audio files and stream processing Audio files RTP / HTTP streams Speaker Identification (SID) ✓ ✓ Speech To Text (STT) ✓ ✓ Keyword Spotting (KWS) ✓…

Key Features (PSP)

…The Speech Platform includes the following technologies. Technologies are available in the Speech Engine component based on its particular configuration (Voice Biometrics, Transcription System, etc.) Speaker Identification (SID) – recognizes a speaker automatically based on their voice, Speaker Diarization (DIAR) – separates multiple speakers in mono audio automatically, Language Identification (LID) – detects the language or dialect spoken in a…

Understand SPE multithreaded technologies initialization

The server.technology_multithread_initialization setting in SPE configuration allows SPE to initialize instances of technologies during startup using multiple parallel threads. Default setting is OFF, i.e. instances of technologies are initialized using single thread, one-by-one. This allows easier tracking of eventual issues during SPE startup and better readability of technologies initialization log messages (only single initialization happens at a time). The downside…

Understand SPE workers configuration

Worker is a working thread performing the actual files- or realtime streams processing in Speech Engine. This article helps to understand the Speech Engine workers and provides information how to configure workers for optimal performance and server utilization. Starting from SPE 3.51, new defaults in settings/phxspe.properties make SPE to configure workers automatically according to local conditions (physical CPU cores, configured…

Understand SPE benchmark

SPE in the {SPE}/data/benchmark directory. The second option uses single audio file of your choice uploaded to SPE storage, specified by the path parameter. The set of audio files supplied with SPE contains recordings of various length (from 30 seconds to 5 minutes) and with various speech/non-speech ratio. This is to account for the fact that both the length of…

Measuring of a software processing speed – what is the FtRT (Faster than Real Time)

configurations. And vice versa – using the same metric, you can compare software from different vendors on the same HW configuration and for the same processing task. We recognize two measurable metrics: Audio based FtRT is calculated from actual audio in its original form, i.e. containing parts with spoken speech and also parts with silence or other non-speech signal (background…

Licensing (technical details)

…its startup and read its content. Alternatively, you can: specify the license file location in a configuration file (only for SPE and RLS) start the product executable with a license (SPE and RLS) or l parameter (command line), specifying license file location set the license file location in BS_LICENSE environment variable (only for command line) License types NET license NET…

STT: Language Model Customization tutorial

…model in Speech Engine using phxadmin 1) Placing the customized STT model in correct location In order to be recognized by Speech Engine, the customized STT model must be placed in a correct location. The best location in SPE 3.41 or newer is <SPE_directory>/shared/bsapi/stt (see Understand SPE directory structure article). In older versions it’s <SPE_directory>/bsapi/stt. The data and settings directories

Understand SPE technologies, instances and workers

Configuring Speech Engine to utilize effectively the full power of underlying hardware can get challenging – one can easily get lost in all the strange terms like technologies, instances, slots, or workers… This article should shed some light in it. Speech Engine is like post office Thinking about Speech Engine, there is actually a very nice analogy with post office…