Search: settings

35 results

SID4 performance on Intel® Xeon® Platinum 8124M

…32 GB RAM, 30 GB SSD-based storage, 1000 I/O·s⁻¹ reserved per core. Benchmark data setup. Data set statistics: Number of files: 32 (300 seconds each). RAW recordings total length: 9600 seconds. Net speech total length: 4224.77 seconds. Data set contains 44% of speech signal and 56% of silence or technical signal. Statistics counted by Phonexia VAD 3.22.1, “vad_2.bs” settings (AKA strict VAD,…

STT: Configuring word detection parameters for stream transcription

…being a speech”, i.e. the end of a segment is detected less frequently. These values can be modified using a “user configuration file” – see the “What is a user configuration file” article for additional details. In short, create an appropriately named user config file with the modified parameters alongside the standard config file in the <SPE directory>/bsapi/stt/settings directory and restart SPE – it will automatically…

Key Features (PSP)

…audio conversion tools. Tested with sox or ffmpeg. For the configuration of this functionality, see [SPE]/settings/phxspe.properties. Note: You should be aware that audio format conversion (e.g., if the original audio format is highly compressed) can decrease the accuracy of speech technologies. Integration Possibilities: Phonexia Speech Platform can be integrated into a partner’s application using the Speech Engine component (REST API)…

Understand SPE processing queue

…can be handled simultaneously is defined by the server.n_workers setting (for audio file processing) and the server.n_realtime_workers setting (for realtime stream processing) in the SPE configuration file. By default, these are set automatically based on your hardware and software configuration – see the “How to configure Speech Engine workers” article. The picture below demonstrates the queue processing (for the sake of simplicity, technology assignments to…

Understand SPE audio converter

…phxspe.exe. FFmpeg: https://ffmpeg.org/download.html SoX: https://sourceforge.net/projects/sox/files/sox/ (FFmpeg is a somewhat ‘cleaner’ choice on Windows, since it is also available as a single-executable static build, unlike SoX, whose 10+ DLLs clutter up the SPE directory.) SPE configuration: As a next step, enable and set up the converter in the SPE configuration file (settings/phxspe.properties). Set audio_converter.enabled to true to enable…