Search: sdk user manual

63 results

Understand SPE processing priority

…enabled in SPE configuration file (enabled by default, see server.task_priorities_enable option) and default priority value set; the prioritize role enabled for the SPE user creating the processing task. If prioritization is enabled and a processing task is started by a user without the “prioritize” role, the task is started with the default priority. Task priority is defined by a number from (highest priority) to…

Understand SPE technologies configuration file

…the technology and model. However, this feature should be used only in special cases, e.g. if suggested by Phonexia experts. SPE users should normally not fiddle around with BSAPI configuration files… and if some technology config customization is needed, the user configuration file is the right method. Technology names supported in technologies configuration file: AGE (Age Estimation), DENOISER (Denoiser), DIAR…

Critical Issue

The system is inoperative, and this has a critical effect on the End User’s operations which cannot be solved by the End User’s or Partner’s IT/technical administrator. This condition is generally characterized by system instability and requires immediate correction. Phonexia’s software function is stopped due to its internal error, and it fails again on a different data input after Phonexia’s software…

Q: What are the supported audio formats?

…Linux and Apple OS X. Example of usage: FFmpeg: ffmpeg -i <source_audio_file_name> <output_audio_base_name>.wav. This command converts any supported format/codec audio file to normalized WAV audio format in 16-bit PCM little-endian as it is the default system. For more parameters please check FFmpeg manual pages. SoX: sox <source_audio_file_name> -b 16 <output_audio_base_name>.wav. Number of bits defined by -b parameter must be specified…
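
The two conversion commands quoted in this snippet can be wrapped in a small script. The following is a minimal sketch, not part of the Phonexia documentation: the function names and the example file names are hypothetical, and the argument lists mirror only the flags shown in the snippet (`-i` for FFmpeg, `-b 16` for SoX). It assumes `ffmpeg` and `sox` are installed and on the PATH when `convert` is actually called.

```python
import subprocess


def ffmpeg_command(source, output_base):
    """Build the argument list for: ffmpeg -i <source> <output_base>.wav"""
    return ["ffmpeg", "-i", source, output_base + ".wav"]


def sox_command(source, output_base, bits=16):
    """Build the argument list for: sox <source> -b <bits> <output_base>.wav

    The bit depth is always passed explicitly, as the snippet requires.
    """
    return ["sox", source, "-b", str(bits), output_base + ".wav"]


def convert(command):
    """Run a prepared command; raises CalledProcessError if the tool fails."""
    subprocess.run(command, check=True)
```

For example, `convert(ffmpeg_command("call.mp3", "call"))` would write `call.wav`. Passing the arguments as a list rather than a shell string avoids quoting problems with file names that contain spaces.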

STT: Language Model Customization tutorial

…of the customized STT model should go to this directory. So, either copy the customized STT model there manually, or let LMC place its output directly there: Version 3.41 or newer (in 3.55 or newer use phxcmd lmc): lmc … … … -out-model-dir <SPE_directory>/shared/bsapi/stt Version up to 3.40: lmc … … … -out-model-dir <SPE_directory>/bsapi/stt 2) Registering the customized STT…
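
The version-dependent output directory described in this snippet can be expressed as a small helper. This is a sketch under stated assumptions: the function name, the version-string format and the example install path `/opt/spe` are hypothetical, and only the two directory layouts quoted in the snippet are modelled. The elided `lmc` arguments and the `phxcmd lmc` invocation for 3.55 or newer are not covered.

```python
from pathlib import PurePosixPath


def stt_model_dir(spe_directory, spe_version):
    """Return the -out-model-dir value for a given SPE version string.

    3.41 or newer: <SPE_directory>/shared/bsapi/stt
    up to 3.40:    <SPE_directory>/bsapi/stt
    """
    # Compare only major.minor, as integers, so that "3.9" sorts below "3.41".
    major_minor = tuple(int(part) for part in spe_version.split(".")[:2])
    base = PurePosixPath(spe_directory)
    if major_minor >= (3, 41):
        return base / "shared" / "bsapi" / "stt"
    return base / "bsapi" / "stt"
```

Comparing integer tuples instead of raw strings matters here, because a plain string comparison would rank "3.9" above "3.41".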

Understand SPE workers configuration

…technologies) to ensure optimal performance and server utilization. These new defaults make the content of this article below obsolete, however, we keep it here for those who still want to fine-tune the configuration manually. The default workers configuration in settings/phxspe.properties is as shown below – 8 workers for files processing and 8 workers for realtime streams processing. These numbers mean…

STT: What is Preferred Phrases feature and how to use it

…NLP layer failed to detect the correct intent (i.e. where the intent was either identified incorrectly, or was not detected at all). Such utterances should be ideally manually analyzed, i.e. the transcription sent to the intent detector should be compared with the voicebot dialogue audio recording and the actual problematic part of the utterance (problematic phrase) should be identified. The…

Arabic dialects in Phonexia LID and STT

…Dialects are used for more personal communication: Facebook, Twitter, forums. There is not much material available, since most of the written texts are in MSA. Facebook, Twitter, forums can be used, but they need to be classified, corrected and unified manually – in Phonexia we do not do this. The above are the reasons for limited out-of-the-box support of Arabic…

FAQs (Browser)

…format in 16-bit PCM little-endian as it is the default system. For more parameters please check FFmpeg manual pages. SoX: sox <source_audio_file_name> -b 16 <output_audio_base_name>.wav. Number of bits defined by -b parameter must be specified. Q: How to fix Error 1007: Unsupported audio format? Phonexia Browser application may return error “1007: Unsupported…

Home Page

Speech Quality Estimation (SQE)

Phonexia’s Speech Quality Estimation quantifies the acoustic quality of recordings. This helps the user to quickly determine whether the acoustic quality of a recording is good enough for processing with other speech technologies. In response to an SQE request, the SPE returns a JSON/XML file. This file includes general information about the technology and statistics of all (one or two)…

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min): Installation; Technologies configuration; Server and database configuration; Users configuration; Files processing; Synchronous and asynchronous requests, results polling; Stream processing. https://youtu.be/4qrB-GfFdWY…

SID: TUTORIAL: Speaker Identification – How to Do a Basic Test

…Evaluation Package: The evaluation package (download page) consists of Phonexia Browser and Phonexia Speech Engine, including all necessary technologies. 2. Data: We prepared a dataset for your testing. The package contains data for both speaker model creation and speaker spotting. The testing process is the same for a dataset collected by the user. The dataset is available to download…