Skip to content Skip to main navigation Skip to footer

Search: mean

35 results

SPE and Browser installation: standalone SPE

In this post, we break down the complexities of the initial installation process of Phonexia Speech Engine (SPE), as a standalone installation. This means the SPE has to be started separately from the Phonexia Browser GUI (unlike in the embedded SPE mode, where Browser starts SPE as its background subprocess). By the end of the guide, you will be able…

FAQs (PSP)

…FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: What do LLR, LR and score mean? A: These abbreviations mean the following: LR – likelihood ratio, result from statistical test for two models comparison. It returns a number which expresses how many times more likely the data are under one model than the other. LR meets numbers in interval <0;+inf). LLR…

Phonexia Speech Engine

…SPE manages its own queue of incoming REST requests and serves them according to available capacity of current installation. This means that the application layer can request any number of queries and then just wait untill they are processed. Processing priority management To allow off-queue high-priority or low-priority processing, SPE also allows to set priority for individual REST requests. Basic…

Keyword Spotting (KWS)

…operators search for specific information in large call archives trigger alarms immediately (online) when an event occurs Technology The Keyword Spotting technology is based purely on acoustics – there is no dependency on any dictionary – which means that any words can be searched for, even words in foreign languages. Keyword Spotting works with a keyword list containing one or…

Language Identification (LID)

…LID score to percentage, use e score * 100 formula) LID adaptation (custom language packs) The scoring principle described above implies that score is distributed among all languages in a language pack. It means that every language has to score with non-zero value… i.e. that the scores may get diluted as they get spread among many languages. Additionally, if the…

Q: What to do with the ApplicationStartup: Unhandled exception: BsapiException error?

When running SPE, the following error occurs: [Error] ApplicationStartup: Unhandled exception: BsapiException: SWaveformSegmenterI(/mnt/phxspe/home/phx/storage/dfs/a1cabcf7-c761-49f1 -a9bc-0a8209a09fd9.opus Requested segment (78056, 102056) is out of waveform range (0,91840). A: It means that this opus file is created improperly and declares internally (in header) much more audio than available in real file. Please check your audio source/originator for proper functionality. Or use ffmpeg / sox…

Understand SPE configuration

…with the help of environment variables) you can setup a more efficient deployment method. Simply un-comment this directive and set it up correctly. Btw, did I mentioned before, that hash sign (#) at the beginning of lines means “this is a comment”? # Set path to bsapi directory # bsapi.path = ${application.dir}bsapi   ### # (c) 2013-2018 by Phonexia s.r.o….

Licensing (technical details)

…license is specified by the USE_SERVER line in license file. The only requirement of this license is Internet connectivity to the address specified on SERVER… line (by default it’s Phonexia license server at license.phonexia.com/lic). This means that by definition, the license will not work if there is Internet connectivity issue between the source machine and Phonexia servers, e.g. connection blocked…

Privacy Policy

…any PHONEXIA products and services after a revised Privacy Policy has been posted will mean that you have accepted the changes. 7. GOVERNING LAW This Privacy policy is governed by and interpreted in accordance with the laws of the Czech Republic. All claims and disputes arising from this Privacy policy will be brought solely to the exclusive jurisdiction of the…

Phonexia Ethical Code

…criminal misconduct and damage to our business relationships and reputation not only in the respective region but with a global impact. No Bribery Under anticorruption laws, it is illegal to offer or give gifts or entertainment if done with a corrupt intention. A corrupt intention means that the goal in giving the gift or entertainment is to influence someone to…

Releases and Changelogs (SPE)

…[G#157] Added endpoint for updating existing Audio Source Profile [G#160] SID4 calibration technology renamed: SID4CALIBSET -> SID4CALIB [G#161] Mean normalization support in Audio Source Profiles [G#169] Added cache for Audio Source Profiles, see server.audio_source_profiles_cache_size property [G#170] Added False Acceptance Calibration cache, see server.bsapi_comparator_fa_cache_size [G#149] Fixed: phxclient prints help if running without parameters [G#150] Fixed: UTF-8 symbols are not escaped in…

Measuring of a software processing speed – what is the FtRT (Faster than Real Time)

…in our example is 36 seconds. After stripping silence, it gets 14 seconds – this means that original audio contains 38% of net speech and 62% of silence. Phonexia speech technologies analyze the entire recording, but pick only the speech segments for AI processing, i.e. the absolute processing time will be practically the same… Creating voiceprint by Speaker Identification took:…

Arabic dialects in Phonexia LID and STT

…for each – North Levantine (apc) and South Levantine (ajp). Our models were trained using data from both varieties, therefore we followed RFC 5646, section 2.2.4 and created custom language code ar-XL, where the XL means “cross-Levantine” 😉 NOTE: To get the best STT results, use the model that corresponds to given dialect. The AR_XL_* model is best suitable for…

FAQs (Browser)

…score sharpness value to calibrate the recalculation. Please see Calibration in technology documentation. in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: What do LLR, LR and score mean? A: These abbreviations mean the following: LR – likelihood ratio, result from statistical test for two models comparison. It returns a number which expresses how many times more likely the data…