Search Results for: engine channels

Results 1 - 10 of 57 Page 1 of 6

SPE3 – Releases and Changelogs

Relevance: 100%      Posted on: 2021-02-26

Speech Engine (SPE) is developed as a RESTful API on top of Phonexia BSAPI. SPE was formerly known as BSAPI-rest (up to v2.x) or as Phonexia Server (up to v3.2.x).
Releases Changelogs
Speech Engine 3.38.0, DB v1700, BSAPI 3.38.0 (2021-02-25)
Non-public Feature Preview release
New: Training of LID Language Packs (no more need for command line tools... finally!)
New: LID Language Packs can store meta-files
New: New entity "LID Language Model" (equivalent of *.lpa LanguagePrint Archive)
Improved: Updated STT model RU_RU_A to version 4.6.0 (updated language model)
Removed: Support for RLS-enforced licences in command line applications
Removed: FeaturePasterRepeat warning…

Speech Engine and technologies, instances, workers… explained

Relevance: 34%      Posted on: 2020-11-19

Configuring Speech Engine to make effective use of the full power of the underlying hardware can be challenging – one can easily get lost in unfamiliar terms like technologies, instances, slots, or workers. This article should shed some light on them.
Speech Engine is like a post office
Thinking about Speech Engine, there is a useful analogy with a post office (or a bank branch): a post office is a place providing different kinds of services – one can go there to send letters, send or pick up packages, get a PO box, get some financial services, insurance, etc. Speech Engine has various…

Speech Engine configuration file explained

Relevance: 33%      Posted on: 2021-02-19

In this article we explain the details of the Speech Engine configuration file phxspe.properties, located in the settings subdirectory of the SPE installation location. Settings in this configuration file affect Speech Engine behavior and performance. The configuration file is usually created after SPE installation: on first use of phxadmin, a default configuration file phxspe.properties is created in the settings directory. The file is loaded during SPE startup, i.e. you need to restart SPE to apply any changes made in the file. If Speech Engine is used together with Phonexia Browser in so-called "embedded" mode (see details about "embedded SPE" mode in Browser…

Phonexia Speech Engine

Relevance: 22%      Posted on: 2020-11-19

About
Phonexia Speech Engine v3 (SPE3) is the main executive part of the Phonexia Speech Platform. It is a server application with a REST API through which you can access all available speech technologies. Both Linux 64-bit and Windows 64-bit operating systems are supported. Phonexia Speech Engine (SPE3) is an adjustable server component which houses all speech technologies. SPE3 provides a RESTful application programming interface to access the various technologies. Aside from the technologies themselves, SPE implements other functionality supporting work with speech technologies, recordings, streams, and others.
Features
The main purpose of SPE is to work as a processing unit for…

Speech Engine 3.35.0

Relevance: 20%      Posted on: 2020-10-01

Speech Engine 3.35.0, DB v1600, BSAPI 3.35.0 (2020-10-01)
New LID model L4 was promoted to production (LID BETA_L4 renamed to LID L4)
Added new language tag documentation (doc/Technology_LID_L4_Language_tags.pdf)
Updated STT model CS_CZ_5 to version 5.2.1 (fixes faulty transcription of numbers into Roman format)
Added configurable STT Confusion Network threshold (in technology configuration file)
Fixed: STT didn't work with 4th and older generation models after introduction of the Preferred phrases feature in SPE 3.32
Fixed: Update from SPE 3.30 causes errors in STT result cache
Fixed: Memory leak in logging system
Fixed: Typo in name of es-XA language in LID model L4 default language…

How to configure Speech Engine workers

Relevance: 20%      Posted on: 2020-03-28

A worker is a thread that performs the actual processing of files or realtime streams in Speech Engine. This article explains Speech Engine workers and how to configure them for optimal performance and server utilization. The default workers configuration in settings/phxspe.properties is 8 workers for file processing and 8 workers for realtime stream processing. These numbers are the maximum numbers of simultaneously running tasks.
# Multithread settings
server.n_workers = 8
server.n_realtime_workers = 8
Requests for additional file processing tasks are put in a queue and processed according to their order and priorities. Requests for…
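As a rough illustration of the two worker settings quoted in this snippet, the Python sketch below reads them from properties-style text. The parse_properties helper is hypothetical and is not part of Speech Engine; it assumes plain "key = value" lines with "#" comments, which is all the snippet shows, and the real phxspe.properties file may use syntax this sketch does not handle.

```python
def parse_properties(text):
    """Parse simple 'key = value' lines, skipping blank lines and '#' comments.

    Hypothetical helper for illustration only; not part of Speech Engine.
    """
    settings = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if not sep:
            continue  # ignore lines without a key/value separator
        settings[key.strip()] = value.strip()
    return settings


# The default worker settings as quoted in the article snippet.
SAMPLE = """\
# Multithread settings
server.n_workers = 8
server.n_realtime_workers = 8
"""

settings = parse_properties(SAMPLE)
file_workers = int(settings["server.n_workers"])
realtime_workers = int(settings["server.n_realtime_workers"])
print("file workers:", file_workers)
print("realtime workers:", realtime_workers)
```

Reading the values this way only inspects the configuration; any change to the file itself still requires the configuration to be reloaded by Speech Engine.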

LID adaptation

Relevance: 19%      Posted on: 2021-02-25

This article describes various ways of Language Identification adaptation:
Basic terminology
Languageprint (*.lp file) – numeric representation of the audio, extracted from an audio file for the purpose of language identification (similar to a “voiceprint”, but representing the spoken language, not the speaking person)
Languageprint archive (*.lpa file) – multiple languageprints combined into a single archive
Language model – digital characteristics of a specific language
A language model can be trained from languageprints (*.lp), languageprint archives (*.lpa), or a combination of both. A LID language model should not be confused with a LID technological model, like L4, L3, XL3, etc. Technological model sits at…

Speech Engine 3.35.1

Relevance: 18%      Posted on: 2020-10-13

Speech Engine 3.35.1, DB v1600, BSAPI 3.35.1 (2020-10-13)
Fixed: Missing input stream task name in log messages
Fixed: Missing arguments in "word not found" error messages (when using preferred phrases)

Speech Engine 3.35.2

Relevance: 18%      Posted on: 2020-10-22

Speech Engine 3.35.2, DB v1600, BSAPI 3.35.2 (2020-10-22)
Fixed: Detection of certain USB license tokens

Speech Engine 3.35.3

Relevance: 18%      Posted on: 2020-11-24

Speech Engine 3.35.3, DB v1601, BSAPI 3.35.3 (2020-11-24)
New: Internal support for SAMPA phonetic alphabet
Updated STT model RU_RU_A to version 4.5.0 (updated language model)
Updated STT/KWS/PHNREC model AR_XL to version 5.2.0 (updated language model, changed phonemes notation to X-SAMPA)
Fixed: Cannot create new output stream due to hanging unfinished tasks
Fixed: Task is not removed from pool when result is delivered via Webhook
Fixed: Some log messages contain format placeholder instead of numbers
Fixed: Missing <silence/> label in STT confusion network output
Fixed: STT confusion network contains <silence/> tags with confidence greater than 1.0
Fixed: Diarization crashes during processing
Fixed: Diarization XL4 crashes on…