
Search: speech engine

61 results

Understand SPE database scripts

This article explains the details and usage of the SQL database scripts stored in the /data/database subdirectory of the SPE installation directory. These scripts are intended for setup and maintenance of the SPE database for the supported database types, currently SQLite and MariaDB (from SPE 3.46) / MySQL (up to SPE 3.45). Script types For each database type, there are two directories with two types of…

Understand SPE audio converter

SPE directly supports a limited list of audio formats (codecs and containers); see the Supported audio formats FAQ. Other audio formats must be converted using external tools. This conversion can be done either completely outside of SPE, before passing the files to SPE, or you can set up SPE to convert the files automatically. Then, depending on the capabilities of the conversion…

Understand SPE administration and backup

Each Partner has their own administration and backup policy. Here, we highlight the most important SPE components to be administered and backed up. Administration It is strongly recommended to describe your own administration approach with the following components: SPE users (accounts) – Partner should maintain a list of SPE users (accounts). There should be only a few persons with the “admin” role…

Understand SPE user accounts

SPE has a simple built-in system of user accounts and user roles. This allows for flexible usage of SPE in your projects – you can use it e.g. for different individual applications (each application uses its own SPE user), or simply for different user roles within a single application (standard users, administrators). Each user account has the following attributes defined: login…

Understand SPE multithreaded technologies initialization

The server.technology_multithread_initialization setting in the SPE configuration allows SPE to initialize instances of technologies during startup using multiple parallel threads. The default setting is OFF, i.e. instances of technologies are initialized using a single thread, one by one. This makes it easier to track potential issues during SPE startup and improves the readability of technology initialization log messages (only a single initialization happens at a time). The downside…
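
The difference between the two modes can be illustrated with a minimal Python sketch. This is not SPE code: the function name initialize_all, the init_one callback, and the max_workers value are hypothetical and serve only to show one-by-one versus parallel initialization.

```python
from concurrent.futures import ThreadPoolExecutor


def initialize_all(names, init_one, multithread=False, max_workers=4):
    """Initialize every named instance and return the results in input order.

    Hypothetical illustration only; `init_one` stands in for whatever
    work a real initialization would do.
    """
    if not multithread:
        # OFF: one instance at a time, so log messages never interleave.
        return [init_one(name) for name in names]
    # ON: several instances are initialized concurrently.
    # Executor.map still yields the results in the order of `names`.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(init_one, names))
```

Both modes produce the same results; only the order in which the work happens (and therefore the order of any log output) differs.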

Understand SPE processing priority

SPE has a simple built-in system of task prioritization. This allows for flexible management of the processing queue, which is especially useful in mass audio processing. For example, if there is a long queue of files waiting to be processed, and one needs to urgently process another batch of files, these files can be sent for processing using higher priority… and…
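
The general idea of a prioritized processing queue can be sketched in Python as follows. This is an illustration, not SPE's implementation: the TaskQueue class, its method names, and the convention that a larger number means a higher priority are all assumptions.

```python
import heapq
import itertools


class TaskQueue:
    """Hypothetical queue: higher priority first, first-in first-out within a priority."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # tie-breaker that preserves submission order

    def submit(self, name, priority=0):
        # heapq is a min-heap, so the priority is negated to pop the largest first.
        heapq.heappush(self._heap, (-priority, next(self._counter), name))

    def next_task(self):
        # Return the name of the most urgent waiting task.
        return heapq.heappop(self._heap)[2]

    def __len__(self):
        return len(self._heap)
```

With this sketch, a file submitted later with priority=5 is returned by next_task() before files that were already waiting with the default priority.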

Understand SPE metafiles

Certain SPE entities – SID Speaker models, SID Audio source profiles, LID Language packs – can have additional information associated with them in the form of “metafiles”. This article explains the intended usage of metafiles. In general, SPE is intended as an under-the-hood engine, focusing purely on the speech-related audio processing. Any additional functionality should be done on the application layer,…

Video – Getting started with SPE

MODULE 1: Getting started with Speech Engine (19 min): Installation; Technologies configuration; Server and database configuration; Users configuration; Files processing; Synchronous and asynchronous requests, results polling; Stream processing. https://youtu.be/4qrB-GfFdWY…

What is User configuration file and how to use it

…name / User configuration file name: stt_cs_cz_5_online.bs / stt_cs_cz_5_online.bs.usr; kws_nl_nl_5.bs / kws_nl_nl_5.bs.usr; phnrec_pashto.bs / phnrec_pashto.bs.usr; vpextract4_xl4.bs / vpextract4_xl4.bs.usr. During technology initialization (e.g. during Speech Engine startup), the initialization routine checks for the existence of such a user config file. If found, it is automatically loaded after the main configuration file, and the settings from the user config are automatically applied over the settings from the main configuration file. Usage…
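
The naming pattern and the "applied over" behaviour can be sketched in Python. This is an illustration under assumptions: the helper names are hypothetical, settings are modelled as a flat dictionary, and the real configuration file format is not shown in this excerpt.

```python
def user_config_name(main_config_name: str) -> str:
    """Derive the user config file name by appending '.usr', as in the examples listed."""
    return main_config_name + ".usr"


def apply_user_settings(main_settings: dict, user_settings: dict) -> dict:
    """Return main settings with user settings applied on top.

    Assumption: settings are flat key/value pairs; a key present in both
    takes its value from the user config.
    """
    merged = dict(main_settings)   # copy, so the main settings stay untouched
    merged.update(user_settings)   # user values override main values
    return merged
```

For example, user_config_name("kws_nl_nl_5.bs") gives "kws_nl_nl_5.bs.usr", and a key defined in both dictionaries ends up with the user value.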

Designing and Developing Application

…measure evaluation results and how to process calibration? Etc. We encourage Partner to also become familiar with the following points: Phonexia Speech Engine features and list of the technologies; Best practices (typical processing flows and architecture from our previous projects); Database schema; Other Phonexia components and tools as example applications that can give you inspiration; Licensing possibilities of the Phonexia…

Phonexia End User License Agreement

…limited to the intellectual property rights of the Client or third parties in such Provided demo data. 5. LIMITATIONS ON REVERSE ENGINEERING, DECOMPILATION, AND DISASSEMBLY. Neither the Client nor any person affiliated with it may reverse engineer, decompile, or disassemble the Software, except and only to the extent that such activity is expressly permitted by Phonexia or applicable law. 6….

Understand SPE connectors for external TTS

SPE can be easily connected with external Text-To-Speech (TTS) services using a simple connector system. This article describes the principles and how-tos; by following these instructions you can create your own connector, which allows you to use a custom 3rd party TTS service via SPE. The TTS connector should be a command line (CLI) application or script, which communicates with the external TTS service…

Understand SPE directory structure

…└── technologies │ └── tts ├── home │ └── admin │ ├── data │ └── storage ├── lib ├── log ├── settings └── shared bsapi The bsapi directory contains the BSAPI core subsystem, i.e. all files and data of the speech technologies themselves. This directory contains a separate subdirectory for each technology included in the distribution package. The number of subdirectories depends…

Q: How to fix Error 1007: Unsupported audio format?

…ffmpeg utility, powerful and well documented. Please find your distribution package at http://ffmpeg.org Then continue as described below: Using Phonexia Browser with embedded SPE Open the Browser configuration dialog by clicking the “Settings” button located in the tool ribbon. Select the “Speech Engine” tab and configure SPE as described in the documentation. Don’t forget to select the “Enable audio converter” checkbox. Using SPE as service/daemon…

Q: What types of integration do you offer?

Phonexia Speech Engine with its technologies is distributed as a REST API interface. For evaluation and testing purposes, a graphical user interface (GUI) called Phonexia Browser is provided. Upon request, technologies can also be provided in the form of command line (CLI) REST API documentation https://download.phonexia.com/docs/spe/…