
Search: speech engine

62 results

SID: TUTORIAL: Speaker Identification – How to Do a Basic Test

…Evaluation Package The evaluation package (download page) consists of Phonexia Browser and Phonexia Speech Engine, including all necessary technologies. 2. Data We prepared a dataset for your testing. The package contains data for both speaker model creation and speaker spotting. The testing process is the same for a data set you collect yourself. The dataset is available to download…

SPE and Browser installation: embedded SPE

…PhxBrowser (on Linux) You should see the following information window. Click OK to start the configuration. (You can come back later to alter the configuration on the Settings -> Speech Engine tab.) In the Settings dialog, on the Speech Engine tab, enable all the technologies and hit Apply. Make sure to hit Apply to apply the changes and…

Understand SPE configuration

…database. Supported MySQL engines are based on original MySQL v5.6+ or MariaDB v10+. # Type of database # Supported are SQLite and MySql server.db.engine = SQLite The database is mainly used as the working cache – details about SPE user accounts are saved in the database as permanent objects, speech processing results are stored inside as dependent objects: the results are…
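As a rough illustration of the configuration fragment quoted in the snippet above, the sketch below reads `key = value` lines from a properties-style text and skips `#` comments. The key name `server.db.engine` is one reading of the flattened snippet, and `parse_properties` is a hypothetical helper, not part of SPE.

```python
def parse_properties(text):
    """Parse simple `key = value` lines, ignoring blanks and # comments."""
    settings = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # blank line or comment
        key, sep, value = line.partition("=")
        if sep:  # keep only lines that actually contain '='
            settings[key.strip()] = value.strip()
    return settings


# Assumed layout of the fragment quoted in the snippet (line breaks restored).
SAMPLE = """\
# Type of database
# Supported are SQLite and MySql
server.db.engine = SQLite
"""

if __name__ == "__main__":
    print(parse_properties(SAMPLE))
```

This handles only the plain `key = value` form shown in the snippet; a real properties file may use features (line continuations, escapes) that this sketch ignores.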

STT: What is Preferred Phrases feature and how to use it

Preferred phrases is a feature available for the 5th or newer generation of STT models and Speech Engine 3.32 or later. This article explains what the feature is good for and how it works internally, and gives some tips for practical implementation. What are preferred phrases In speech transcription tasks, there may be situations where similarly sounding words get confused,…

FAQs (Browser)

…Browser. Q: What languages do you offer? It depends on the technology. Phonexia Language Identification (LID) is pre-trained for 60+ languages. Phonexia Keyword Spotting (KWS) and Phonexia Speech Transcription (STT) are available for 20+ languages including English, French, German, Russian, Spanish and many more. Q: What…

Adding new language or technology model (Browser)

This article explains how to add a new technology model to the current Speech Engine (SPE) instance when using Phonexia Browser. Prerequisites To proceed, you need an existing installation of SPE. If you do not have one, check other articles, especially: Download Speech Platform Installation of Phonexia Browser Documentation of Phonexia Browser Installation package with new language models Note:…

Phonexia technologies introduction

Core objective: Basic understanding of Phonexia speech technologies and products; typical use cases, implementations and deployment topologies Duration: 35 minutes intended for idea makers and product designers assumes generic knowledge of Phonexia and speech technologies in general Content 00:00 Introduction What information can we get from speech? Overview of basic use cases Phonexia Speech Platform brief 4:21 Phonexia technologies overview…

Phoneme Recogniser (PHNREC)

…user can add to the language model of speech-to-text technology (better accuracy of KWS technology). Input audio file (format details – see Speech Engine documentation); stream not supported, technology model name (i.e. language code) to be used for phoneme transcription. Output In the process of transcribing speech to phonemes, the Phoneme Recogniser usually identifies individual speech segments and converts them to pronunciation. Example…

Understand SPE database

…kept in the database at all. Supported databases SPE supports the SQLite, MariaDB 10.x (SPE 3.46+) and MySQL 5.x (SPE up to 3.45) database engines. The database engine is configured in the phxspe.properties SPE configuration file – see the Database section of the SPE configuration file article for more details. SQLite SQLite is the out-of-the-box SPE default database type. By its nature, SQLite…

Keyword Spotting (KWS)

…a numerical expression of the probability that a word was said in a specified time frame. Keywords Keywords are not dependent on any dictionary. This makes it possible to define specific, foreign or even nonexistent words like product names. However, only allowed graphemes (symbols) from a supported list can be used to define keywords. This list can be easily obtained by Speech Engine and…

Understand SPE workers configuration

A worker is a working thread that performs the actual processing of files or realtime streams in Speech Engine. This article helps you understand the Speech Engine workers and explains how to configure workers for optimal performance and server utilization. Starting from SPE 3.51, new defaults in settings/phxspe.properties make SPE configure workers automatically according to local conditions (physical CPU cores, configured…

Q: I can’t manage to run Phonexia Browser software. I always get an error.

…happen if the initialization of the SPE engine takes too long. Phonexia Browser software treats it as an initialization failure and kills the server. You can fix this by doing the following: Increase the timeout in Settings > Speech Engine tab > First connection timeout; Use fewer instances of technologies, thus letting the Speech Engine start faster; Use smaller models of technologies…

Documentation (SPE)

Partners and customers are encouraged to read the Speech Engine (PhxSpe | PhxSpe.exe) software API reference and various manuals available as files in [SPE]/doc in the standard software package and installation. You can also find the REST API reference (Speech Engine) documentation online. You might be interested in reading the following information in the manual: REST API reference Structure of API queries Asynchronous request…

Understand SPE processing queue

…their priority (see Task prioritization section in REST API documentation and also Understanding SPE processing priority article); handling tasks with identical priority follows the FIFO principle. Tasks are picked up from the queue and handled by workers (which allocate instances of technologies for processing – see Speech Engine and technologies, instances, workers… explained for more details). Number of tasks which…
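The ordering described in the snippet above (tasks picked by priority, with FIFO handling among equal priorities) can be sketched with a heap and an insertion counter. This is a toy model, not SPE's actual queue: the class name `TaskQueue` is hypothetical, and it assumes that a larger number means a higher priority, which the snippet does not state.

```python
import heapq
import itertools


class TaskQueue:
    """Toy queue: highest priority first, FIFO among equal priorities."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # insertion order, used as tie-breaker

    def put(self, name, priority):
        # heapq is a min-heap, so the priority is negated to pop the largest
        # first (assumption: larger number = higher priority).
        heapq.heappush(self._heap, (-priority, next(self._counter), name))

    def get(self):
        _, _, name = heapq.heappop(self._heap)
        return name

    def __len__(self):
        return len(self._heap)


def drain(tasks):
    """Enqueue (name, priority) pairs and return names in processing order."""
    queue = TaskQueue()
    for name, priority in tasks:
        queue.put(name, priority)
    return [queue.get() for _ in range(len(queue))]


if __name__ == "__main__":
    print(drain([("a", 1), ("b", 5), ("c", 1), ("d", 5)]))
```

The counter is what guarantees FIFO behaviour: two tasks with the same priority compare by insertion order, so the heap never has to compare task names.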

Recommended OS and HW (PSP)

Recommended operating systems Windows 64-bit – Windows Server 2019 (*), latest version of Windows 10 (*) Linux 64-bit – latest version of RHEL/CentOS 7 (*) Compatible Operating Systems (**) : 64-bit Windows 8.1, Windows Server 2016, and newer 64-bit Linux with glibc >= 2.17, e.g. Ubuntu 20.04, Mint 19.3, RHEL/CentOS 8.2, … (*) Speech Platform components (e.g. Speech Engine) are…
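The Linux compatibility condition quoted above (glibc >= 2.17) can be checked roughly as follows. This is a sketch only: the helper names are hypothetical, and `platform.libc_ver()` from the Python standard library may return empty strings on systems where the C library cannot be detected.

```python
import itertools
import platform

MIN_GLIBC = (2, 17)  # minimum glibc version quoted in the snippet


def parse_version(text):
    """Turn a string such as '2.31' into a tuple of ints, e.g. (2, 31)."""
    parts = []
    for piece in text.split("."):
        digits = "".join(itertools.takewhile(str.isdigit, piece))
        if not digits:
            break  # stop at the first non-numeric component
        parts.append(int(digits))
    return tuple(parts)


def glibc_ok(version_text, minimum=MIN_GLIBC):
    """Return True if the given glibc version string meets the minimum."""
    version = parse_version(version_text)
    return bool(version) and version >= minimum


def check_local():
    """Check the local system; returns None when glibc is not detected."""
    lib, version = platform.libc_ver()
    if lib != "glibc":
        return None
    return glibc_ok(version)


if __name__ == "__main__":
    print(check_local())
```

Comparing integer tuples rather than strings matters here, because "2.9" sorts after "2.17" as text but is the older version.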