Skip to content Skip to main navigation Skip to footer

Search: supported versions

14 results

Understand SPE database scripts

This article explains details and usage of SQL database scripts stored in SPE installation directory in /data/database subdirectory. These scripts are intended for setup and maintenance of SPE database for supported database types, currently SQLite and MariaDB (from SPE 3.46) / MySQL (up to SPE 3.45). Script types For each database type, there are two directories with two types of…

Releases and Changelogs (Browser)

…now load transcription files that contain spaces in a word instead of ‘+’ signs Fixed: Wrong file suffix when saving transcription on Windows Phonexia Browser 3.59 (Public release) Phonexia Browser 3.59.0, BSAPI 3.59.0 (2023-06-20) New: Transcription can be saved in text formats supported by the transcription widget Improved: SPE Output widget is now visible by default and gets focused when…

Understand SPE database

…kept in the database at all. Supported databases SPE supports SQLite and MariaDB 10.x (SPE 3.46+) MySQL 5.x (SPE up to 3.45) database engine. The database engine is configured in phxspe.properties SPE configuration file – see the Database section of SPE configuration file article for more details. SQLite SQLite is the out-of-the-box SPE default database type. By its nature, SQLite…

Orbis 1.1.0 Release Notes

Front-end data automatic synchronization Items displayed in the GUI are now automatically synchronized with each other and with the backend. Lack of this automation was the one of the most annoying “feature” of the previous versions. You can also use the Refresh button to force synchronization of Recordings and Hits, e.g. during the upload process. Automatic files upload utility The…

Speech to Text (STT)

…1 CPU core (eg. standard 8 CPU core server (8 instances of STT) can process 1010 hours of audio in 1 day of computing time (flat load, depend on technology model)) Supported languages: List of supported languages. Acoustic models Acoustic model is created by training on training data. It includes characteristics of a voices of a set of speakers provided…

STT: Results explained

…machines” vs. “eighty machines”. The technology provides various output types which show only single or multiple transcription alternatives. For processing realtime streams, two result modes are supported – one mode provides complete transcription, second mode provides incremental results. Output types One-best output provides transcription containing only the highest-scoring words N-best output provides multiple alternatives for entire sentences or longer sequences…

STT: Language Model Customization tutorial

…as a source and creates new STT model with your customizations included as a target. To see results of the customizations, you need to use the new STT model for the transcription. Currently supported language model customizations are: adding new words and/or pronunciations This is intended for adding client-, domain- or product specific words like company names, product names, component…

LID: Terminology and adaptation

…with the Language Identification technology, or can be created using separate command line tool lppack (up to version 3.50) or using phxcmd lppack command (version 3.51 or newer). SPE does not support creation of languageprint archives, they are supported as input only. Language model – digital characteristics of a specific language Language model can be trained from languageprints (*.lp), language…

Understand SPE directory structure

…at https://download.phonexia.com/docs/spe/ INSTALL.html, INSTALL.txt Quick installation guide in HTML and TXT format UPDATE.txt Quick update instructions and SPE configuration file changes between SPE versions result_versions.txt List of REST API result versions Phonemes_for_STT_and_KWS.pdf List of STT/KWS phonemes, useful e.g. for keyword pronunciations definitions Technology_LID_L4_Language_tags.pdf List of LID L4 language tags and more details about languages they refer to EULA EULA directory…

Key Features (PSP)

…recording, Speech to Text (STT) – several languages supported – converts speech into plain text (words or sentences) automatically, Keyword Spotting (KWS) – several languages supported – detects specific keywords/phrases automatically without conversion to text, Gender identification (GID) – identifies whether a speaker is male or female, Age Estimation (AGE) – estimates the speaker´s age group, Voice Activity Detection (VAD)…

Support Lifecycle Policy (PSP)

General Lifecycle of Phonexia products is driven by Phonexia Product Support and Lifecycle Policy (valid from Q3/2019). Content of our support and software versioning approach is defined as well in this document. Specific versions of our products and languages are supported and maintained according to following tables. Phonexia Speech Engine Version Release Date End of Support Maintained Until Release type…

Release Notes

…the previous versions, we also distribute scripts that replace old binaries. Deprecated Features In accordance with our Phonexia Product Support Lifecycle Policy and the release of our new technology models, we announce the following features to be deprecated and end of life: TECHNOLOGY DEPRECATED MODEL STILL SUPPORTED LAST (PREVIOUS) STT / KWS AR_KW_4 FA_IR_4 IT_IT_3 NL_NL_4 PL_PL_4 RU_RU_4 RU_RU_FIN4 ZH1…

Releases and Changelogs (SPE)

…result versions (doc/result_versions.txt) [#4983] Added STT_TR model [#5038] Fixed stream returns error when no data was sent [#4151] Added KWS benchmark [#4862] Added PHNREC benchmark [#4533] Benchmark data are versioned [#4840] Added checking validity of keyword list [#4896] Added SID calibration set allows store metafiles [#4909] Added possibility to get calibration voice-print from calibration set [#4986] Update BSAPI to v3.13.0…