Search: supported versions

42 results

Release Notes

…the previous versions, we also distribute scripts that replace old binaries. Deprecated Features In accordance with our Phonexia Product Support Lifecycle Policy and the release of our new technology models, we announce the following features to be deprecated and end of life: TECHNOLOGY DEPRECATED MODEL STILL SUPPORTED LAST (PREVIOUS) STT / KWS AR_KW_4 FA_IR_4 IT_IT_3 NL_NL_4 PL_PL_4 RU_RU_4 RU_RU_FIN4 ZH1…

STT: What is Preferred Phrases feature and how to use it

…serve two purposes, i.e. the input can contain either or both of: a phrases part with the preferred phrases themselves (optionally also containing word-class tokens, if supported by the STT model); a dictionary part with words to be added to the STT language model. Preferred phrases is a list of words or phrases that the transcription task should prefer. STT…

Understand SPE directory structure

…database SQL scripts.

data
├── phxspe.properties.default
├── init.d-phxspe.template
├── phxspe.service.template
│
├── benchmark
└── database

phxspe.properties.default – Default phxspe.properties SPE configuration file
init.d-phxspe.template – Example SPE init.d script
phxspe.service.template – Example SPE systemd service unit file
benchmark – Default audio files for built-in benchmark functionality
database – Database SQL scripts for supported databases: SQLite, MariaDB and MySQL

The phxspe.properties.default file is used by the phxadmin tool…

Understand SPE executable files

This article explains the purpose and usage of the executables distributed in the SPE package: phxspe, phxclient, phxadmin and phxadmin2. phxspe: phxspe is the main SPE executable; launching this file starts the SPE itself. Command line parameters supported by phxspe are listed below (use the appropriate OS-specific parameter prefix, e.g. --help in Linux and /help in Windows): Generic help – Show help…

Keyword Spotting (KWS)

…one will be used in search) { "keywords": [ { "name": "contract", "enabled": true }, { "name": "iPhone", "enabled": false, "threshold": 0.6 }, { "name": "MIT", "enabled": true, "threshold": 0.4, "pronunciations": [ { "phonemes": "eh m ay t iy", "enabled": true }, { "phonemes": "m ih t", "enabled": false } ] } ] } Languages Supported List of supported KWS…
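The keyword list in the excerpt above is ordinary JSON, so it can be inspected with any JSON parser. The following is a minimal sketch that assumes only the fields visible in the excerpt (name, enabled, threshold, pronunciations). The helper name active_keywords is hypothetical and is not part of any Phonexia API. It treats a missing "enabled" flag as disabled, which is also an assumption.

```python
import json

# Keyword list copied from the excerpt above (straight quotes).
KEYWORD_LIST = """
{ "keywords": [
  { "name": "contract", "enabled": true },
  { "name": "iPhone", "enabled": false, "threshold": 0.6 },
  { "name": "MIT", "enabled": true, "threshold": 0.4,
    "pronunciations": [
      { "phonemes": "eh m ay t iy", "enabled": true },
      { "phonemes": "m ih t", "enabled": false }
    ] }
] }
"""


def active_keywords(doc):
    """Map each enabled keyword to its enabled custom pronunciations.

    Hypothetical helper: a keyword without explicit pronunciations
    maps to an empty list.
    """
    result = {}
    for kw in doc["keywords"]:
        if not kw.get("enabled", False):
            continue  # skip disabled keywords such as "iPhone"
        result[kw["name"]] = [
            p["phonemes"]
            for p in kw.get("pronunciations", [])
            if p.get("enabled", False)
        ]
    return result


keywords = active_keywords(json.loads(KEYWORD_LIST))
print(keywords)
```

Filtering on the client side like this is only a way to check what a list enables before it is submitted; how the engine itself interprets the flags is described in the full article.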

Understand SPE configuration

…server.port = 8600
# Server logging
# Level (trace, debug, information, warning, error, fatal)
server.logging.level = information
# Destination (console, file, database)
# Logging to database is supported only for MySQL
server.logging.destination = file
# Path to file where log is stored
server.logging.file = ${application.dir}log/phxspe.log

Did you notice the server.logging.file directive? It presents the first example of using variables in…
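To illustrate how a ${...} variable in such a file might resolve, here is a minimal sketch of a properties reader with variable substitution. It is not SPE's own parser: the function parse_properties, the substitution rules, and the install path /opt/phxspe/ are all assumptions made for the example.

```python
import re

# Configuration lines taken from the excerpt above.
SAMPLE = """\
server.port = 8600
# Server logging
server.logging.level = information
server.logging.destination = file
server.logging.file = ${application.dir}log/phxspe.log
"""


def parse_properties(text, variables):
    """Parse 'key = value' lines and expand ${name} references.

    Hypothetical helper: unknown variables are left untouched,
    and lines starting with '#' are treated as comments.
    """
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        expanded = re.sub(
            r"\$\{([^}]+)\}",
            lambda m: variables.get(m.group(1), m.group(0)),
            value.strip(),
        )
        props[key.strip()] = expanded
    return props


# "/opt/phxspe/" is an assumed install directory, used only for illustration.
props = parse_properties(SAMPLE, {"application.dir": "/opt/phxspe/"})
print(props["server.logging.file"])
```

Note that the variable value ends with a path separator in this sketch, because the directive in the excerpt appends log/phxspe.log directly after ${application.dir}.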

LID: Terminology and adaptation

…with the Language Identification technology, or can be created using the separate command-line tool lppack (up to version 3.50) or the phxcmd lppack command (version 3.51 or newer). SPE does not support creation of languageprint archives; they are supported as input only. Language model – digital characteristics of a specific language. A language model can be trained from languageprints (*.lp), language…

Phoneme Recogniser (PHNREC)

…output) Note: The outputs can contain the following special tokens: sil – silent part (or no speech detected). The list of phonemes is available in the document phonemes_for_stt_and_kws.pdf (delivered as part of the manuals in SPE, STT or KWS). Languages Supported: The list of supported languages in Phoneme Recogniser is the same as in Keyword Spotting. Link to API reference: https://download.phonexia.com/docs/spe/#%2Ftechnologies%2Fphnrec…

Speech to Text (STT)

…1 CPU core (e.g. a standard 8-core server running 8 instances of STT can process 1010 hours of audio in 1 day of computing time under flat load, depending on the technology model) Supported languages: List of supported languages. Acoustic models: An acoustic model is created by training on training data. It includes characteristics of the voices of a set of speakers provided…
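The throughput figure quoted above can be turned into a per-core speed factor with simple arithmetic. The sketch below uses only the numbers from the excerpt (8 cores, 1010 hours of audio per day). The function days_needed is a hypothetical helper that assumes linear scaling with core count, which real deployments may not achieve.

```python
HOURS_PER_DAY = 24

cores = 8                   # from the excerpt: 8 STT instances
audio_hours_per_day = 1010  # from the excerpt

# Audio hours processed by one core in one day of computing time.
per_core_hours = audio_hours_per_day / cores

# How many times faster than real time a single core runs.
speed_factor = per_core_hours / HOURS_PER_DAY


def days_needed(audio_hours, cores, speed_factor):
    """Estimate computing days, assuming throughput scales linearly with cores."""
    return audio_hours / (cores * speed_factor * HOURS_PER_DAY)


print(f"{per_core_hours:.2f} audio hours per core per day")
print(f"about {speed_factor:.2f}x faster than real time per core")
```

Because the excerpt notes that the figure depends on the technology model, the result is a rough planning number rather than a guarantee.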

STT: Results explained

…machines” vs. “eighty machines”. The technology provides various output types which show either a single transcription or multiple transcription alternatives. For processing realtime streams, two result modes are supported: one provides the complete transcription, the other provides incremental results. Output types: One-best output provides a transcription containing only the highest-scoring words. N-best output provides multiple alternatives for entire sentences or longer sequences…

Understand SPE connectors for external TTS

…expected to provide information about actual TTS service capabilities: list of voice names, supported languages and audio quality (sampling frequencies). This info is used during the SPE startup sequence: TTS connectors enabled in the SPE configuration file are started with the --info parameter and SPE reads the connector output. Connectors failing to provide the info won’t be available for use with SPE….

Q: Which authentication options are allowed by the server and how does it work?

A: The following options are supported: HTTP basic authorization – The client requests a session via the resource "POST /login", sending HTTP basic authorization in the request header. If the server responds with error 405, it does not support authorization by sessions and basic authorization must be used. Authorization by session – Authorization by session is done by adding the parameter "X-SessionID" into HTTP…
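The fallback described in this answer can be sketched as a small decision function. This is an illustration only: the function names are hypothetical, the way a session ID is obtained from the "/login" response is not shown in the excerpt and is therefore passed in as a parameter, and sending "X-SessionID" as an HTTP header is an assumption because the excerpt is cut off at that point.

```python
import base64


def basic_auth_header(user, password):
    """Build a standard HTTP Basic Authorization header."""
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return {"Authorization": f"Basic {token}"}


def headers_for_requests(login_status, session_id, user, password):
    """Choose the headers to send with subsequent requests.

    login_status: HTTP status returned by the login request.
    session_id:   session ID from the login response, or None
                  (how it is extracted is not covered by the excerpt).
    """
    if login_status == 405 or session_id is None:
        # Sessions are not supported: fall back to basic authorization.
        return basic_auth_header(user, password)
    # Assumption: the session ID is sent as an "X-SessionID" header.
    return {"X-SessionID": session_id}


print(headers_for_requests(405, None, "user", "pass"))
print(headers_for_requests(200, "abc123", "user", "pass"))
```

Keeping the decision separate from the HTTP client makes it easy to plug into whichever library issues the actual requests.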

Download Speech Platform

…only English models for Speech To Text and Keyword Spotting. Additional supported languages are available upon request. Speech Engine – technologies included: Speech To Text (STT) – model EN_US_6 (US English) Keyword Spotting (KWS) – model EN_US_6 (US English) Phoneme Recognizer (PHNREC) – model EN_US_6 (US English) Speaker Identification 4 (SID4) – model…

Licensing (technical details)

…FLS configuration For use cases where Site Disaster Recovery or High Availability(*) Service are required, Phonexia FLS can be used to support the following scenarios: Geographical High Availability(*), Fail-Over Cluster, Scalable High Performance Cluster, Application Cluster. Note: Cluster configuration must be supported by the proper configuration of the hosting operating system and may require other tools by Phonexia or a 3rd…