Skip to content Skip to main navigation Skip to footer

Search: support

64 results

Phonexia technology models EoL

Speech to Text (STT) and Keyword Spotting (KWS) models Languages supported by Speech To Text and Keyword Spotting Standard = Maintained until newer generation is released, or end of support is reached. Language generation is specified by the number in “Model name”. Language (region) Model name Released End of support Maintenance Arabic (Gulf, Kuwait) AR_KW_6 2022-04 8th gen. Standard Arabic…

Key Features (PSP)

…– detects the audio part that contains voice, Speech Quality Estimation (SQE) – measures the quality of speech, Phoneme Recognizer (PHNREC) – several languages supported – converts speech into phonemes (written characters representing pronunciation), Waveform Denoiser (DENOISER) – automatically improves the audibility of speech for human listeners. Supported Languages The LID, STT and KWS technologies support various languages as listed…

Phonexia End User License Agreement

…is allowed to provide Updates only to those Client’s customers, who have active annual support. Phonexia shall inform Client about Upgrades and offer to Client the licensing terms of Upgrades. 7. TECHNICAL SUPPORT. The Client having production license is entitled to receive from Phonexia or its partners relevant maintenance and support in mutually agreed extent. Client is allowed to request…

STT: What is Preferred Phrases feature and how to use it

…definitions. Starting with SPE 3.50, a word classes tokens can be used in preferred phrases. Classes represent parts of speech that occur in spoken text, like names or addresses. Classes supported by a certain STT model can be listed using GET /technologies/stt/classes call. Different STT models support different classes, or don’t support classes at all. Empty list means no classes…

FAQs (PSP)

…container Other audio formats must be converted to one of those natively supported using external tools. SPE server can be configured do this conversion automatically in background, see Understand SPE audio converter article. Great tools for converting other than supported formats to supported are FFmpeg (http://www.ffmpeg.org) or SoX (http://sox.sourceforge.net/). Both are multiplatform software tools for Microsoft Windows, Linux and Apple…

FAQs (Browser)

Phonexia Browser FAQ Q: What operating systems can your application run on? Our technologies are prepared to run on both Windows and Linux OS. For more details of the supported operating systems as well as recommended HW setup, see Recommended OS and HW in FAQ Phonexia Browser, FAQ Speech Platform Permalink Q: What are the supported audio formats? Formats supported

Understand SPE configuration

…for use with the SPE using 3rd party tools. The settings below determine which native codecs will be enabled and how the SPE should handle other audio formats. # Enable or disable native support for OPUS audio format (Default: true) # When disabled, audio file will be converted to WAV server.audio_formats.opus.enabled = true # Enable or disable native support for…

Phonexia Partner Program for Government Partners

…you with a respective partnership level. The program offers these two partnership levels: Silver partner—has access to pre-sales support, and the resources on Phonexia Partner Portal. Gold partner—has access advanced access to Phonexia Partner Portal, receives priority support from Phonexia, and has a dedicated Phonexia technical consultant. In addition to both partnership levels, you can always purchase a Starter Kit…

Understand SPE audio converter

SPE directly supports limited list of audio formats (codecs and containers), see Supported audio formats FAQ. Other audio formats must be converted using external tools. This conversion can be done either completely outside of SPE, before passing the files to SPE, or you can set up SPE to convert the files automatically. Then, depending on the capabilities of the conversion…

Keyword Spotting (KWS)

supports global keywordlist-wide threshold and also optional thresholds for individual keywords (if used, threshold set on keyword level overrides the global threshold). Speech Engine (SPE) supports only thresholds on keyword level. Setting the right threshold is essential for getting relevant results and generally greatly increases the accuracy of the technology. However, setting the right threshold can get tricky due to…

Licensing (technical details)

…FLS configuration For use cases where Site Disaster Recovery or High Availability(*)Service are required, Phonexia FLS can be used to support the following scenarios: Geographical High Availability(*) Fail-Over Cluster Scalable High Performance Cluster Application Cluster Note: Cluster configuration must be supported by the proper configuration of the hosting operating system and may require other tools by Phonexia or a 3rd…

Q: How to fix Error 1007: Unsupported audio format?

Phonexia Browser application may return error “1007: Unsupported audio format” during uploading audio file. Please consider if your audio files are in Q: What are the supported audio formats? . But if you need use as input audio recordings in other formats, you can configure SPE for audio automated conversion. As prerequisite install external tool for audio conversion. Recommend is…

Understand SPE executable files

This article explains the purpose and usage of executables distributed in SPE package: phxspe, phxclient, phxadmin and phxadmin2. phxspe phxspe is the main SPE executable, launching this file starts the SPE itself. Command line parameters supported by phxspe are listed below: (use appropriate OS-specific parameter separator, e.g. use –help in Linux and /help in Windows) Generic help – Show help…

LID: Terminology and adaptation

…with the Language Identification technology, or can be created using separate command line tool lppack (up to version 3.50) or using phxcmd lppack command (version 3.51 or newer). SPE does not support creation of languageprint archives, they are supported as input only. Language model – digital characteristics of a specific language Language model can be trained from languageprints (*.lp), language…

STT: Results explained

…machines” vs. “eighty machines”. The technology provides various output types which show only single or multiple transcription alternatives. For processing realtime streams, two result modes are supported – one mode provides complete transcription, second mode provides incremental results. Output types One-best output provides transcription containing only the highest-scoring words N-best output provides multiple alternatives for entire sentences or longer sequences…