Skip to content Skip to main navigation Skip to footer

Search: phonemes

11 results

Releases and Changelogs (Browser)

…validation of phonemes when creating new keyword pronunciation Added support for FLAC and OPUS files User can change own password Diarization result contains number of detected speakers Phonexia Browser v3.1.1, BSAPI 3.5.0 – Feb 29 2016 Fixed wrong encoding in SID models and keyword lists Fixed change REST working directory caused error on server Fixed detection state of audio converter…

Time Analysis Extraction (TAE)

Technology description Time Analysis Extraction (TAE) by Phonexia extracts base information from dialogue in a recording, providing essential knowledge about conversation flow. That makes easy to identify: long reaction time crosstalk responses of speakers in both channels speed of speech measured in phonemes per second Typical usage domain It is typically used in contact centers for indicating weak moments in…

Key Features (PSP)

…– detects the audio part that contains voice, Speech Quality Estimation (SQE) – measures the quality of speech, Phoneme Recognizer (PHNREC) – several languages supported – converts speech into phonemes (written characters representing pronunciation), Waveform Denoiser (DENOISER) – automatically improves the audibility of speech for human listeners. Supported Languages The LID, STT and KWS technologies support various languages as listed…

STT: Language Model Customization tutorial

…containing list if words to be added to the STT language model, one word per line. Note: LMC v3.30.0 (March 2020) or older requires the text file without Byte-Order-Mark (BOM) Each word can be optionally followed by its pronunciation, separated from the word by SPACE or TAB character. Pronunciations must use only phonemes allowed by the corresponding language – see…

STT: What is Preferred Phrases feature and how to use it

…German word like “grüßen” in Czech transcription – or even using different writing script like Cyrillic or Japanese Kana. Such words MUST be accompanied by a pronunciation definition, and that definition must use only phonemes supported by the STT model (i.e. the German word from the previous example would need to have pronunciation defined using Czech phonemes). See also more…

Keyword Spotting (KWS)

…one will be used in search) { “keywords”: [ { “name”: “contract”, “enabled”: true }, { “name”: “iPhone”, “enabled”: false, “threshold”: 0.6 }, { “name”: “MIT”, “enabled”: true, “threshold”: 0.4, “pronunciations”: [ { “phonemes“: “eh m ay t iy”, “enabled”: true }, { “phonemes“: “m ih t”, “enabled”: false } ] } ] } Languages Supported List of supported KWS…

Understand SPE directory structure

…at https://download.phonexia.com/docs/spe/ INSTALL.html, INSTALL.txt Quick installation guide in HTML and TXT format UPDATE.txt Quick update instructions and SPE configuration file changes between SPE versions result_versions.txt List of REST API result versions Phonemes_for_STT_and_KWS.pdf List of STT/KWS phonemes, useful e.g. for keyword pronunciations definitions Technology_LID_L4_Language_tags.pdf List of LID L4 language tags and more details about languages they refer to EULA EULA directory…

Phoneme Recogniser (PHNREC)

…Input: „Hi, this it Lewis.“ (WAV file containing speech) Output: sil hh ay dh ow s ih s l uw uw th sil (plain-text or xml/json output) Note: The outputs can contain the following special tokens: sil silent part (or no speech detected) The list of phonemes is available in the document phonemes_for_stt_and_kws.pdf (delivered as part of manuals in SPE…

STT: Adding words to language model on the fly

…different alphabet (e.g. German word like “grüßen” in Czech transcription) or different writing script (like Cyrillic or Japanese Kana). In that case, the word pronunciation MUST be explicitly specified. The pronunciation must use only phonemes supported by the STT language (use GET /technologies/stt/phonemes to get allowed phonemes list). Specifying a word using disallowed characters without also specifying pronunciation causes that…

Releases and Changelogs (SPE)

…Various log and error messages fixed Fixed: Acapela TTS connector puts incorrectly named item languages in output JSON Improved: Updated doc/Phonemes_for_STT_and_KWS.pdf document with phonemes for 6th generation of HR_HR, FR_FR, PS, AR_XL and SV_SE Speech Engine 3.40.2, DB v1700, BSAPI 3.40.2 (2021-04-30) Fixed: LMC does not work with CS_CZ_6 online (stream) configuration Fixed: Sample rate in Opus files is incorrect…